Managed Service for Greenplum® API, gRPC: PXFDatasourceService.Create
Creates a PXF data source.
gRPC request
rpc Create (CreatePXFDatasourceRequest) returns (operation.Operation)
CreatePXFDatasourceRequest
{
"cluster_id": "string",
"datasource": {
"name": "string",
// Includes only one of the fields `s3`, `jdbc`, `hdfs`, `hive`
"s3": {
"access_key": "string",
"secret_key": "string",
"fast_upload": "google.protobuf.BoolValue",
"endpoint": "string"
},
"jdbc": {
"driver": "string",
"url": "string",
"user": "string",
"password": "string",
"statement_batch_size": "google.protobuf.Int64Value",
"statement_fetch_size": "google.protobuf.Int64Value",
"statement_query_timeout": "google.protobuf.Int64Value",
"pool_enabled": "google.protobuf.BoolValue",
"pool_maximum_size": "google.protobuf.Int64Value",
"pool_connection_timeout": "google.protobuf.Int64Value",
"pool_idle_timeout": "google.protobuf.Int64Value",
"pool_minimum_idle": "google.protobuf.Int64Value"
},
"hdfs": {
"core": {
"default_fs": "string",
"security_auth_to_local": "string"
},
"kerberos": {
"enable": "google.protobuf.BoolValue",
"primary": "string",
"realm": "string",
"kdc_servers": [
"string"
],
"admin_server": "string",
"default_domain": "string",
"keytab_base64": "string"
},
"user_impersonation": "google.protobuf.BoolValue",
"username": "string",
"sasl_connection_retries": "google.protobuf.Int64Value",
"zk_hosts": [
"string"
],
"dfs": {
"ha_automatic_failover_enabled": "google.protobuf.BoolValue",
"block_access_token_enabled": "google.protobuf.BoolValue",
"use_datanode_hostname": "google.protobuf.BoolValue",
"namenodes": "map<string, PXFDatasourceHDFSDfsNamenode>",
"nameservices": "string"
},
"yarn": {
"resourcemanager_ha_enabled": "google.protobuf.BoolValue",
"resourcemanager_ha_auto_failover_enabled": "google.protobuf.BoolValue",
"resourcemanager_ha_auto_failover_embedded": "google.protobuf.BoolValue",
"resourcemanager_cluster_id": "string",
"ha_rm": "map<string, PXFDatasourceHDFSYarnHaRm>"
}
},
"hive": {
"core": {
"default_fs": "string",
"security_auth_to_local": "string"
},
"kerberos": {
"enable": "google.protobuf.BoolValue",
"primary": "string",
"realm": "string",
"kdc_servers": [
"string"
],
"admin_server": "string",
"default_domain": "string",
"keytab_base64": "string"
},
"user_impersonation": "google.protobuf.BoolValue",
"username": "string",
"sasl_connection_retries": "google.protobuf.Int64Value",
"zk_hosts": [
"string"
],
"ppd": "google.protobuf.BoolValue",
"metastore_uris": [
"string"
],
"metastore_kerberos_principal": "string",
"auth_kerberos_principal": "string"
}
// end of the list of possible fields
}
}
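The request shape above can be assembled as a plain dictionary before it is converted into the protobuf message. The sketch below is only an illustration under stated assumptions: `build_create_request` and `SOURCE_KINDS` are hypothetical names, not part of the service's client library, and the checks simply mirror the "only one of" note in the schema and the length limits listed in the field tables on this page.

```python
# Hypothetical helper: mirrors the request schema on this page, not an official client.
SOURCE_KINDS = ("s3", "jdbc", "hdfs", "hive")


def build_create_request(cluster_id: str, name: str, **settings: dict) -> dict:
    """Return a CreatePXFDatasourceRequest-shaped dict with exactly one source kind."""
    unknown = set(settings) - set(SOURCE_KINDS)
    if unknown:
        raise ValueError(f"unknown data source kinds: {sorted(unknown)}")
    if len(settings) != 1:
        raise ValueError("set exactly one of: " + ", ".join(SOURCE_KINDS))
    if not 3 <= len(name) <= 200:
        raise ValueError("name must be 3-200 characters long")
    if not cluster_id or len(cluster_id) > 50:
        raise ValueError("cluster_id is required and limited to 50 characters")
    return {"cluster_id": cluster_id, "datasource": {"name": name, **settings}}


# All values below are placeholders.
request = build_create_request(
    "my-cluster-id",
    "s3-archive",
    s3={"access_key": "example-access-key", "secret_key": "example-secret"},
)
```

Passing two kinds at once, for example both `s3={...}` and `jdbc={...}`, raises `ValueError` before anything is sent to the service.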
| Field | Description |
| --- | --- |
| cluster_id | **string** Required field. The maximum string length in characters is 50. |
| datasource | **PXFDatasource** |
PXFDatasource
| Field | Description |
| --- | --- |
| name | **string** Required field. Data source name. The string length in characters must be 3-200. The value must also match a service-defined regular expression. |
| s3 | **PXFDatasourceS3** Settings of an external S3 data source. Includes only one of the fields `s3`, `jdbc`, `hdfs`, `hive`. |
| jdbc | **PXFDatasourceJDBC** Settings of an external JDBC data source. Includes only one of the fields `s3`, `jdbc`, `hdfs`, `hive`. |
| hdfs | **PXFDatasourceHDFS** Settings of an external HDFS data source. Includes only one of the fields `s3`, `jdbc`, `hdfs`, `hive`. |
| hive | **PXFDatasourceHive** Settings of an external Hive data source. Includes only one of the fields `s3`, `jdbc`, `hdfs`, `hive`. |
PXFDatasourceS3
| Field | Description |
| --- | --- |
| access_key | **string** Public key to access S3 storage. The maximum string length in characters is 200. |
| secret_key | **string** Secret key to access S3 storage. The maximum string length in characters is 200. |
| fast_upload | **google.protobuf.BoolValue** Manages fast upload of large files to S3 storage. Fast upload is enabled by default. |
| endpoint | **string** S3 storage address. The maximum string length in characters is 200. The value must also match a service-defined regular expression. |
PXFDatasourceJDBC
| Field | Description |
| --- | --- |
| driver | **string** JDBC driver class in Java. The maximum string length in characters is 50. |
| url | **string** URL that the JDBC driver uses to connect to the database. The maximum string length in characters is 1000. |
| user | **string** Username of the DB owner. The maximum string length in characters is 200. |
| password | **string** Password of the DB owner. The maximum string length in characters is 200. |
| statement_batch_size | **google.protobuf.Int64Value** Number of rows to read in an external table, in a batch. Acceptable values are 50 to 1000, inclusive. |
| statement_fetch_size | **google.protobuf.Int64Value** Number of rows to fetch (buffer) when reading from an external table. Acceptable values are 50 to 10000, inclusive. |
| statement_query_timeout | **google.protobuf.Int64Value** Amount of time (in seconds) the JDBC driver waits for a statement to run. This timeout applies to statements created for both read and write operations. Acceptable values are 5 to 43200, inclusive. |
| pool_enabled | **google.protobuf.BoolValue** Determines whether JDBC connection pooling is used in a server configuration. By default, it is used. |
| pool_maximum_size | **google.protobuf.Int64Value** Maximum number of connections to the DB backend. Acceptable values are 1 to 200, inclusive. |
| pool_connection_timeout | **google.protobuf.Int64Value** Maximum time, in milliseconds, to wait for a connection from the pool. Acceptable values are 5000 to 600000, inclusive. |
| pool_idle_timeout | **google.protobuf.Int64Value** Maximum amount of time, in milliseconds, after which an inactive connection is considered idle. Acceptable values are 5000 to 600000, inclusive. |
| pool_minimum_idle | **google.protobuf.Int64Value** Minimum number of idle connections maintained in the connection pool. Acceptable values are 0 to 200, inclusive. |
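The numeric JDBC settings all carry inclusive ranges, so a client can reject out-of-range values before calling the service. The following is a minimal sketch of such a pre-check; `JDBC_RANGES` and `check_jdbc_settings` are hypothetical names, and the bounds are copied from the PXFDatasourceJDBC field descriptions on this page.

```python
# Inclusive bounds taken from the PXFDatasourceJDBC field descriptions.
JDBC_RANGES = {
    "statement_batch_size": (50, 1000),
    "statement_fetch_size": (50, 10000),
    "statement_query_timeout": (5, 43200),      # seconds
    "pool_maximum_size": (1, 200),
    "pool_connection_timeout": (5000, 600000),  # milliseconds
    "pool_idle_timeout": (5000, 600000),        # milliseconds
    "pool_minimum_idle": (0, 200),
}


def check_jdbc_settings(jdbc: dict) -> list[str]:
    """Return a list of range problems; an empty list means every set value is in range."""
    problems = []
    for field, (low, high) in JDBC_RANGES.items():
        value = jdbc.get(field)
        if value is None:  # field not set, nothing to check
            continue
        if not low <= value <= high:
            problems.append(f"{field}={value} is outside {low}..{high}")
    return problems
```

For example, `check_jdbc_settings({"statement_batch_size": 10})` reports one problem, because the smallest accepted batch size is 50.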
PXFDatasourceHDFS
| Field | Description |
| --- | --- |
| core | **PXFDatasourceCore** Settings of the file system and security rules. |
| kerberos | **PXFDatasourceKerberos** Settings of the Kerberos network authentication protocol. |
| user_impersonation | **google.protobuf.BoolValue** Enables authentication on behalf of the Greenplum® user when connecting to the remote file storage or DBMS. The authentication is disabled by default. |
| username | **string** Login username for the remote file storage or DBMS if authentication on behalf of the Greenplum® user is enabled. The maximum string length in characters is 128. |
| sasl_connection_retries | **google.protobuf.Int64Value** Maximum number of times that PXF retries a SASL connection request after a connection is refused. Acceptable values are 1 to 50, inclusive. |
| zk_hosts[] | **string** ZooKeeper server hosts. The maximum number of elements is 200. The string length in characters for each value must be 1-200. |
| dfs | **PXFDatasourceHDFSDfs** Settings of the distributed file system. |
| yarn | **PXFDatasourceHDFSYarn** Settings of the ResourceManager service that is responsible for tracking resources in a cluster and scheduling applications (e.g., MapReduce jobs). |
PXFDatasourceCore
| Field | Description |
| --- | --- |
| default_fs | **string** URI whose scheme and authority determine the file system implementation. The maximum string length in characters is 200. |
| security_auth_to_local | **string** Rules for mapping Kerberos principals to operating system user accounts. The maximum string length in characters is 1000. |
PXFDatasourceKerberos
| Field | Description |
| --- | --- |
| enable | **google.protobuf.BoolValue** Determines whether the Kerberos authentication server is used. By default, it is not used. |
| primary | **string** Host of the primary KDC server (Key Distribution Center). The maximum string length in characters is 128. |
| realm | **string** Kerberos realm for a Greenplum® DB. The maximum string length in characters is 1000. |
| kdc_servers[] | **string** KDC server hosts. The maximum number of elements is 200. The string length in characters for each value must be 1-200. |
| admin_server | **string** Administration server host. Usually, this is the primary Kerberos server. The maximum string length in characters is 128. |
| default_domain | **string** Domain that is used for the host name extension. Applicable when Kerberos 4 service members become Kerberos 5 service members (for example, when rcmd.hostname is replaced with host/hostname.domain). The maximum string length in characters is 128. |
| keytab_base64 | **string** Base64-encoded contents of the keytab file. The value must also match a service-defined regular expression. |
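Because `keytab_base64` carries the keytab file contents rather than a path, the binary file has to be encoded on the client side. The sketch below shows one way to do that with Python's standard `base64` module. It assumes the standard Base64 alphabet with padding is what the service expects, and `encode_keytab` and `kerberos_settings` are hypothetical helper names.

```python
import base64


def encode_keytab(raw: bytes) -> str:
    """Encode raw keytab bytes as an ASCII Base64 string (standard alphabet, padded)."""
    return base64.b64encode(raw).decode("ascii")


def kerberos_settings(keytab: bytes, primary: str, realm: str, kdc_servers: list[str]) -> dict:
    """Build a PXFDatasourceKerberos-shaped dict; only a subset of fields is set here."""
    return {
        "enable": True,
        "primary": primary,
        "realm": realm,
        "kdc_servers": list(kdc_servers),
        "keytab_base64": encode_keytab(keytab),
    }


# In practice the bytes would come from a file, for example:
#   keytab = pathlib.Path("pxf.keytab").read_bytes()   # path is a placeholder
```

The same dictionary shape applies to the `kerberos` field of both the `hdfs` and `hive` settings.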
PXFDatasourceHDFSDfs
| Field | Description |
| --- | --- |
| ha_automatic_failover_enabled | **google.protobuf.BoolValue** Determines whether automatic failover is enabled for the high availability of the file system. The automatic failover is enabled by default. |
| block_access_token_enabled | **google.protobuf.BoolValue** Determines whether access tokens are checked. The check of access tokens is enabled by default. |
| use_datanode_hostname | **google.protobuf.BoolValue** Determines whether the datanode hostname is used when connecting to datanodes. |
| namenodes | **object** (`map<string, PXFDatasourceHDFSDfsNamenode>`) List of HDFS service logical names. Specify them separated by commas. The names can be arbitrary. No more than 10000 per resource. |
| nameservices | **string** Corresponds to the well-known HDFS client setting `dfs.nameservices` for this data source. |
PXFDatasourceHDFSDfsNamenode
| Field | Description |
| --- | --- |
| rpc_address | **string** The maximum string length in characters is 1000. |
| service_rpc_address | **string** The maximum string length in characters is 1000. |
| http_address | **string** The maximum string length in characters is 1000. |
| https_address | **string** The maximum string length in characters is 1000. |
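The `namenodes` field is a map whose values are PXFDatasourceHDFSDfsNamenode messages, which is easier to see in a concrete shape. The sketch below builds such a map from logical names to RPC addresses. Treating the map keys as namenode logical names is an assumption made for illustration, and `dfs_settings`, the host names, and the port are all hypothetical.

```python
def dfs_settings(nameservice: str, rpc_addresses: dict[str, str]) -> dict:
    """Build a PXFDatasourceHDFSDfs-shaped dict from {logical name: rpc address}."""
    if len(rpc_addresses) > 10000:
        raise ValueError("namenodes: no more than 10000 entries are allowed")
    for name, address in rpc_addresses.items():
        if len(address) > 1000:
            raise ValueError(f"namenodes[{name!r}].rpc_address exceeds 1000 characters")
    return {
        "nameservices": nameservice,
        # Each map value is a namenode message; only rpc_address is filled in here.
        "namenodes": {name: {"rpc_address": addr} for name, addr in rpc_addresses.items()},
    }


dfs = dfs_settings(
    "hdfs-prod",  # placeholder nameservice
    {"nn1": "nn1.example.internal:8020", "nn2": "nn2.example.internal:8020"},
)
```

The other namenode fields (`service_rpc_address`, `http_address`, `https_address`) would be added to each map value in the same way.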
PXFDatasourceHDFSYarn
| Field | Description |
| --- | --- |
| resourcemanager_ha_enabled | **google.protobuf.BoolValue** Determines whether high availability is enabled for YARN's ResourceManager services. The high availability is enabled by default. |
| resourcemanager_ha_auto_failover_enabled | **google.protobuf.BoolValue** Determines whether another ResourceManager should automatically become active when the active ResourceManager has failed and does not respond. The switch of ResourceManagers is enabled by default if the high availability is enabled. |
| resourcemanager_ha_auto_failover_embedded | **google.protobuf.BoolValue** Determines whether the embedded ActiveStandbyElector method should be used for the election of the active ResourceManager. If the current active ResourceManager has failed and does not respond, the ActiveStandbyElector method makes another ResourceManager active which then takes over. |
| resourcemanager_cluster_id | **string** Cluster ID. Specify it, so the ResourceManager service does not become active for a different cluster. The maximum string length in characters is 1000. |
| ha_rm | **object** (`map<string, PXFDatasourceHDFSYarnHaRm>`) Highly available ResourceManager service. No more than 10000 per resource. |
PXFDatasourceHDFSYarnHaRm
| Field | Description |
| --- | --- |
| resourcemanager_address | **string** |
| resourcemanager_scheduler_address | **string** |
| resourcemanager_resource_tracker_address | **string** |
| resourcemanager_admin_address | **string** |
| resourcemanager_webapp_address | **string** |
| resourcemanager_webapp_https_address | **string** |
PXFDatasourceHive
| Field | Description |
| --- | --- |
| core | **PXFDatasourceCore** Settings of the file system and security rules. |
| kerberos | **PXFDatasourceKerberos** Settings of the Kerberos network authentication protocol. |
| user_impersonation | **google.protobuf.BoolValue** Enables authentication on behalf of the Greenplum® user when connecting to the remote file storage or DBMS. The authentication is disabled by default. |
| username | **string** Login username for the remote file storage or DBMS if authentication on behalf of the Greenplum® user is enabled. The maximum string length in characters is 128. |
| sasl_connection_retries | **google.protobuf.Int64Value** Maximum number of times that PXF retries a SASL connection request after a connection is refused. Acceptable values are 1 to 50, inclusive. |
| zk_hosts[] | **string** ZooKeeper server hosts. The maximum number of elements is 200. The string length in characters for each value must be 1-200. |
| ppd | **google.protobuf.BoolValue** Specifies if predicate pushdown is enabled for queries on external tables. The predicate pushdown is enabled by default. |
| metastore_uris[] | **string** List of URIs separated by commas. To request metadata, the remote DBMS connects to Metastore by one of these URIs. The maximum number of elements is 200. The string length in characters for each value must be 1-200. |
| metastore_kerberos_principal | **string** Service principal for the Metastore Thrift server. The maximum string length in characters is 1000. |
| auth_kerberos_principal | **string** Kerberos server principal. The maximum string length in characters is 1000. |
operation.Operation
{
"id": "string",
"description": "string",
"created_at": "google.protobuf.Timestamp",
"created_by": "string",
"modified_at": "google.protobuf.Timestamp",
"done": "bool",
"metadata": {
"cluster_id": "string",
"datasource_name": "string"
},
// Includes only one of the fields `error`, `response`
"error": "google.rpc.Status",
"response": {
"name": "string",
// Includes only one of the fields `s3`, `jdbc`, `hdfs`, `hive`
"s3": {
"access_key": "string",
"secret_key": "string",
"fast_upload": "google.protobuf.BoolValue",
"endpoint": "string"
},
"jdbc": {
"driver": "string",
"url": "string",
"user": "string",
"password": "string",
"statement_batch_size": "google.protobuf.Int64Value",
"statement_fetch_size": "google.protobuf.Int64Value",
"statement_query_timeout": "google.protobuf.Int64Value",
"pool_enabled": "google.protobuf.BoolValue",
"pool_maximum_size": "google.protobuf.Int64Value",
"pool_connection_timeout": "google.protobuf.Int64Value",
"pool_idle_timeout": "google.protobuf.Int64Value",
"pool_minimum_idle": "google.protobuf.Int64Value"
},
"hdfs": {
"core": {
"default_fs": "string",
"security_auth_to_local": "string"
},
"kerberos": {
"enable": "google.protobuf.BoolValue",
"primary": "string",
"realm": "string",
"kdc_servers": [
"string"
],
"admin_server": "string",
"default_domain": "string",
"keytab_base64": "string"
},
"user_impersonation": "google.protobuf.BoolValue",
"username": "string",
"sasl_connection_retries": "google.protobuf.Int64Value",
"zk_hosts": [
"string"
],
"dfs": {
"ha_automatic_failover_enabled": "google.protobuf.BoolValue",
"block_access_token_enabled": "google.protobuf.BoolValue",
"use_datanode_hostname": "google.protobuf.BoolValue",
"namenodes": "map<string, PXFDatasourceHDFSDfsNamenode>",
"nameservices": "string"
},
"yarn": {
"resourcemanager_ha_enabled": "google.protobuf.BoolValue",
"resourcemanager_ha_auto_failover_enabled": "google.protobuf.BoolValue",
"resourcemanager_ha_auto_failover_embedded": "google.protobuf.BoolValue",
"resourcemanager_cluster_id": "string",
"ha_rm": "map<string, PXFDatasourceHDFSYarnHaRm>"
}
},
"hive": {
"core": {
"default_fs": "string",
"security_auth_to_local": "string"
},
"kerberos": {
"enable": "google.protobuf.BoolValue",
"primary": "string",
"realm": "string",
"kdc_servers": [
"string"
],
"admin_server": "string",
"default_domain": "string",
"keytab_base64": "string"
},
"user_impersonation": "google.protobuf.BoolValue",
"username": "string",
"sasl_connection_retries": "google.protobuf.Int64Value",
"zk_hosts": [
"string"
],
"ppd": "google.protobuf.BoolValue",
"metastore_uris": [
"string"
],
"metastore_kerberos_principal": "string",
"auth_kerberos_principal": "string"
}
// end of the list of possible fields
}
// end of the list of possible fields
}
An Operation resource. For more information, see Operation.
| Field | Description |
| --- | --- |
| id | **string** ID of the operation. |
| description | **string** Description of the operation. 0-256 characters long. |
| created_at | **google.protobuf.Timestamp** Creation timestamp. |
| created_by | **string** ID of the user or service account who initiated the operation. |
| modified_at | **google.protobuf.Timestamp** The time when the Operation resource was last modified. |
| done | **bool** If the value is `false`, the operation is still in progress. If `true`, the operation is completed, and either `error` or `response` is available. |
| metadata | **CreatePXFDatasourceMetadata** Service-specific metadata associated with the operation. |
| error | **google.rpc.Status** The error result of the operation in case of failure or cancellation. Includes only one of the fields `error`, `response`. The operation result. |
| response | **PXFDatasource** The normal response of the operation in case of success. Includes only one of the fields `error`, `response`. The operation result. |
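Since the operation carries either `error` or `response`, a caller typically inspects `done` first and then branches on which of the two is present. The sketch below does this for an Operation already converted to a dictionary. It assumes `done` follows the usual long-running operation convention (`false` while the operation is still running), and `operation_result` is a hypothetical helper name.

```python
from typing import Optional


def operation_result(operation: dict) -> Optional[dict]:
    """Return the created data source, None while running, or raise if the operation failed."""
    if not operation.get("done", False):
        return None  # still in progress; the caller should fetch the operation again later
    if "error" in operation:
        status = operation["error"]  # google.rpc.Status: code, message, details
        raise RuntimeError(
            f"operation {operation.get('id')} failed: "
            f"{status.get('code')} {status.get('message')}"
        )
    return operation["response"]
```

A caller would invoke this on each refreshed copy of the operation until it returns a data source or raises.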
CreatePXFDatasourceMetadata
| Field | Description |
| --- | --- |
| cluster_id | **string** Required field. The maximum string length in characters is 50. |
| datasource_name | **string** Required field. The string length in characters must be 3-200. The value must also match a service-defined regular expression. |