The environment values to set at runtime.
JSON representation |
---|
{ "numWorkers": integer, "maxWorkers": integer, "zone": string, "serviceAccountEmail": string, "tempLocation": string, "bypassTempDirValidation": boolean, "machineType": string, "additionalExperiments": [ string ], "network": string, "subnetwork": string, "additionalUserLabels": { string: string, ... }, "kmsKeyName": string, "ipConfiguration": enum ( |
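
As a rough illustration of the JSON representation above, the following Python sketch builds an environment object and serializes it. It uses only field names listed in the representation. Every value (bucket name, machine type, experiment name, subnetwork, label) is a hypothetical placeholder, and the `gs://` prefix on `tempLocation` is assumed to be the usual Cloud Storage URL scheme.

```python
import json

# All values are hypothetical placeholders chosen for illustration only.
environment = {
    "numWorkers": 2,
    "maxWorkers": 10,
    "tempLocation": "gs://example-bucket/tmp",  # assumed Cloud Storage URL scheme
    "machineType": "n1-standard-1",
    "additionalExperiments": ["example_experiment"],
    "network": "default",
    "subnetwork": "regions/us-west1/subnetworks/example-subnet",
    "additionalUserLabels": {"team": "example"},
}

# Serialize to the JSON text that would be embedded in a request body.
body = json.dumps(environment, indent=2)
print(body)
```

Optional fields are left out of the dictionary rather than set to null, so that the defaults described in the field table apply.
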
Fields | |
---|---|
`numWorkers` | Optional. The initial number of Google Compute Engine instances for the job. The default value is 11. |
`maxWorkers` | Optional. The maximum number of Google Compute Engine instances to be made available to your pipeline during execution, from 1 to 1000. The default value is 1. |
`zone` | Optional. The Compute Engine availability zone for launching worker instances to run your pipeline. In the future, workerZone will take precedence. |
`serviceAccountEmail` | Optional. The email address of the service account to run the job as. |
`tempLocation` | Required. The Cloud Storage path to use for temporary files. Must be a valid Cloud Storage URL, beginning with |
`bypassTempDirValidation` | Optional. Whether to bypass the safety checks for the job's temporary directory. Use with caution. |
`machineType` | Optional. The machine type to use for the job. Defaults to the value from the template if not specified. |
`additionalExperiments` | Optional. Additional experiment flags for the job, specified with the |
`network` | Optional. Network to which VMs will be assigned. If empty or unspecified, the service will use the network "default". |
`subnetwork` | Optional. Subnetwork to which VMs will be assigned, if desired. You can specify a subnetwork using either a complete URL or an abbreviated path. The expected form is "https://www.googleapis.com/compute/v1/projects/HOST_PROJECT_ID/regions/REGION/subnetworks/SUBNETWORK" or "regions/REGION/subnetworks/SUBNETWORK". If the subnetwork is located in a Shared VPC network, you must use the complete URL. |
`additionalUserLabels` | Optional. Additional user labels to be specified for the job. Keys and values should follow the restrictions specified in the labeling restrictions page. An object containing a list of "key": value pairs. Example: { "name": "wrench", "mass": "1kg", "count": "3" }. |
`kmsKeyName` | Optional. Name for the Cloud KMS key for the job. Key format is: projects/ |
`ipConfiguration` | Optional. Configuration for VM IPs. |
`workerRegion` | Required. The Compute Engine region (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in which worker processing should occur, e.g. "us-west1". Mutually exclusive with workerZone. If neither workerRegion nor workerZone is specified, the service defaults to the control plane's region. |
`workerZone` | Optional. The Compute Engine zone (https://cloud.google.com/compute/docs/regions-zones/regions-zones) in which worker processing should occur, e.g. "us-west1-a". Mutually exclusive with workerRegion. If neither workerRegion nor workerZone is specified, a zone in the control plane's region is chosen based on available capacity. If both |
enable |
Optional. Whether to enable Streaming Engine for the job. |
disk |
Optional. The disk size, in gigabytes, to use on each remote Compute Engine worker instance. |
streaming |
Optional. Specifies the Streaming Engine message processing guarantees. Reduces cost and latency but might result in duplicate messages committed to storage. Designed to run simple mapping streaming ETL jobs at the lowest cost. For example, Change Data Capture (CDC) to BigQuery is a canonical use case. For more information, see Set the pipeline streaming mode. |
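
Several constraints stated in the field descriptions above can be checked on the client before a request is sent. The sketch below is one possible way to do that in Python. The function name `check_environment` and its error messages are hypothetical, not part of any client library. The `gs://` prefix check assumes the standard Cloud Storage URL scheme. The service performs its own validation and may apply rules not covered here.

```python
import re
from typing import List

# The two subnetwork forms quoted in the subnetwork field description.
_SUBNETWORK_SHORT = re.compile(r"^regions/[^/]+/subnetworks/[^/]+$")
_SUBNETWORK_FULL = re.compile(
    r"^https://www\.googleapis\.com/compute/v1/projects/[^/]+"
    r"/regions/[^/]+/subnetworks/[^/]+$"
)


def check_environment(env: dict) -> List[str]:
    """Return a list of problems found in a runtime environment dict."""
    problems = []

    # maxWorkers is documented as ranging from 1 to 1000.
    max_workers = env.get("maxWorkers")
    if max_workers is not None and not 1 <= max_workers <= 1000:
        problems.append("maxWorkers must be between 1 and 1000")

    # tempLocation is marked Required and must be a Cloud Storage URL.
    temp_location = env.get("tempLocation")
    if not temp_location:
        problems.append("tempLocation is required")
    elif not temp_location.startswith("gs://"):  # assumed URL scheme
        problems.append("tempLocation must be a Cloud Storage URL")

    # workerRegion and workerZone are documented as mutually exclusive.
    if env.get("workerRegion") and env.get("workerZone"):
        problems.append("workerRegion and workerZone are mutually exclusive")

    # subnetwork must be a complete URL or the abbreviated path.
    subnetwork = env.get("subnetwork")
    if subnetwork and not (
        _SUBNETWORK_SHORT.match(subnetwork) or _SUBNETWORK_FULL.match(subnetwork)
    ):
        problems.append("subnetwork must be a complete URL or an abbreviated path")

    return problems
```

An empty list means none of these particular checks failed. It does not guarantee that the service will accept the environment.
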