This page explains how to encrypt sensitive data in Google-provided template parameters, by using encryption keys with Cloud KMS.
Overview
Some Google-provided templates enable you to encrypt sensitive data in the template parameters, such as usernames, passwords, JDBC connection strings, and API keys. When supported, these templates include a parameter to specify the Cloud KMS encryption key, such as:
KMSEncryptionKey
tokenKMSEncryptionKey
apiKeyKMSEncryptionKey
To use Cloud KMS encryption keys with these templates, perform the following steps:
- Grant the Dataflow worker service account the Cloud KMS CryptoKey Decrypter role.
- Create an encryption key.
- Use the key to encrypt the data.
- Base64-encode the encrypted data.
- When you run the template, specify the encryption key and use the encrypted parameter values.
If you specify an encryption key, you must encrypt all of the parameters that support encryption. To understand which parameters can be encrypted, see the documentation for the specific template.
Example
The following example uses the MySQL to BigQuery template.
Create a key ring.
gcloud kms keyrings create "KEY_RING_NAME" \ --location "global"
Create an encryption key.
gcloud kms keys create "KEY_NAME" \ --location "global" \ --keyring "KEY_RING_NAME" \ --purpose "encryption"
Encrypt and base64-encode the username, password, and JDBC connection string.
export USER_NAME=`echo -n "USER_NAME" \ | gcloud kms encrypt --key=quickstart --keyring=test --location=global --plaintext-file=- --ciphertext-file=- \ | base64 -w 0` export PASSWORD=`echo -n "PASSWORD" \ | gcloud kms encrypt --key=quickstart --keyring=test --location=global --plaintext-file=- --ciphertext-file=- \ | base64 -w 0` export CONNECTION_STRING=`echo -n "CONNECTION_STRING" \ | gcloud kms encrypt --key=quickstart --keyring=test --location=global --plaintext-file=- --ciphertext-file=- \ | base64 -w 0`
Run the template.
gcloud dataflow flex-template run mysql-job \ --project=PROJECT_ID \ --region=us-central1 \ --template-file-gcs-location=gs://dataflow-templates-us-central1/latest/flex/MySQL_to_BigQuery \ --parameters \ connectionURL="$CONNECTION_STRING",\ query="SOURCE_SQL_QUERY",\ outputTable=PROJECT_ID:DATASET.TABLE_NAME,\ bigQueryLoadingTemporaryDirectory=CLOUD_STORAGE_PATH,\ username="$USER_NAME",\ password="$PASSWORD",\ KMSEncryptionKey=projects/PROJECT_ID/locations/global/keyRings/KEY_RING_NAME/cryptoKeys/KEY_NAME
Troubleshooting
This section contains troubleshooting information for encrypting template parameters.
Permission denied
When you run the job, you see a PERMISSION_DENIED
error in the job logs,
similar to the following:
PERMISSION_DENIED: Permission cloudkms.cryptoKeyVersions.useToDecrypt denied on
resource RESOURCE_PATH (or it may not exist)
To decrypt the data, the Dataflow worker service account needs the cloudkms.cryptoKeyVersions.useToDecrypt permission for the encryption key. Make sure the worker service account has the Cloud KMS CryptoKey Decrypter role. For more information, see Dataflow security and permissions.
What's next
- Learn more about Cloud Key Management Service.
- See the list of Google-provided templates.