This page describes how to schedule backups for Cassandra in Cloud Storage. With this method, backups are stored in the Cloud Storage bucket that you specify.
To schedule Cassandra backups, perform the following steps:
- Run the following create-service-account command to create a Google Cloud service account (SA) with the standard roles/storage.objectAdmin role. This SA role allows you to write backup data to Cloud Storage. Execute the command in the $APIGEE_HELM_CHARTS_HOME/apigee-operator/etc/ directory.

      ./tools/create-service-account --env non-prod --dir ./

  This command creates a single service account named apigee-non-prod for use in non-production environments and places the downloaded key file in the ./ directory. For more information about Google Cloud service accounts, see Creating and managing service accounts.
- The create-service-account command saves a JSON file containing the service account private key. The file is saved in the same directory where the command executes. You will need the path to this file in the following steps.
- Create a Cloud Storage bucket. Specify a reasonable data retention policy for the bucket. Apigee recommends a data retention policy of 15 days.
- Open your overrides.yaml file.
- Add the following cassandra.backup properties to enable backup. Do not remove any of the properties that are already configured.

  Parameters

      cassandra:
        ...
        backup:
          enabled: true
          serviceAccountPath: SA_JSON_FILE_PATH
          dbStorageBucket: CLOUD_STORAGE_BUCKET_NAME
          schedule: BACKUP_SCHEDULE_CODE
          cloudProvider: "GCP" # For remote server backup set this to HYBRID (all caps)
        ...

  Example

      ...
      cassandra:
        storage:
          type: gcepd
          capacity: 50Gi
          gcepd:
            replicationType: regional-pd
        auth:
          default:
            password: "abc123"
          admin:
            password: "abc234"
          ddl:
            password: "abc345"
          dml:
            password: "abc456"
        nodeSelector:
          key: cloud.google.com/gke-nodepool
          value: apigee-data
        backup:
          enabled: true
          serviceAccountPath: "my-cassandra-backup-sa.json"
          dbStorageBucket: "myname-cassandra-backup"
          schedule: "45 23 * * 6"
          cloudProvider: "GCP"
      ...

  The properties are described in the property table below.
- Apply the configuration changes to the new cluster. For example:

      helm upgrade datastore apigee-datastore/ \
        --namespace APIGEE_NAMESPACE \
        --atomic \
        -f OVERRIDES_FILE.yaml

  Where OVERRIDES_FILE is the path to the overrides file you just edited.
- Verify the backup job. For example:

      kubectl get cronjob -n APIGEE_NAMESPACE

      NAME                      SCHEDULE     SUSPEND   ACTIVE   LAST SCHEDULE   AGE
      apigee-cassandra-backup   33 * * * *   False     0        <none>          94s
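
Before applying the overrides, it can help to confirm that the key file is where the chart will look for it. The following is a minimal sketch, not part of the Apigee tooling: the function name check_sa_key and its arguments are hypothetical, and it assumes only that the file is a standard Google Cloud service account key, which contains "type": "service_account".

```shell
# Hypothetical preflight helper (not an Apigee tool).
# Usage: check_sa_key CHART_DIR RELATIVE_KEY_PATH
check_sa_key() {
  chart_dir=$1
  key_path=$2
  key_file="$chart_dir/$key_path"

  # serviceAccountPath must be relative to the chart directory,
  # so reject absolute paths early.
  case $key_path in
    /*) echo "path must be relative to $chart_dir: $key_path" >&2; return 1 ;;
  esac

  if [ ! -f "$key_file" ]; then
    echo "key file not found: $key_file" >&2
    return 1
  fi

  # Service account key files carry "type": "service_account".
  if ! grep -q '"type"[[:space:]]*:[[:space:]]*"service_account"' "$key_file"; then
    echo "not a service account key: $key_file" >&2
    return 1
  fi

  echo "ok: $key_file"
}
```

For example, from the apigee-datastore chart directory you might run check_sa_key . my-cassandra-backup-sa.json and continue only if it prints an ok line.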
| Property | Description | 
|---|---|
| backup:enabled | Backup is disabled by default. You must set this property to true. | 
| backup:serviceAccountPath | SA_JSON_FILE_PATH The path on your filesystem to the service account JSON file that was downloaded when you ran the create-service-account command. The path must be relative to the apigee-datastore chart directory. The example above uses my-cassandra-backup-sa.json. |
| backup:dbStorageBucket | CLOUD_STORAGE_BUCKET_NAME The name of an existing Google Cloud Storage bucket that will be used to store backup archives. See Creating buckets if you need to create one. | 
| backup:cloudProvider | For a Cloud Storage backup, set the property to GCP. For a remote server backup, set the property to HYBRID (all caps). |
| backup:schedule | BACKUP_SCHEDULE_CODE The time when the backup starts, specified in standard crontab syntax. Default: |
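
Because backup:schedule uses standard crontab syntax, the value should have five whitespace-separated fields (minute, hour, day of month, month, day of week). As a rough sanity check before editing the overrides file, a sketch like the following counts the fields. The function names are hypothetical, and the check covers only the number of fields, not whether each field holds a valid value.

```shell
# Hypothetical helper: count whitespace-separated fields in a schedule string.
count_cron_fields() {
  (
    set -f        # disable globbing so '*' is not expanded to file names
    set -- $1     # intentionally unquoted: split the string on whitespace
    echo "$#"
  )
}

# Succeeds only when the schedule has the five fields crontab syntax expects.
is_five_field_cron() {
  [ "$(count_cron_fields "$1")" -eq 5 ]
}
```

The example schedule "45 23 * * 6" passes this check; in crontab terms it means 23:45 every Saturday.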
Launch a manual backup
Cassandra backups run automatically according to the cron schedule set in the overrides.yaml file.
  
To initiate a manual backup, use this command:
kubectl create job -n APIGEE_NAMESPACE --from=cronjob/apigee-cassandra-backup BACKUP_POD_NAME
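
In the command above, the final argument names the Job that Kubernetes creates from the CronJob, so each manual run needs a name that is not already in use in the namespace. The sketch below shows one possible way to generate a unique name and wait for the run to finish. The function names, the apigee-cassandra-backup-manual- prefix, and the 30-minute timeout are illustrative assumptions, not values from the Apigee documentation.

```shell
# Hypothetical helper: build a unique, timestamped Job name.
manual_backup_job_name() {
  printf 'apigee-cassandra-backup-manual-%s\n' "$(date -u +%Y%m%d%H%M%S)"
}

# Hypothetical helper: start a manual backup and block until it completes.
# Usage: run_manual_backup APIGEE_NAMESPACE
run_manual_backup() {
  namespace=$1
  job_name=$(manual_backup_job_name)

  kubectl create job -n "$namespace" \
    --from=cronjob/apigee-cassandra-backup "$job_name" || return 1

  # The timeout is an assumed value; adjust it to the size of your data.
  kubectl wait -n "$namespace" --for=condition=complete \
    "job/$job_name" --timeout=30m
}
```

Calling run_manual_backup APIGEE_NAMESPACE would then create the Job and return once kubectl reports it complete or the timeout expires.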