Enable saving logs to the environment's bucket

Cloud Composer 1 | Cloud Composer 2

This page explains how to enable saving Airflow task logs to the environment's bucket for a new or existing environment.

All composer environments created after 2024-03-26 don't store task logs in the environment's bucket by default. Task logs are still available in Cloud Logging and Airflow UI.

As an option, you can enable the synchronization of task logs to the environment's bucket.

Before you begin

  • If you enable this feature, logs are saved both to Cloud Logging and to the environment's bucket.

  • When you upgrade your environment, the configuration for saving logs in the environment's bucket is not changed. For example, if you upgrade an environment that saved logs to the environment's bucket to a later version, the environment keeps saving logs to the environment's bucket.

  • If you disable saving task logs to the environment's bucket, the logs that were already saved to the environment's bucket are not deleted automatically.

  • To enable or disable this option, you can use Google Cloud CLI, Cloud Composer API, or Terraform. It is not possible to change this option through Google Cloud console.

Enable or disable saving task logs to the environment's bucket when creating an environment

gcloud

When you create an environment, the following arguments specify how Airflow task logs must be saved:

  • --disable-logs-in-cloud-logging-only argument enables saving Airflow task logs to the environment's bucket.
  • --enable-logs-in-cloud-logging-only argument disables saving Airflow task logs to the environment's bucket. Logs are saved only to Cloud Logging.
gcloud composer environments create ENVIRONMENT_NAME \
    --location LOCATION \
    --disable-logs-in-cloud-logging-only

Replace the following:

  • ENVIRONMENT_NAME: the name of your environment.
  • LOCATION: the region where the environment is located.

Example:

gcloud composer environments create example-environment \
    --location us-central1 \
    --disable-logs-in-cloud-logging-only

API

When you create an environment, in the Environment > EnvironmentConfig > DataRetentionConfig > TaskLogsRetentionConfig resource, specify how Airflow task logs must be saved:

  • CLOUD_LOGGING_AND_CLOUD_STORAGE to save logs to the environment's bucket and Cloud Logging.
  • CLOUD_LOGGING_ONLY to save logs only in Cloud Logging.
{
  "name": "projects/PROJECT_ID/locations/LOCATION/environments/ENVIRONMENT_NAME",
  "config": {
    "dataRetentionConfig": {
      "taskLogsRetentionConfig": {
        "storageMode": "CLOUD_LOGGING_AND_CLOUD_STORAGE"
      }
    }
  }
}

Replace the following:

  • ENVIRONMENT_NAME: the name of your environment.
  • LOCATION: the region where the environment is located.
  • PROJECT_ID: the Project ID.

Example:

// POST https://composer.googleapis.com/v1/{parent=projects/*/locations/*}/environments

{
  "name": "projects/example-project/locations/us-central1/environments/example-environment",
  "config": {
    "dataRetentionConfig": {
      "taskLogsRetentionConfig": {
        "storageMode": "CLOUD_LOGGING_AND_CLOUD_STORAGE"
      }
    }
  }
}

Terraform

When you create an environment, the storage_mode field in the task_logs_retention_config block specifies how Airflow task logs must be saved:

  • CLOUD_LOGGING_AND_CLOUD_STORAGE to save logs to the environment's bucket and Cloud Logging.
  • CLOUD_LOGGING_ONLY to save logs only in Cloud Logging.
resource "google_composer_environment" "example" {
  provider = google-beta
  name = "ENVIRONMENT_NAME"
  region = "LOCATION"

  config {

    data_retention_config {

      task_logs_retention_config {

        storage_mode = "CLOUD_LOGGING_AND_CLOUD_STORAGE"

      }
    }
  }

Replace the following:

  • ENVIRONMENT_NAME: the name of your environment.
  • LOCATION: the region where the environment is located.

Example:

resource "google_composer_environment" "example" {
  provider = google-beta
  name = "example-environment"
  region = "us-central1"

  config {

    data_retention_config {

      task_logs_retention_config {

        storage_mode = "CLOUD_LOGGING_AND_CLOUD_STORAGE"

      }
    }
  }
}

Enable or disable saving task logs to the environment's bucket for an existing environment

gcloud

When you update an environment, the following arguments enable or disable saving Airflow task logs to the environment's bucket:

  • --disable-logs-in-cloud-logging-only argument enables saving Airflow task logs to the environment's bucket. Logs are saved to the environment's bucket and to Cloud Logging.
  • --enable-logs-in-cloud-logging-only argument disables saving Airflow task logs to the environment's bucket. Logs are saved only to Cloud Logging.

To save Airflow logs in the environment's bucket:

gcloud composer environments update ENVIRONMENT_NAME \
    --location LOCATION \
    --disable-logs-in-cloud-logging-only

Replace the following:

  • ENVIRONMENT_NAME: the name of your environment.
  • LOCATION: the region where the environment is located.

Example:

gcloud composer environments update example-environment \
    --location us-central1 \
    --disable-logs-in-cloud-logging-only

API

  1. Construct an environments.patch API request.

  2. In this request:

    1. In the updateMask parameter, specify the config.dataRetentionConfig.taskLogsRetentionConfig.storageMode mask.

    2. In the request body, specify how Airflow task logs must be saved:

      • CLOUD_LOGGING_AND_CLOUD_STORAGE to save logs to the environment's bucket and Cloud Logging.
      • CLOUD_LOGGING_ONLY to save logs only in Cloud Logging.
{
  "config": {
    "dataRetentionConfig": {
      "taskLogsRetentionConfig": {
        "storageMode": "CLOUD_LOGGING_AND_CLOUD_STORAGE"
      }
    }
  }
}

Example:

// PATCH https://composer.googleapis.com/v1/projects/example-project/
// locations/us-central1/environments/example-environment?updateMask=
// config.dataRetentionConfig.taskLogsRetentionConfig.storageMode

{
  "config": {
    "dataRetentionConfig": {
      "taskLogsRetentionConfig": {
        "storageMode": "CLOUD_LOGGING_AND_CLOUD_STORAGE"
      }
    }
  }
}

Terraform

In the task_logs_retention_config block, in the storage_mode field specify how Airflow task logs must be saved:

  • CLOUD_LOGGING_AND_CLOUD_STORAGE to save logs to the environment's bucket and Cloud Logging.
  • CLOUD_LOGGING_ONLY to save logs only in Cloud Logging.
resource "google_composer_environment" "example" {
  provider = google-beta
  name = "ENVIRONMENT_NAME"
  region = "LOCATION"

  config {

    data_retention_config {

      task_logs_retention_config {

        storage_mode = "CLOUD_LOGGING_AND_CLOUD_STORAGE"

      }
    }
  }

Replace the following:

  • ENVIRONMENT_NAME: the name of your environment.
  • LOCATION: the region where the environment is located.

Example:

resource "google_composer_environment" "example" {
  provider = google-beta
  name = "example-environment"
  region = "us-central1"

  config {

    data_retention_config {

      task_logs_retention_config {

        storage_mode = "CLOUD_LOGGING_AND_CLOUD_STORAGE"

      }
    }
  }
}

What's next