Set up prebuilt reports in BigQuery

This page explains how to set up and view prebuilt reports in BigQuery. These reports are created using logs from Cloud Logging. To set up reports, you need to do a one-time activity of creating a log sink to stream logging data into BigQuery and then execute the prebuilt script.

Required IAM role

The following IAM permissions are required to view prebuilt reports in BigQuery. Learn how to grant an IAM role.

Role	When to grant the role
Logs Configuration Writer (`roles/logging.configWriter`) or Logging Admin (`roles/logging.admin`) and BigQuery Data Editor (`roles/bigquery.dataEditor`)	To create a sink and BigQuery dataset from the Google Cloud console.
Owner (`roles/owner`)	To create a sink and BigQuery dataset from the Google Cloud CLI.
BigQuery Admin (`bigquery.admin`)	To write custom queries or download queries.

Create a sink and route logs to BigQuery

BigQuery stores only the logs that are generated after a log sink is created. The logs that are generated before creating a log sink are not visible in BigQuery. You can create the log sink from the Google Cloud console or Google Cloud CLI.

To create sink and route logs in BigQuery, do the following:

Console

In the Google Cloud console, go to the Log Router page:
Go to Log Router page
Select an existing Google Cloud project.
Click Create sink.
In the Sink details panel, enter the following fields:
1. Sink name: enter the sink name as BackupandDR_reports_sink. You must use the sink name BackupandDR_reports_sink for the identification of Backup and DR reports from other sinks.
2. Sink description: Describe the purpose or use case for the sink.
In the Sink destination panel, do the following:
1. In the Select sink service menu, select the BigQuery dataset sink service.
2. In the Select BigQuery dataset, select Create new BigQuery dataset.
3. On the Create dataset page, do the following:
  1. For Dataset ID, enter the dataset name as BackupandDR_reports to identify from other datasets. Don't change the dataset name from BackupandDR_reports.
  2. For Location type, choose a geographic location for the dataset. After a dataset is created, the location can't be changed.
  3. Optional: If you want tables in this dataset to expire, select Enable table expiration, then specify the Default maximum table age in days.
  4. Click Create dataset.
In the Choose logs to include in sink panel, do the following:
1. In the Build inclusion filter field, enter the following filter expression that matches the log entries you want to include.
```
 logName=~"projects/PROJECT_ID/logs/backupdr.googleapis.com%2Fgcb_*"
```
2. To verify you entered the correct filter, select Preview logs. This opens the Logs Explorer in a new tab with the filter prepopulated.
Optional: In the Choose logs to filter out of sink panel, do the following:
1. In the Exclusion filter name field, enter a name.
2. In the Build an exclusion filter field, enter a filter expression that matches the log entries you want to exclude. You can also use the sample function to select a portion of the log entries to exclude.
  
  Note: If you want your exclusion filter to be disabled when the sink is created, then select Disable after you enter your filter expression. You can update the sink later to enable the exclusion filter.
Select Create sink.

You can see the dataset in the BigQuery Studio.

gcloud

Go to Activate cloud shell and click Open editor.
Click the icon, select File, and then select New text file.

Copy and paste the following script.

  #!/bin/bash
  echo "This script will set up a log sink for BackupDR reports to be available in BigQuery"

  # Get the default project ID
  DEFAULT_PROJECT_ID=$(gcloud config get-value project)
  read -p "Enter Project ID (default: $DEFAULT_PROJECT_ID, press Enter to continue):" PROJECT_ID

  # Use default if no input is provided
  if [ -z "$PROJECT_ID" ]; then
    PROJECT_ID=$DEFAULT_PROJECT_ID
  fi
    # Set the project ID
  result=$(gcloud config set project $PROJECT_ID)
  if [ $? -ne 0 ]; then
    echo "Error setting the project to $PROJECT_ID"
    exit 1
  fi
    # --- Check if BigQuery API is already enabled, enable if not ---
  echo "Checking if BigQuery API is enabled..."
  if gcloud services list | grep "bigquery.googleapis.com" >/dev/null; then
    echo "BigQuery API is already enabled for $PROJECT_ID"
  else
    echo "For logs to be available in BigQuery, we need to enable BigQuery service in the project if not done already. This might mean additional costs incurred. Please check the pricing at https://cloud.google.com/backup-disaster-recovery/docs/monitor-reports/reports-overview#pricing before proceeding."
    read -p "Do you want to continue(Y/N)?" continue
    if [ "$continue" = "y" ] || [ "$continue" = "Y" ]; then
      echo "Enabling BigQuery API..."
      result=$(gcloud services enable bigquery.googleapis.com --project $PROJECT_ID)
      if [ $? -eq 0 ]; then
        echo "Successfully enabled BigQuery api for $PROJECT_ID"
      else
        echo "Error in setting up the BigQuery api for the project. $result"
        exit 1
      fi
    else
      exit 0
    fi
  fi
    # --- Check if BigQuery data set already exists, create if not ---
  echo "Checking if BigQuery data set exists..."
  if bq ls | grep "BackupandDR_reports" >/dev/null; then
    echo "Dataset BackupandDR_reports already exists for $PROJECT_ID"
  else
    echo "Creating bigQuery dataset BackupandDR_reports..."
    # --- Get dataset location from user (default: US) ---
    read -p "Enter dataset location (default: US, press Enter to use): " DATASET_LOCATION
    if [ -z "$DATASET_LOCATION" ]; then
      DATASET_LOCATION="US"
    fi
    # --- Get table expiration in days from user (default: no expiration) ---
    read -p "Enter default table expiration in days (default: no expiration, press Enter to skip): " TABLE_EXPIRATION_DAYS

    # Calculate table expiration in seconds if provided
    if [ -n "$TABLE_EXPIRATION_DAYS" ]; then
      TABLE_EXPIRATION_SECONDS=$((TABLE_EXPIRATION_DAYS * 24 * 60 * 60))
      EXPIRATION_FLAG="--default_table_expiration $TABLE_EXPIRATION_SECONDS"
    else
      EXPIRATION_FLAG=""
    fi
    result=$(bq --location=$DATASET_LOCATION mk $EXPIRATION_FLAG BackupandDR_reports)
    if [ $? -eq 0 ]; then
      echo "Created a BigQuery dataset BackupandDR_reports successfully."
    else
      echo ""
      echo "ERROR : Failed to create the BigQuery dataset."
      echo $result
      exit 1
    fi
  fi
    # --- Check if Log Sink already exists, create if not ---
  echo "Checking if Log Sink exists..."
  if gcloud logging sinks list | grep "BackupandDR_reports_sink" >/dev/null; then
    echo "Log Sink BackupandDR_reports_sink already exists for $PROJECT_ID"
  else
    log_filter="projects/$PROJECT_ID/logs/backupdr.googleapis.com%2Fgcb_*"
    echo "Creating log sink BackupandDR_reports_sink..."
    result=$(gcloud logging sinks create BackupandDR_reports_sink bigquery.googleapis.com/projects/$PROJECT_ID/datasets/BackupandDR_reports --log-filter="logName=~\"$log_filter\"")
    if [ $? -eq 0 ]; then
      echo "Created a logsink BackupandDR_reports_sink successfully."
    else
      echo ""
      echo "ERROR : Failed to create logsink."
      exit 1
    fi
  fi

  # --- Add IAM Policy binding for Cloud logging service account to write logs to BigQuery ---
  result=$(gcloud projects add-iam-policy-binding $(gcloud projects describe $PROJECT_ID --format="value(projectNumber)") --member=serviceAccount:service-$(gcloud projects describe $PROJECT_ID --format="value(projectNumber)")@gcp-sa-logging.iam.gserviceaccount.com --role=roles/bigquery.dataEditor --condition=None)
  if [ $? -eq 0 ]; then
    echo "Added permission for cloud logging to write to BigQuery datasets"
  else
    echo ""
    echo "ERROR : Failed to add permissions for cloud logging to write to BigQuery datasets. Please make sure that you have correct access rights in order to be able to proceed."
    exit 1
  fi

  echo "Setup complete. The logs for the project $PROJECT_ID will now start flowing to bigquery."
  exit 0

Save the file with a name with a Bash file extension, for example, script.sh.
Run the command bash using the file you just created. For example, bash script.sh.

You can see the created dataset in the BigQuery Studio.