Serve traffic from multiple regions

You return faster responses to your users around the world by deploying services in multiple regions and routing your users to the nearest region. Deploying across multiple regions delivers low latency and higher availability in case of regional outages.

Because Cloud Run services deploy into individual regions, you must deploy your service to multiple regions and then configure global load balancing for the service.

You can automate cross-regional failover using Cloud Run service health.

Before you begin

Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Go to project selector

Verify that billing is enabled for your Google Cloud project.

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Roles required to select or create a project

Select a project: Selecting a project doesn't require a specific IAM role—you can select any project that you've been granted a role on.
Create a project: To create a project, you need the Project Creator role (roles/resourcemanager.projectCreator), which contains the resourcemanager.projects.create permission. Learn how to grant roles.

Go to project selector

Verify that billing is enabled for your Google Cloud project.

Set up your Cloud Run development environment in your Google Cloud project.
Install and initialize the gcloud CLI.
Ensure your account has the following IAM roles:
- Cloud Run Admin (roles/run.admin)
- Project IAM Admin (roles/resourcemanager.projectIamAdmin)
- Service Usage Consumer (roles/serviceusage.serviceUsageConsumer)

Grant the roles

Console

In the Google Cloud console, go to the IAM page.
Go to IAM
Select the project.
Click Grant access.
In the New principals field, enter your user identifier. This is typically the Google Account email address used to deploy the Cloud Run service.
In the Select a role list, select a role.
To grant additional roles, click Add another role and select each additional role.
Click Save.

gcloud

To grant the required IAM roles to your account on your project:

        gcloud projects add-iam-policy-binding PROJECT_ID \
            --member=PRINCIPAL \
            --role=ROLE

Replace:

PROJECT_NUMBER with your Google Cloud project number.
PROJECT_ID with your Google Cloud project ID.
PRINCIPAL with the account you are adding the binding for. This is typically the Google Account email address that is used to deploy the Cloud Run service.
ROLE with the role you are adding to the deployer account.

Review the Cloud Run pricing page. To generate a cost estimate based on your projected usage, use the pricing calculator.
Enable the Artifact Registry, Cloud Build, Cloud Run Admin API, Compute Engine, and Network Services API APIs by running the following command:

  gcloud services enable artifactregistry.googleapis.com \
    cloudbuild.googleapis.com \
    run.googleapis.com \
    compute.googleapis.com \
    networkservices.googleapis.com

Deploy the service to multiple regions

Scaling parameters that you configure apply across multiple regions. In a multi-region deployment, for example, the minimum instances value applies for each of the multiple regions.

You deploy the same service to multiple regions using one of the following methods:

Repeat the steps to deploy to a single region.
Deploy a multi-region service.

Deploy a multi-region service

This section shows you how to deploy and configure a multi-region service from a single gcloud CLI command or using a YAML or Terraform file.

gcloud

To create and deploy a multi-region service, run the gcloud run deploy command using the --regions flag:
```
gcloud run deploy SERVICE_NAME \
  --image=IMAGE_URL \
  --regions=REGIONS
```
Replace the following:
- SERVICE_NAME: The name of the multi-region service that you want to deploy.
- IMAGE_URL: A reference to the container image, for example, us-docker.pkg.dev/cloudrun/container/hello:latest.
- REGIONS: The list of multiple regions that you want to deploy to. For example, europe-west1,asia-east1.

YAML

Create the YAML file for your service, using the run.googleapis.com/regions attribute to set the multiple regions that you want to deploy your service to:
```
apiVersion: serving.knative.dev/v1
kind: Service
metadata:
  name: SERVICE_NAME
  annotations:
    run.googleapis.com/regions: REGIONS
spec:
  template:
    spec:
      containers:
      - image: IMAGE_URL
```
Replace the following:
- SERVICE_NAME: The name of the multi-region service that you want to deploy to.
- REGIONS: The list of multiple regions that you want to update. For example, europe-west1,asia-east1.
- IMAGE_URL: A reference to the container image, for example, us-docker.pkg.dev/cloudrun/container/hello:latest.

Create the service using the following command:

gcloud run multi-region-services replace service.yaml

Terraform

To learn how to apply or remove a Terraform configuration, see Basic Terraform commands.

Add the following to a google_cloud_run_v2_service resource in your Terraform configuration.

resource "google_cloud_run_v2_service" "default" {
  name     = "cloudrun-service-multi-region"
  regions = [
    "REGION_1",
    "REGION_2",
  ]

  template {
    containers {
      image = "us-docker.pkg.dev/cloudrun/container/hello"
    }
  }
}

Replace "REGION_1" and "REGION_2" with each of the needed Google Cloud regions. For example, europe-west1 and us-central1.

Update a multi-region service

This section shows you how to add or remove regions from a multi-region service from a single gcloud CLI command or a YAML file.

gcloud

To add or remove regions from a multi-region service, run the gcloud run multi-region-services update command.

To add the multi-region service to an additional region or regions, use the --add-regions flag:
```
gcloud run multi-region-services update SERVICE_NAME \
  --add-regions=REGIONS
```
To remove the multi-region service from a region or regions, use the --remove-regions flag:
```
gcloud run multi-region-services update SERVICE_NAME \
  --remove-regions=REGIONS
```
Replace the following:
- SERVICE_NAME: The name of the multi-region service that you want to update.
- REGIONS: The region or regions that you want to add your service to or remove your service from. For example, us-central1,asia-east1.

YAML

To update an existing multi-region service, download its YAML configuration:

gcloud run multi-region-services describe SERVICE_NAME --format export > service.yaml

Update the run.googleapis.com/regions attribute to add or remove the list of regions that you want the service to deploy to:
```
apiVersion: serving.knative.dev/v1
kind: Service
metadata:
  name: SERVICE_NAME
  annotations:
    run.googleapis.com/regions: REGIONS
```
Replace the following:
- SERVICE_NAME: The name of the multi-region service that you want to deploy to.
- REGIONS: The new list of multiple regions that you want the service revision to deploy to.

Update the service using the following command:

gcloud run multi-region-services replace service.yaml

Delete a multi-region service

To delete a multi-region service, run the gcloud run multi-region-services delete command:
```
gcloud run multi-region-services delete SERVICE_NAME
```
Replace SERVICE_NAME with the name of the multi-region service that you want to delete.

Configure a global external Application Load Balancer

This section shows you how to configure a global external Application Load Balancer with a domain secured with a managed TLS certificate pointing to a global anycast IP address, which routes users to the nearest Google data center that deploys your service.

The architecture described in the following sections does not automatically route requests to a different region when a regional Cloud Run service becomes unresponsive or returns errors.

To increase the availability of your multi-regional service:

Automate cross-region failover with Cloud Run service health
Configure outlier detection to identify unhealthy Cloud Run services based on their HTTP error rate and diverge some requests to another region.

Create a global external Application Load Balancer

Creating a global external Application Load Balancer involves creating various networking resources and connecting them together:

gcloud

Reserve a static IP address so you don't have to update your DNS records when you recreate your load balancer.
```
gcloud compute addresses create --global SERVICE_IP
```
In the command above, replace SERVICE_IP with a name for the IP address resource (e.g. myservice-ip).
This IP address is a global anycast IPv4 address that routes to the Google datacenter or point of presence closest to your visitors.
Create a backend service.
```
gcloud compute backend-services create \
  --global BACKEND_NAME \
  --load-balancing-scheme=EXTERNAL_MANAGED
```
Replace BACKEND_NAME with a name you want to give to the backend service. For example, myservice-backend.
Create a URL map.
```
gcloud compute url-maps create URLMAP_NAME --default-service=BACKEND_NAME
```
Replace URLMAP_NAME with a name you want to give to the URL map (e.g. myservice-urlmap).
Create a managed TLS certificate for your domain to serve HTTPS traffic. (Replace example.com with your domain name.)
```
gcloud compute ssl-certificates create CERT_NAME \
  --domains=example.com
```
Replace CERT_NAME with the name you want the managed SSL certificate to have (e.g. myservice-cert).
Create a target HTTPS proxy.
```
gcloud compute target-https-proxies create HTTPS_PROXY_NAME \
  --ssl-certificates=CERT_NAME \
  --url-map=URLMAP_NAME
```
Replace HTTPS_PROXY_NAME with the name you want to give to the target HTTPS proxy (e.g. myservice-https).

Create a forwarding rule connecting the networking resources you created to the IP address.

gcloud compute forwarding-rules create --global FORWARDING_RULE_NAME \
  --target-https-proxy=HTTPS_PROXY_NAME \
  --address=SERVICE_IP \
  --ports=443 \
  --load-balancing-scheme=EXTERNAL_MANAGED

Replace FORWARDING_RULE_NAME with the name of the forwarding rule resource you want to create. For example, myservice-lb.

Terraform

Alternatively to the steps described in this section, you can use the Global HTTP Load Balancer Terraform Module.

To learn how to apply or remove a Terraform configuration, see Basic Terraform commands.

Configure the IP address:

resource "google_compute_global_address" "lb_default" {
  provider = google-beta
  name     = "myservice-service-ip"

  # Use an explicit depends_on clause to wait until API is enabled
  depends_on = [
    google_project_service.compute_api
  ]
}
output "load_balancer_ip_addr" {
  value = google_compute_global_address.lb_default.address
}

Configures your IP address resource name to be myservice-service-ip. You can change this to your own value. This IP address is a global anycast IPv4 address that routes to the Google data center or point of presence closest to your visitors.

Create and configure the backend service:

resource "google_compute_backend_service" "lb_default" {
  provider              = google-beta
  name                  = "myservice-backend"
  load_balancing_scheme = "EXTERNAL_MANAGED"

  backend {
    group = google_compute_region_network_endpoint_group.lb_default[0].id
  }

  backend {
    group = google_compute_region_network_endpoint_group.lb_default[1].id
  }

  # Use an explicit depends_on clause to wait until API is enabled
  depends_on = [
    google_project_service.compute_api,
  ]
}

This resource configures the backend service to be named myservice-backend. You can change this to your own value.

Configure the URL map:

resource "google_compute_url_map" "lb_default" {
  provider        = google-beta
  name            = "myservice-lb-urlmap"
  default_service = google_compute_backend_service.lb_default.id

  path_matcher {
    name            = "allpaths"
    default_service = google_compute_backend_service.lb_default.id
    route_rules {
      priority = 1
      url_redirect {
        https_redirect         = true
        redirect_response_code = "MOVED_PERMANENTLY_DEFAULT"
      }
    }
  }
}

Connects the backend service resource (myservice-backend) to the new URL map resource (myservice-lb-urlmap). You can change these to your own values.

Create a managed TLS certificate for your domain to serve HTTPS traffic. Replace example.com with your domain name in the google_compute_managed_ssl_certificate resource:

resource "google_compute_managed_ssl_certificate" "lb_default" {
  provider = google-beta
  name     = "myservice-ssl-cert"

  managed {
    domains = ["example.com"]
  }
}

Configure the HTTPS proxy:

resource "google_compute_target_https_proxy" "lb_default" {
  provider = google-beta
  name     = "myservice-https-proxy"
  url_map  = google_compute_url_map.lb_default.id
  ssl_certificates = [
    google_compute_managed_ssl_certificate.lb_default.name
  ]
  depends_on = [
    google_compute_managed_ssl_certificate.lb_default
  ]
}

Creates google_compute_target_https_proxy resource with target name myservice-https-proxy and connects previously created TLS certificate (myservice-ssl-cert) and URL mapping resources (myservice-lb-urlmap). You can change these to your own values.

Configure the forwarding rule:

resource "google_compute_global_forwarding_rule" "lb_default" {
  provider              = google-beta
  name                  = "myservice-lb-fr"
  load_balancing_scheme = "EXTERNAL_MANAGED"
  target                = google_compute_target_https_proxy.lb_default.id
  ip_address            = google_compute_global_address.lb_default.id
  port_range            = "443"
  depends_on            = [google_compute_target_https_proxy.lb_default]
}

Creates google_compute_global_forwarding_rule resource with target name myservice-https-proxy and connects previously created HTTPS proxy target (myservice-https-proxy) and IP address resource (myservice-service-ip). You can change these to your own values.

Apply this config:

To apply your Terraform configuration in a Google Cloud project, complete the steps in the following sections.

Prepare Cloud Shell
1. Launch Cloud Shell.
2. Set the default Google Cloud project where you want to apply your Terraform configurations.
  
  You only need to run this command once per project, and you can run it in any directory.
```
export GOOGLE_CLOUD_PROJECT=PROJECT_ID
```
  Environment variables are overridden if you set explicit values in the Terraform configuration file.
Prepare the directory

Each Terraform configuration file must have its own directory (also called a root module).
1. In Cloud Shell, create a directory and a new file within that directory. The filename must have the .tf extension—for example main.tf. In this tutorial, the file is referred to as main.tf.
```
mkdir DIRECTORY && cd DIRECTORY && touch main.tf
```
2. If you are following a tutorial, you can copy the sample code in each section or step.
  
  Copy the sample code into the newly created main.tf.
  
  Optionally, copy the code from GitHub. This is recommended when the Terraform snippet is part of an end-to-end solution.
3. Review and modify the sample parameters to apply to your environment.
4. Save your changes.
5. Initialize Terraform. You only need to do this once per directory.
```
terraform init
```
  Optionally, to use the latest Google provider version, include the -upgrade option:
```
terraform init -upgrade
```
Apply the changes
1. Review the configuration and verify that the resources that Terraform is going to create or update match your expectations:
```
terraform plan
```
  Make corrections to the configuration as necessary.
2. Apply the Terraform configuration by running the following command and entering yes at the prompt:
```
terraform apply
```
  Wait until Terraform displays the "Apply complete!" message.
3. Open your Google Cloud project to view the results. In the Google Cloud console, navigate to your resources in the UI to make sure that Terraform has created or updated them.
Note: Terraform samples typically assume that the required APIs are enabled in your Google Cloud project.

Configure regional network endpoint groups

For each region you deployed to in the previous step, you must create serverless network endpoint groups (NEGs) and add them to the backend service using the following instructions:

gcloud CLI

Create a network endpoint group for the Cloud Run service in REGION:
```
gcloud compute network-endpoint-groups create NEG_NAME \
  --region=REGION \
  --network-endpoint-type=serverless \
  --cloud-run-service=SERVICE_NAME
```
Replace the following:
- NEG_NAME with the name of the network endpoint group resource. (for example, myservice-neg-uscentral1)
- REGION with the region your service is deployed in.
- SERVICE_NAME with the name of your service.

Add the network endpoint group to the backend service:

gcloud compute backend-services add-backend --global BACKEND_NAME \
  --network-endpoint-group-region=REGION \
  --network-endpoint-group=NEG_NAME

Specify the NEG_NAME you created in the previous step for the region.

Repeat the preceding steps for each region.

Terraform

To learn how to apply or remove a Terraform configuration, see Basic Terraform commands.

Configure a network endpoint group with name myservice-neg for the Cloud Run service for each region specified in run_regions variable:

resource "google_compute_region_network_endpoint_group" "lb_default" {
  provider              = google-beta
  count                 = length(local.run_regions)
  name                  = "myservice-neg"
  network_endpoint_type = "SERVERLESS"
  region                = local.run_regions[count.index]
  cloud_run {
    service = google_cloud_run_v2_service.run_default[count.index].name
  }
}

Configure a backend service to attach the network endpoint group (myservice-neg):

resource "google_compute_backend_service" "lb_default" {
  provider              = google-beta
  name                  = "myservice-backend"
  load_balancing_scheme = "EXTERNAL_MANAGED"

  backend {
    group = google_compute_region_network_endpoint_group.lb_default[0].id
  }

  backend {
    group = google_compute_region_network_endpoint_group.lb_default[1].id
  }

  # Use an explicit depends_on clause to wait until API is enabled
  depends_on = [
    google_project_service.compute_api,
  ]
}

Configure DNS records on your domain

To point your domain name to the forwarding rule you created, update its DNS records with the IP address that you created.

Find the reserved IP address of the load balancer by running the following command:
```
gcloud compute addresses describe SERVICE_IP \
  --global \
  --format='value(address)'
```
Replace SERVICE_IP with the name of the IP address you created previously. This command prints the IP address to the output.
Update your domain's DNS records by adding an A record with this IP address.

Configure custom audience if using authenticated services

Authenticated services are protected by IAM. Such Cloud Run services require client authentication that declares the intended recipient of a request at credential generation time (the audience).

Audience is usually the full URL of the target service, which by default for Cloud Run services is a generated URL ending in run.app. However, in a multi-region deployment, a client cannot know in advance which regional service a request will be routed to. So, for a multi-region deployment, configure your service to use custom audiences.

Wait for load balancer to provision

After configuring the domain with the load balancer IP address, wait for DNS records to propagate. Similarly, wait for the managed TLS certificate to be issued for your domain and to be ready to start serving HTTPS traffic globally.

It might take up to 30 minutes for your load balancer to start serving traffic.

After it is ready, visit your website's URL with https:// prefix to try it out.

Verify status

To check the status of your DNS record propagation, use the dig command-line utility:
```
dig A +short example.com
```
The output shows the IP address that you configured in your DNS records.
Check the status of your managed certificate issuance by running the following command:
```
gcloud compute ssl-certificates describe CERT_NAME
```
Replace CERT_NAME with the name you previously chose for the SSL certificate resource.

The output shows a line containing status: ACTIVE.

Set up HTTP-to-HTTPS redirect

By default, a forwarding rule only handles a single protocol and therefore requests to your http:// endpoints respond with "404 Not Found". If you need requests to your http:// URLs to redirect to the https:// protocol, create an additional URL map and a forwarding rule using the following instructions:

gcloud CLI

Create a URL map with a redirect rule.

gcloud compute url-maps import HTTP_URLMAP_NAME \
  --global \
  --source /dev/stdin <<EOF
        name: HTTP_URLMAP_NAME
        defaultUrlRedirect:
          redirectResponseCode: MOVED_PERMANENTLY_DEFAULT
          httpsRedirect: True
        EOF

Replace the HTTP_URLMAP_NAME with the name of the URL map resource you will create (for example, myservice-httpredirect).

Create a target HTTP proxy with the URL map.
```
gcloud compute target-http-proxies create HTTP_PROXY_NAME \
  --url-map=HTTP_URLMAP_NAME
```
Replace HTTP_PROXY_NAME with the name of the target HTTP proxy you will create (for example, myservice-http).

Create a forwarding rule on port 80 with the same reserved IP address.

gcloud compute forwarding-rules create --global HTTP_FORWARDING_RULE_NAME \
  --target-http-proxy=HTTP_PROXY_NAME \
  --address=SERVICE_IP \
  --ports=80

Replace HTTP_FORWARDING_RULE_NAME with the name of the new forwarding rule you will create (for example, myservice-httplb).

Terraform

To learn how to apply or remove a Terraform configuration, see Basic Terraform commands.

Create a URL map resource with a redirect rule:

resource "google_compute_url_map" "https_default" {
  provider = google-beta
  name     = "myservice-https-urlmap"

  default_url_redirect {
    redirect_response_code = "MOVED_PERMANENTLY_DEFAULT"
    https_redirect         = true
    strip_query            = false
  }
}

Create a target HTTP proxy with the newly created URL map resource (myservice-https-urlmap):

resource "google_compute_target_http_proxy" "https_default" {
  provider = google-beta
  name     = "myservice-http-proxy"
  url_map  = google_compute_url_map.https_default.id

  depends_on = [
    google_compute_url_map.https_default
  ]
}

Create a forwarding rule on port 80 with the same reserved IP address resource (myservice-http-proxy):

resource "google_compute_global_forwarding_rule" "https_default" {
  provider   = google-beta
  name       = "myservice-https-fr"
  target     = google_compute_target_http_proxy.https_default.id
  ip_address = google_compute_global_address.lb_default.id
  port_range = "80"
  depends_on = [google_compute_target_http_proxy.https_default]
}

For additional configuration options, see Set up a global external Application Load Balancer with Cloud Run.

Automate cross-regional failover with Cloud Run service health

Cloud Run service health minimizes service disruptions and automates cross-region failover and failback. Set up a multi-region, high availability Cloud Run service with automated failover and failback capabilities for internal traffic.

Limitations

The following limitations apply to Cloud Run service health:

You must configure at least one service-level or revision-level minimum instance per region to calculate health. You can also use the Container instance count metric in Cloud Monitoring to estimate the required minimum instances for your regions.
Failovers require at least two services from different regions. Otherwise, if one service fails, the error message no healthy upstream is displayed.
Cloud Run service health doesn't support cross-region internal Application Load Balancers with more than 5 serverless NEG backends.
You can't configure a URL mask or tags in serverless NEGs.
You can't enable IAP from a backend service or load balancer. Enable IAP directly from Cloud Run.
If a Cloud Run service is deleted, Cloud Run doesn't report an unhealthy status to the load balancer.
Starting a new instance won't count the first readiness probe, so a request might briefly route to a newly started service before becoming unhealthy.
Cloud Run service health is computed across all instances. Revisions without probes are treated as unknown. The load balancer treats unknown instances as healthy.

Report regional health status

To aggregate regional Cloud Run service health and report a healthy or unhealthy status to the load balancer, perform the following steps:

Deploy a Cloud Run service revision in multiple regions with one or more minimum instances. Run the following command to use the readiness probe that you configured in the previous step:
```
gcloud beta run deploy SERVICE_NAME \
--regions=REGION_A,REGION_B \
--min=MIN_INSTANCES
```
Replace the following:
- SERVICE_NAME: the name of the service.
- REGION_A, REGION_B: different regions for your service revision. For example, set REGION_A to us-central1 and REGION_B to europe-west1.
- MIN_INSTANCES: the number of container instances to be kept warm, ready to receive requests. You must set the minimum value to 1 or more.
Configure a gRPC or HTTP readiness probe set up on each container instance.
Configure a cross-region internal Application Load Balancer to shift traffic away from unhealthy regions.
Set up Serverless NEGs for each Cloud Run service in each region.
Configure a backend service to connect with serverless NEGs.

Best practices

You can use a combination of readiness probes, traffic splitting, and minimum instances to perform safe, gradual rollouts. This lets you verify the health of a new revision in a single "canary" region before promoting it, ensuring that the load balancer only sends traffic to healthy regional backends.

Recommended rollout process

You can roll out a service revision on an existing Cloud Run service that's not using a readiness probe or Cloud Run service health. Follow this process one region at a time to safely deploy a new revision:

Deploy the new revision in a single "canary" region with a readiness probe configured.
Send a small amount of traffic (for example, 1%) to the new revision.
Use non-zero minimum instances at the service level, rather than at the revision level.
Check the readiness probe metric (run.googleapis.com/container/instance_count_with_readiness) to ensure that new instances are healthy.
In incremental steps, increase the traffic percentage to the new revision. As you ramp up, monitor the regional Cloud Run service health metric (run.googleapis.com/service_health_count), which is used by the load balancer. Cloud Run service health reports UNKNOWN until enough traffic is routed to the new revision.
Once the revision receives 100% of traffic and the regional Cloud Run service health is stable and healthy, repeat this process for all other regions.

Monitor health checks

After you set up Cloud Run service health, serverless NEGs collect the Cloud Monitoring service health metric. You can view the health status for the existing regional services. The following diagram shows how these Cloud Run service health components respond to requests to your service:

Illustration of Cloud Run service health components

If a service in a region is unhealthy, the load balancer diverts traffic from the unhealthy region to a healthy region. Traffic recovers after the region becomes healthy again.

Use authenticated Pub/Sub push subscriptions with multi-region deployment

A Pub/Sub service by default delivers messages to push endpoints in the same Google Cloud region where the Pub/Sub service stores the messages. For a workaround to this behavior, refer to Using an authenticated Pub/Sub push subscription with a multi-region Cloud Run deployment.

Configure a manual failover

To manually configure traffic to fail over to a healthy region, modify the global external Application Load Balancer URL map.

To update the global external Application Load Balancer URL map, remove the NEG from the backend service, using the --global flag:
```
gcloud compute backend-services remove-backend BACKEND_NAME \
--network-endpoint-group=NEG_NAME \
--network-endpoint-group-region=REGION \
--global
```
Replace the following:
- BACKEND_NAME: The name of the backend service.
- NEG_NAME: The name of the network endpoint group resource, for example, myservice-neg-uscentral1.
- REGION: The region where the NEG was created and where you want to remove your service from. For example, us-central1,asia-east1.
To confirm that a healthy region is now serving traffic, navigate to https://<domain-name>.

What's next

Learn how to configure health checks for Cloud Run services, including readiness probes.

Check out a Cloud Run readiness probe and service health code sample written in Go.

Serve traffic from multiple regions

Before you begin

Grant the roles

Console

gcloud

Deploy the service to multiple regions

Deploy a multi-region service

gcloud

YAML

Terraform

Update a multi-region service

gcloud

YAML

Delete a multi-region service

Configure a global external Application Load Balancer

Create a global external Application Load Balancer

gcloud

Terraform

Prepare Cloud Shell

Prepare the directory

Apply the changes

Configure regional network endpoint groups

gcloud CLI

Terraform

Configure DNS records on your domain

Configure custom audience if using authenticated services

Wait for load balancer to provision

Verify status

Set up HTTP-to-HTTPS redirect

gcloud CLI

Terraform

Automate cross-regional failover with Cloud Run service health

Limitations

Report regional health status

Best practices

Recommended rollout process

Monitor health checks

Use authenticated Pub/Sub push subscriptions with multi-region deployment

Configure a manual failover

What's next