Quotas and Limits

This page contains usage quota and limits that apply when using Cloud Run.

The number of Cloud Run resources is limited. Quotas for Cloud Run encompass API rate limits, which affect the rate at which you can call the Cloud Run Admin API.

There is no direct limit for:

  • the size of container images you can deploy
  • the number of concurrent requests served by a Cloud Run service.

Resource limits for Cloud Run (fully managed)

Resource Description Limit Can be increased Scope
Service Maximum number of services 1000 No per region
Revision Maximum number of revisions 1000 No per service
Revision Maximum number of revisions serving traffic 1000 No per region
Container instance Maximum number of container instances 1000 Yes per service
Container instance Maximum container startup time, in minutes 4 No per container instance
Memory Maximum memory size, in GB 2 No per container instance
CPU Maximum number of vCPU 2 No per container instance
Disk Maximum writable, in-memory filesystem, limited by instance memory, in GB 2 No per container instance
Environment variable Maximum variable length, in Kb 32 No per variable
Request Maximum number of concurrent requests 80 No per container instance
Request Maximum number of concurrent streams 100 No per HTTP/2 connection
Request Maximum time before timeout, in minutes 15 No per request
Request Maximum request size, in MB 32 No per request
Response Maximum response size, in MB 32 No per response
Domain mapping Maximum number of SSL certificates 50 No per top domain and per week
Domain mapping Maximum number of duplicate SSL certificates 5 No per top domain and per week
Access token Maximum number of unique access token generated 50 No per container instance per second
Identity token Maximum number of unique identity token generated 50 No per container instance per second

Cloud Run Admin API rate Limits

The following rate limits apply to the Cloud Run Admin API. They do not apply to the requests reaching your deployed Cloud Run (fully managed) services.

Quota Description Limit Increasable? Scope
Cloud Run Admin API read requests The number of API reads per 100 seconds per project. This is not the number of read requests to your Cloud Run (fully managed) services, which is not limited. 1,000 per 100 seconds Yes Global
Cloud Run Admin API write requests The number of API writes per 100 seconds per project. This is not the number of write requests to your Cloud Run (fully managed) services, which is not limited. 100 per 100 seconds Yes Global

How to increase quota

To increase quotas above the defaults listed on this page:

  1. Go to the Cloud Run Quotas page

  2. Select the quota(s) you want to modify and click EDIT QUOTAS.

  3. Supply your user information if prompted, and enter the new quota limit for each quota you selected.

Resource limits for Cloud Run for Anthos on Google Cloud

Cloud Run for Anthos on Google Cloud is subject to the Google Kubernetes Engine quotas and limits.

The number of Cloud Run for Anthos on Google Cloud resources is limited by the configuration of the cluster as well as other dependencies. The following limits are recommended limits for a properly scaled Kubernetes Engine cluster.

Resource Description Limit Can be increased Scope
Services Maximum number of services 150 No per cluster
Revisions Maximum number of revisions 300 No per cluster
Timeout Maximum time before timeout in minutes 10 No per request

Other resource limitations are imposed by the configuration of the Kubernetes Engine cluster that the services are running in. For example, you cannot request more memory than is available in the nodes in the cluster.