Quotas and Limits

This page contains usage quota and limits that apply when using Cloud Run and Cloud Run on GKE.

The number of Cloud Run resources is limited. Quotas for Cloud Run encompass API rate limits, which affect the rate at which you can call the Cloud Run API.

Resource limits for Cloud Run

Resource Description Limit Can be increased Scope
Services Maximum number of services 1000 No per region
Revisions Maximum number of revisions 1000 No per service
Revisions Maximum number of revisions serving traffic 1000 No per region
Container instances Maximum number of container instances 1000 Yes, via GCP Support per service
Memory Maximum memory size, in GB 2 No per container instance
Disk Maximum writable, in-memory filesystem, limited by instance memory, in GB 2 No per container instance
Concurrency Maximum number of concurrent requests that can be sent to each container instance 80 No per container instance
Environment variable Maximum variable length, in Kb 32 No per variable
Request size Maximum request size, in MB 32 No per request
Response size Maximum response size, in MB 32 No per response
Timeout Maximum time before timeout in minutes 15 No per request

How to increase resource limits

To increase resource limits, contact GCP Support.

Resource limits for Cloud Run on GKE

Cloud Run on GKE is subject to the Google Kubernetes Engine quotas and limits.

The number of Cloud Run on GKE resources is limited by the configuration of the cluster as well as other dependencies. The following limits are recommended limits for a properly scaled Kubernetes Engine cluster.

Resource Description Limit Can be increased Scope
Services Maximum number of services 150 No per cluster
Revisions Maximum number of revisions* 300 No per cluster
Timeout Maximum time before timeout in minutes 10 No per request

Other resource limitations are imposed by the configuration of the Kubernetes Engine cluster that the services are running in. For example, you cannot request more memory than is available in the nodes in the cluster.

API rate Limits

The following rate limits apply to the Cloud Run API. They do not apply to the requests reaching your deployed Cloud Run services.

Quota Description Limit Increasable? Scope
Cloud Run API read requests, per user The number of invocations per 100 seconds, per user. If exceeded, all services will be paused until the next quota period 5,000 per 100 seconds Yes per region
Cloud Run API read requests per project The number of reads per 100 seconds, per client project. If exceeded, all services will be paused until the next quota period 4,000,000,000 per 100 seconds Yes per region
Cloud Run API write requests per user The number of writes per 100 seconds, per user. If exceeded, all services will be paused until the next quota period 80 per 100 seconds Yes per region
Cloud Run API write requests per project The number of writes per 100 seconds, per client project. If exceeded, all services will be paused until the next quota period 64,000,000 per 100 seconds Yes per region

How to increase API quota

To increase quotas above the defaults listed on this page:

  1. Go to the Cloud Run Quotas page

  2. Select the quota(s) you want to modify and click EDIT QUOTAS.

  3. Supply your user information if prompted, and enter the new quota limit for each quota you selected.

क्या यह पेज उपयोगी था? हमारे काम के बारे में अपनी राय दें:

निम्न के बारे में फ़ीडबैक भेजें...