Vertex AI quotas and limits

A quota restricts how much of a shared Google Cloud resource your Google Cloud project can use, including hardware, software, and network components. Therefore, quotas are a part of a system that does the following:

  • Monitors your use or consumption of Google Cloud products and services.
  • Restricts your consumption of those resources, for reasons that include ensuring fairness and reducing spikes in usage.
  • Maintains configurations that automatically enforce prescribed restrictions.
  • Provides a means to request or make changes to the quota.

In most cases, when a quota is exceeded, the system immediately blocks access to the relevant Google resource, and the task that you're trying to perform fails. In most cases, quotas apply to each Google Cloud project and are shared across all applications and IP addresses that use that Google Cloud project.

There are also limits on Vertex AI resources. These limits are unrelated to the quota system. Limits can't be changed.

Request quotas

The following quotas apply to Vertex AI requests for a given project and supported region. For example, in a single project, you can have up to 30,000 online prediction requests per minute in one region and another 30,000 online prediction requests per minute in another supported region.

Request type Requests per minute
Resource management (CRUD) requests1 600
Job or long-running operation (LRO) submission requests 60
Online prediction requests2 30,000
Online prediction request throughput 1.5 GB
Online explain requests 600
Vertex AI TensorBoard Time Series read requests 60,000
ML Metadata (CRUD) requests 12,000
Vertex AI Vizier (CRUD) requests 6,000
Vertex AI Feature Store online serving requests 300,000
Vertex ML Metadata requests 12,000
Number of count tokens or compute tokens requests 3,000

1Resource management requests include any request that isn't a job, an LRO, an online prediction request, a Vertex AI Vizier request, an ML metadata request, a Vertex AI TensorBoard Timeseries Insights API read request, a Vertex AI Feature Store request, a Vertex AI Feature Store streaming request, or a Vector Search request.

2 This quota applies for public endpoints only. Private endpoints have unlimited requests per minute.

Jobs or LROs include the following requests:

  • Create or delete a dataset.
  • Import or export data to or from a dataset.
  • Create an endpoint.
  • Create or delete a custom job.
  • Create or delete a data-labeling job.
  • Create or delete a hyper-parameter tuning job.
  • Create or delete a batch-prediction job.
  • Create or delete a model.
  • Upload, delete, or export a model.
  • Create or delete a notebook-runtime template.
  • Assign, delete, start, or upgrade a notebook runtime.


For quota information for Generative AI models, see Generative AI on Vertex AI quotas and limits.

AutoML model quotas

The following quotas apply to each data type and objective for a given project and region. For example, in a particular project and region, you can deploy 10 AutoML image classification models and 10 AutoML image object detection models for a total of 20 deployed models.

Image

Classification

Quota Value
Concurrent training jobs 5
Concurrent training jobs with explainable AI 2
Concurrent batch prediction jobs 5
Concurrent model deployment jobs 5
Concurrent model undeployment jobs 5
Number of deployed models 10

Object detection

Quota Value
Concurrent training jobs 5
Concurrent batch prediction jobs 5
Number of deployed models 10

Tabular

Quota Value
Concurrent training jobs 5
Concurrent batch prediction jobs 5
Number of deployed models 30

Text

Classification

Quota Value
Concurrent training jobs 5
Concurrent batch prediction jobs 5
Number of deployed models 10

Entity extraction

Quota Value
Concurrent training jobs 5
Concurrent batch prediction jobs 5
Number of deployed models 10

Sentiment analysis

Quota Value
Concurrent training jobs 5
Concurrent batch prediction jobs 5
Number of deployed models 10

Video

Action Recognition

Quota Value
Concurrent training jobs 5
Concurrent batch prediction jobs 5

Classification

Quota Value
Concurrent training jobs 5
Concurrent batch prediction jobs 5

Object tracking

Quota Value
Concurrent training jobs 5
Concurrent batch prediction jobs 5

AutoML Video Intelligence specific limits

The following describes current restrictions on the use of AutoML Video Intelligence.

Type of limits Value
Maximum video length 3 hours
Maximum video file size 50GB
Minimum labels per dataset 2
Minimum videos per label 10 (1000 is recommended)
Batch input CSV file size Maximum: 100 MB
Number of video segments in batch input Maximum: 1,000

Custom-trained model quotas

The following quotas apply to Vertex AI custom-trained models for a given project and region.

Training

Quota Value
Concurrent custom training pipelines 2,000
Number of N1 and E2 CPUs for training, per region
RegionValue
us-west1 2,200
us-west2 20
us-west3 2,200
us-west4 20
us-central1 2,200
us-east1 2,200
us-east4 20
us-south1 450
northamerica-northeast1 2,200
northamerica-northeast2 20
southamerica-east1 20
southamerica-west1 20
europe-west2 2,200
europe-west1 2,200
europe-west4 2,200
europe-west6 20
europe-west3 2,200
europe-north1europe-central2 20
europe-west8 20
europe-west9 450
europe-southwest1asia-south1 20
asia-southeast1 2,200
asia-southeast2 2,200
asia-east2 2,200
asia-east1 2,200
asia-northeast1 2,200
asia-northeast2 20
australia-southeast1 2,200
australia-southeast2 20
asia-northeast3 2,200
me-west1 450
Number of N2 CPUs for training, per region
RegionValue
us-west1 20
us-west2 20
us-west3 20
us-west4 20
us-central1 450
us-east1 20
us-east4 20
us-south1 20
northamerica-northeast1 20
northamerica-northeast2 20
southamerica-east1 20
southamerica-west1 20
europe-west2 20
europe-west1 20
europe-west4 450
europe-west6 20
europe-west3 20
europe-north1europe-central2 20
europe-west8 20
europe-west9 450
europe-southwest1asia-south1 20
asia-southeast1 20
asia-southeast2 20
asia-east2 20
asia-east1 450
asia-northeast1 20
asia-northeast2 20
australia-southeast1 20
australia-southeast2 20
asia-northeast3 20
me-west1 20
Number of M1 CPUs for training, per region
RegionValue
us-west1 0
us-west2 0
us-west3 0
us-west4 0
us-central1 0
us-east1 0
us-east4 0
us-south1 0
northamerica-northeast1 0
northamerica-northeast2 0
southamerica-east1 0
southamerica-west1 0
europe-west2 0
europe-west1 0
europe-west4 0
europe-west6 0
europe-west3 0
europe-north1europe-central2 0
europe-west8 0
europe-west9 0
europe-southwest1asia-south1 0
asia-southeast1 0
asia-southeast2 0
asia-east2 0
asia-east1 0
asia-northeast1 0
asia-northeast2 0
australia-southeast1 0
australia-southeast2 0
asia-northeast3 0
me-west1 0
Number of C2 CPUs for training, per region
RegionValue
us-west1 20
us-west2 20
us-west3 20
us-west4 20
us-central1 450
us-east1 20
us-east4 20
us-south1 20
northamerica-northeast1 20
northamerica-northeast2 20
southamerica-east1 20
southamerica-west1 20
europe-west2 20
europe-west1 20
europe-west4 450
europe-west6 20
europe-west3 20
europe-north1europe-central2 20
europe-west8 20
europe-west9 450
europe-southwest1asia-south1 20
asia-southeast1 20
asia-southeast2 20
asia-east2 20
asia-east1 450
asia-northeast1 20
asia-northeast2 20
australia-southeast1 20
australia-southeast2 20
asia-northeast3 20
me-west1 20
Number of A2 CPUs for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 96
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 96
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 96
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of concurrent A3 CPUs for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 0
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 0
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 0
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of concurrent K80 GPUs for training, per region
RegionValue
us-west1 30
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 56
us-east1 30
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 30
europe-west4 Not available
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 56
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of P4 GPUs for training, per region
RegionValue
us-west1 Not available
us-west2 6
us-west3 Not available
us-west4 Not available
us-central1 6
us-east1 Not available
us-east4 1
us-south1 Not available
northamerica-northeast1 6
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 6
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 6
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 6
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of T4 GPUs for training, per region
RegionValue
us-west1 2
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 12
us-east1 2
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 6
europe-west1 Not available
europe-west4 2
europe-west6 Not available
europe-west3 0
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 1
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 6
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 1
me-west1 Not available
Number of L4 GPUs for training, per region
RegionValue
us-west1 0
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 0
us-east1 0
us-east4 0
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 0
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 0
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of P100 GPUs for training, per region
RegionValue
us-west1 30
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 56
us-east1 30
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 30
europe-west4 Not available
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 30
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 6
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of V100 GPUs for training, per region
RegionValue
us-west1 6
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 6
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 6
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 6
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of A100 GPUs for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 8
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 8
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 8
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of concurrent A100 80GB GPUs for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 0
us-east1 Not available
us-east4 0
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 0
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 0
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available

If interested, please see the quotas documentation.

Number of concurrent H100 GPUs for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 0
us-east1 Not available
us-east4 0
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 0
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available

If interested, please see the quotas documentation.

Number of TPU V2 cores for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 8
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 8
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 8
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of TPU V2 pod cores for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 Not available
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 Not available
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of TPU V3 cores for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 8
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 8
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 8
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of TPU V3 pod cores for training, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 Not available
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 Not available
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
HDD usage (GB) during training, per region
RegionValue
us-west1 180,000
us-west2 3,600
us-west3 180,000
us-west4 3,600
us-central1 180,000
us-east1 180,000
us-east4 3,600
us-south1 180,000
northamerica-northeast1 180,000
northamerica-northeast2 3,600
southamerica-east1 3,600
southamerica-west1 3,600
europe-west2 180,000
europe-west1 180,000
europe-west4 180,000
europe-west6 3,600
europe-west3 180,000
europe-north1europe-central2 3,600
europe-west8 3,600
europe-west9 180,000
europe-southwest1asia-south1 3,600
asia-southeast1 180,000
asia-southeast2 180,000
asia-east2 180,000
asia-east1 180,000
asia-northeast1 180,000
asia-northeast2 3,600
australia-southeast1 180,000
australia-southeast2 3,600
asia-northeast3 180,000
me-west1 180,000
SSD usage (GB) during training, per region
RegionValue
us-west1 75,000
us-west2 450
us-west3 75,000
us-west4 450
us-central1 75,000
us-east1 75,000
us-east4 450
us-south1 75,000
northamerica-northeast1 75,000
northamerica-northeast2 450
southamerica-east1 450
southamerica-west1 450
europe-west2 75,000
europe-west1 75,000
europe-west4 75,000
europe-west6 450
europe-west3 75,000
europe-north1europe-central2 450
europe-west8 450
europe-west9 75,000
europe-southwest1asia-south1 450
asia-southeast1 75,000
asia-southeast2 75,000
asia-east2 75,000
asia-east1 75,000
asia-northeast1 75,000
asia-northeast2 450
australia-southeast1 75,000
australia-southeast2 450
asia-northeast3 75,000
me-west1 75,000

Serving

Quota Value
Number of deployed custom models 100
Number of CPUs for serving, per region
RegionValue
us-west1 2,200
us-west2 2,200
us-west3 2,200
us-west4 16
us-central1 2,200
us-east1 2,200
us-east4 2,200
us-south1 450
northamerica-northeast1 2,200
northamerica-northeast2 450
southamerica-east1 2,200
southamerica-west1 450
europe-west2 2,200
europe-west1 2,200
europe-west4 2,200
europe-west6 2,200
europe-west3 2,200
europe-north1europe-central2 16
europe-west8 16
europe-west9 16
europe-southwest1asia-south1 16
asia-southeast1 2,200
asia-southeast2 2,200
asia-east2 2,200
asia-east1 2,200
asia-northeast1 2,200
asia-northeast2 16
australia-southeast1 2,200
australia-southeast2 16
asia-northeast3 2,200
me-west1 450
Number of K80 GPUs for serving, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 56
us-east1 30
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 30
europe-west4 Not available
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 56
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of P100 GPUs for serving, per region
RegionValue
us-west1 30
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 56
us-east1 30
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 30
europe-west4 Not available
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 30
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of P4 GPUs for serving, per region
RegionValue
us-west1 Not available
us-west2 6
us-west3 Not available
us-west4 Not available
us-central1 6
us-east1 Not available
us-east4 6
us-south1 Not available
northamerica-northeast1 6
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 6
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 6
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 6
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of T4 GPUs for serving, per region
RegionValue
us-west1 12
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 12
us-east1 12
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 12
europe-west1 Not available
europe-west4 12
europe-west6 Not available
europe-west3 0
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 6
asia-southeast2 Not available
asia-east2 12
asia-east1 6
asia-northeast1 6
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 6
me-west1 Not available
Number of L4 GPUs for serving, per region
RegionValue
us-west1 28
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 28
us-east1 28
us-east4 28
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 28
europe-west1 28
europe-west4 28
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 28
asia-southeast2 Not available
asia-east2 Not available
asia-east1 28
asia-northeast1 28
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of V100 GPUs for serving, per region
RegionValue
us-west1 6
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 6
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 6
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of A100 GPUs for serving, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 14
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 14
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 14
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 14
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 14
me-west1 14
Number of A100 80GB GPUs for serving, per region
RegionValue
us-west1 Not available
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 0
us-east1 Not available
us-east4 0
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 0
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 0
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available
Number of v5e TPU Chips for serving, per region
RegionValue
us-west1 4
us-west2 Not available
us-west3 Not available
us-west4 Not available
us-central1 Not available
us-east1 Not available
us-east4 Not available
us-south1 Not available
northamerica-northeast1 Not available
northamerica-northeast2 Not available
southamerica-east1 Not available
southamerica-west1 Not available
europe-west2 Not available
europe-west1 Not available
europe-west4 Not available
europe-west6 Not available
europe-west3 Not available
europe-north1europe-central2 Not available
europe-west8 Not available
europe-west9 Not available
europe-southwest1asia-south1 Not available
asia-southeast1 Not available
asia-southeast2 Not available
asia-east2 Not available
asia-east1 Not available
asia-northeast1 Not available
asia-northeast2 Not available
australia-southeast1 Not available
australia-southeast2 Not available
asia-northeast3 Not available
me-west1 Not available

Vertex AI Feature Store

This section lists the quotas and limits for the following:

Vertex AI Feature Store

The following quotas apply to a given project and region. For example, in a single project, you can have 200 online serving nodes in us-central1 and another 50 nodes in us-east4.

Quota Value
Online serving requests per minute 300,000
Maximum number of FeatureOnlineStore instances 10
Maximum number of search requests per minute 6,000,000

Vertex AI Feature Store also has the following limits. You can't request an increase to any of the limits in the following table:

Limit Value
Maximum number of FeatureGroup resources in a project and location 25
Maximum number of Feature resources within a FeatureGroup 10,000
Maximum number of FeatureView instances across all FeatureOnlineStore instances 30
Maximum size of feature data per entity 5 MB
Number of entity IDs per online serving request (FetchFeatureValues) 1

Vertex AI Feature Store (Legacy)

The following quotas apply to a given project and region. For example, in a single project, you can have 75 concurrent batch jobs in us-central1 and another 75 jobs in europe-west4.

Quota Value
Online serving requests per minute 300,000
Streaming ingestion requests per minute 60,000
Streaming ingestion write throughput per minute 1.2 GB
Feature creation requests per minute 100
Online serving nodes across all featurestores 30
Concurrent batch jobs (ingestion, serving, and delete feature values combined) 75
Concurrent requests to delete feature values 1
Entity types across all featurestores 75

Vertex AI Feature Store (Legacy) also has the following limits. You can't request an increase to any of the limits in the following table:

Limit Value
Storage limit for an online serving node 5 TB
Total data in the offline store Unlimited
Features per entity type 5,000
Number of create, update, and delete featurestore requests per day per project per region 500
For streaming ingestion, the size per request 1 MB
For streaming read, the number of entities that can be included per request 100
For batch import, the number of files that can be included per request 5,000 for Avro or 500 for CSV
For batch serving and exports, the number of features you can request 5,000
For batch ingestion and streaming ingestion, the oldest timestamp for which feature data can be ingested 4,000 days from current date

The data retention limit in Vertex AI Feature Store (Legacy) has the following default value, which you can override:

Data retention defaults Default value
Data retention in offline store (oldest feature value timestamp after which the values are deleted) 4,000 days from the current date
Data retention in online store (oldest feature value timestamp after which the values are deleted) 4,000 days from the current date

You can override the data retention limit in the following ways:

  • To override the data retention limit for the online store, set the online_storage_ttl_days parameter while creating or updating a featurestore.

  • To override the data retention limit for the offline store, set the offline_storage_ttl_days parameter while creating or updating an entity type.

Vector Search

The following quotas apply to Vector Search for a given project in each region.

Quota Value
Concurrent index creation operations 5
Concurrent index update operations 5
Number of deployed index nodes 50
Number of deployed index N2D nodes 5
Number of Index 100
Streaming Update requests per minute 6,000
Streaming Update throughput(in KB) per minute 120,000

Vertex AI Pipelines

The following quotas and limits apply to Vertex AI Pipelines for a given project in each region.

Quota Value
Running pipeline tasks in parallel* 600
Concurrent pipeline runs* 300

* Pipeline run / task requests beyond this limit are queued until resources are available.

Vertex AI Pipelines have the following limits. Note that, unlike quotas, you can't request an increase to a limit.

Limit Value
Number of pipeline tasks per job 10,000
Input and output artifacts per pipeline task 100
Input and output artifacts per pipeline job 10,000
Maximum size of JSON payload containing output parameters and artifacts per pipeline task 131,072 bytes

Vertex AI Decision Optimization

The following quotas and limits apply to Vertex AI Decision Optimization for a given project in each region.

Quota Value
Solve request per minute 30

Colab Enterprise quotas and limits

Colab Enterprise quotas and limits are listed separately. See Colab Enterprise quotas and limits.

Quota increases

If you want to increase any of your quotas for Vertex AI, you can use the Google Cloud console to request a quota increase.

For more information about submitting a quota increase request, see the following sections of Working with quotas:

What's next