A quota restricts how much of a shared Google Cloud resource your
Google Cloud project can use, including hardware, software, and network
components. Therefore, quotas are a part of a system that does the
following:
Monitors your use or consumption of Google Cloud products and
services.
Restricts your consumption of those resources, for reasons that include
ensuring fairness and reducing spikes in usage. Maintains
configurations that automatically enforce prescribed restrictions.
Provides a means to request or make changes to the quota.
In most cases, when a quota is exceeded, the system immediately blocks
access to the relevant Google resource, and the task that you're trying to
perform fails. In most cases, quotas apply to each Google Cloud project
and are shared across all applications and IP addresses that use that
Google Cloud project.
There are also limits on Vertex AI resources. These limits are
unrelated to the quota system. Limits can't be changed.
Request quotas
The following quotas apply to Vertex AI requests for a given
project and supported region. For example, in a single project, you can have up
to 30,000 online prediction requests per minute in one region and another 30,000
online prediction requests per minute in another supported region.
Request type
Requests per minute
Resource management (CRUD) requests1
600
Job or long-running operation (LRO) submission requests
60
Online prediction requests2
30,000
Online prediction request throughput
1.5 GB
Online explain requests
600
Vertex AI TensorBoard Time Series read requests
60,000
ML Metadata (CRUD) requests
12,000
Vertex AI Vizier (CRUD) requests
6,000
Vertex AI Feature Store online serving requests
300,000
Vertex ML Metadata requests
12,000
Number of count tokens or compute tokens requests
3,000
1 Resource management requests include any request that isn't a
job, an LRO, an online prediction request, a
Vertex AI Vizier request, an ML metadata request, a Vertex AI TensorBoard
Timeseries Insights API read request, a Vertex AI Feature Store
request, a Vertex AI Feature Store streaming request, or a
Vector Search request.
2 This quota applies for public endpoints only. Private endpoints
have unlimited requests per minute.
Jobs or LROs include the following requests:
Create or delete a dataset.
Import or export data to or from a dataset.
Create an endpoint.
Create or delete a custom job.
Create or delete a data-labeling job.
Create or delete a hyper-parameter tuning job.
Create or delete a batch-prediction job.
Create or delete a model.
Upload, delete, or export a model.
Create or delete a notebook-runtime template.
Assign, delete, start, or upgrade a notebook runtime.
Create, delete, or update a model monitor.
Create or delete a model monitoring job.
For quota information for Generative AI models, see
Generative AI on Vertex AI quotas and limits .
AutoML model quotas
The following quotas apply to each data type and objective for a given project
and region. For example, in a particular project and region, you can deploy 10
AutoML image classification models and 10 AutoML image object
detection models for a total of 20 deployed models.
Image
Classification
Quota
Value
Concurrent training jobs
5
Concurrent training jobs with explainable AI
2
Concurrent batch prediction jobs
5
Concurrent model deployment jobs
5
Concurrent model undeployment jobs
5
Number of deployed models
10
Object detection
Quota
Value
Concurrent training jobs
5
Concurrent batch prediction jobs
5
Number of deployed models
10
Tabular
Quota
Value
Concurrent training jobs
5
Concurrent batch prediction jobs
5
Number of deployed models
30
Text
Classification
Quota
Value
Concurrent training jobs
5
Concurrent batch prediction jobs
5
Number of deployed models
10
Quota
Value
Concurrent training jobs
5
Concurrent batch prediction jobs
5
Number of deployed models
10
Sentiment analysis
Quota
Value
Concurrent training jobs
5
Concurrent batch prediction jobs
5
Number of deployed models
10
Video
Action Recognition
Quota
Value
Concurrent training jobs
5
Concurrent batch prediction jobs
5
Classification
Quota
Value
Concurrent training jobs
5
Concurrent batch prediction jobs
5
Object tracking
Quota
Value
Concurrent training jobs
5
Concurrent batch prediction jobs
5
AutoML Video Intelligence specific limits
The following describes current restrictions on the use of
AutoML Video Intelligence.
Type of limits
Value
Maximum video length
3 hours
Maximum video file size
50GB
Minimum labels per dataset
2
Minimum videos per label
10 (1000 is recommended)
Batch input CSV file size
Maximum: 100 MB
Number of video segments in batch input
Maximum: 1,000
Custom-trained model quotas
The following quotas apply to Vertex AI custom-trained models for a
given project and region.
Note: The number of CPUs or GPUs are not counted against Compute Engine
quotas. They are counted separately and can't be used together.
Training
Quota
Value
Concurrent custom training pipelines
2,000
Number of N1 and E2 CPUs for training, per region Region Value us-west1
2,200
us-west2
20
us-west3
2,200
us-west4
20
us-central1
2,200
us-east1
2,200
us-east4
20
us-south1
450
northamerica-northeast1
2,200
northamerica-northeast2
20
southamerica-east1
20
southamerica-west1
20
europe-west2
2,200
europe-west1
2,200
europe-west4
2,200
europe-west6
20
europe-west3
2,200
europe-north1europe-central2
20
europe-west8
20
europe-west9
450
europe-southwest1asia-south1
20
asia-southeast1
2,200
asia-southeast2
2,200
asia-east2
2,200
asia-east1
2,200
asia-northeast1
2,200
asia-northeast2
20
australia-southeast1
2,200
australia-southeast2
20
asia-northeast3
2,200
me-west1
450
Number of N2 CPUs for training, per region Region Value us-west1
20
us-west2
20
us-west3
20
us-west4
20
us-central1
450
us-east1
20
us-east4
20
us-south1
20
northamerica-northeast1
20
northamerica-northeast2
20
southamerica-east1
20
southamerica-west1
20
europe-west2
20
europe-west1
20
europe-west4
450
europe-west6
20
europe-west3
20
europe-north1europe-central2
20
europe-west8
20
europe-west9
450
europe-southwest1asia-south1
20
asia-southeast1
20
asia-southeast2
20
asia-east2
20
asia-east1
450
asia-northeast1
20
asia-northeast2
20
australia-southeast1
20
australia-southeast2
20
asia-northeast3
20
me-west1
20
Number of M1 CPUs for training, per region Region Value us-west1
0
us-west2
0
us-west3
0
us-west4
0
us-central1
0
us-east1
0
us-east4
0
us-south1
0
northamerica-northeast1
0
northamerica-northeast2
0
southamerica-east1
0
southamerica-west1
0
europe-west2
0
europe-west1
0
europe-west4
0
europe-west6
0
europe-west3
0
europe-north1europe-central2
0
europe-west8
0
europe-west9
0
europe-southwest1asia-south1
0
asia-southeast1
0
asia-southeast2
0
asia-east2
0
asia-east1
0
asia-northeast1
0
asia-northeast2
0
australia-southeast1
0
australia-southeast2
0
asia-northeast3
0
me-west1
0
Number of C2 CPUs for training, per region Region Value us-west1
20
us-west2
20
us-west3
20
us-west4
20
us-central1
450
us-east1
20
us-east4
20
us-south1
20
northamerica-northeast1
20
northamerica-northeast2
20
southamerica-east1
20
southamerica-west1
20
europe-west2
20
europe-west1
20
europe-west4
450
europe-west6
20
europe-west3
20
europe-north1europe-central2
20
europe-west8
20
europe-west9
450
europe-southwest1asia-south1
20
asia-southeast1
20
asia-southeast2
20
asia-east2
20
asia-east1
450
asia-northeast1
20
asia-northeast2
20
australia-southeast1
20
australia-southeast2
20
asia-northeast3
20
me-west1
20
Number of A2 CPUs for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
96
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
96
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
96
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of concurrent A3 CPUs for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
0
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
0
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
0
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of concurrent K80 GPUs for training, per region Region Value us-west1
30
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
56
us-east1
30
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
30
europe-west4
Not available
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
56
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of P4 GPUs for training, per region Region Value us-west1
Not available
us-west2
6
us-west3
Not available
us-west4
Not available
us-central1
6
us-east1
Not available
us-east4
1
us-south1
Not available
northamerica-northeast1
6
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
6
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
6
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
6
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of T4 GPUs for training, per region Region Value us-west1
2
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
12
us-east1
2
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
6
europe-west1
Not available
europe-west4
2
europe-west6
Not available
europe-west3
0
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
1
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
6
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
1
me-west1
Not available
Number of L4 GPUs for training, per region Region Value us-west1
0
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
0
us-east1
0
us-east4
0
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
0
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
0
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of P100 GPUs for training, per region Region Value us-west1
30
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
56
us-east1
30
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
30
europe-west4
Not available
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
30
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
6
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of V100 GPUs for training, per region Region Value us-west1
6
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
6
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
6
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
6
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of A100 GPUs for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
8
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
8
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
8
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of concurrent A100 80GB GPUs for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
0
us-east1
Not available
us-east4
0
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
0
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
0
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
If interested, please see the quotas documentation .
Number of concurrent H100 GPUs for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
0
us-east1
Not available
us-east4
0
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
0
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
If interested, please see the quotas documentation .
Number of TPU V2 cores for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
8
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
8
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
8
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of TPU V2 pod cores for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
Not available
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
Not available
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of TPU V3 cores for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
8
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
8
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
8
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of TPU V3 pod cores for training, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
Not available
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
Not available
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
HDD usage (GB) during training, per region Region Value us-west1
180,000
us-west2
3,600
us-west3
180,000
us-west4
3,600
us-central1
180,000
us-east1
180,000
us-east4
3,600
us-south1
180,000
northamerica-northeast1
180,000
northamerica-northeast2
3,600
southamerica-east1
3,600
southamerica-west1
3,600
europe-west2
180,000
europe-west1
180,000
europe-west4
180,000
europe-west6
3,600
europe-west3
180,000
europe-north1europe-central2
3,600
europe-west8
3,600
europe-west9
180,000
europe-southwest1asia-south1
3,600
asia-southeast1
180,000
asia-southeast2
180,000
asia-east2
180,000
asia-east1
180,000
asia-northeast1
180,000
asia-northeast2
3,600
australia-southeast1
180,000
australia-southeast2
3,600
asia-northeast3
180,000
me-west1
180,000
SSD usage (GB) during training, per region Region Value us-west1
75,000
us-west2
450
us-west3
75,000
us-west4
450
us-central1
75,000
us-east1
75,000
us-east4
450
us-south1
75,000
northamerica-northeast1
75,000
northamerica-northeast2
450
southamerica-east1
450
southamerica-west1
450
europe-west2
75,000
europe-west1
75,000
europe-west4
75,000
europe-west6
450
europe-west3
75,000
europe-north1europe-central2
450
europe-west8
450
europe-west9
75,000
europe-southwest1asia-south1
450
asia-southeast1
75,000
asia-southeast2
75,000
asia-east2
75,000
asia-east1
75,000
asia-northeast1
75,000
asia-northeast2
450
australia-southeast1
75,000
australia-southeast2
450
asia-northeast3
75,000
me-west1
75,000
Serving
Quota
Value
Number of deployed custom models
100
Number of CPUs for serving, per region Region Value us-west1
2,200
us-west2
2,200
us-west3
2,200
us-west4
16
us-central1
2,200
us-east1
2,200
us-east4
2,200
us-south1
450
northamerica-northeast1
2,200
northamerica-northeast2
450
southamerica-east1
2,200
southamerica-west1
450
europe-west2
2,200
europe-west1
2,200
europe-west4
2,200
europe-west6
2,200
europe-west3
2,200
europe-north1europe-central2
16
europe-west8
16
europe-west9
16
europe-southwest1asia-south1
16
asia-southeast1
2,200
asia-southeast2
2,200
asia-east2
2,200
asia-east1
2,200
asia-northeast1
2,200
asia-northeast2
16
australia-southeast1
2,200
australia-southeast2
16
asia-northeast3
2,200
me-west1
450
Number of K80 GPUs for serving, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
56
us-east1
30
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
30
europe-west4
Not available
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
56
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of P100 GPUs for serving, per region Region Value us-west1
30
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
56
us-east1
30
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
30
europe-west4
Not available
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
30
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of P4 GPUs for serving, per region Region Value us-west1
Not available
us-west2
6
us-west3
Not available
us-west4
Not available
us-central1
6
us-east1
Not available
us-east4
6
us-south1
Not available
northamerica-northeast1
6
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
6
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
6
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
6
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of T4 GPUs for serving, per region Region Value us-west1
12
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
12
us-east1
12
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
12
europe-west1
Not available
europe-west4
12
europe-west6
Not available
europe-west3
0
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
6
asia-southeast2
Not available
asia-east2
12
asia-east1
6
asia-northeast1
6
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
6
me-west1
Not available
Number of L4 GPUs for serving, per region Region Value us-west1
28
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
28
us-east1
28
us-east4
28
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
28
europe-west1
28
europe-west4
28
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
28
asia-southeast2
Not available
asia-east2
Not available
asia-east1
28
asia-northeast1
28
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of V100 GPUs for serving, per region Region Value us-west1
6
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
6
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
6
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of A100 GPUs for serving, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
14
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
14
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
14
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
14
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
14
me-west1
14
Number of A100 80GB GPUs for serving, per region Region Value us-west1
Not available
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
0
us-east1
Not available
us-east4
0
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
0
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
0
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Number of v5e TPU Chips for serving, per region Region Value us-west1
4
us-west2
Not available
us-west3
Not available
us-west4
Not available
us-central1
Not available
us-east1
Not available
us-east4
Not available
us-south1
Not available
northamerica-northeast1
Not available
northamerica-northeast2
Not available
southamerica-east1
Not available
southamerica-west1
Not available
europe-west2
Not available
europe-west1
Not available
europe-west4
Not available
europe-west6
Not available
europe-west3
Not available
europe-north1europe-central2
Not available
europe-west8
Not available
europe-west9
Not available
europe-southwest1asia-south1
Not available
asia-southeast1
Not available
asia-southeast2
Not available
asia-east2
Not available
asia-east1
Not available
asia-northeast1
Not available
asia-northeast2
Not available
australia-southeast1
Not available
australia-southeast2
Not available
asia-northeast3
Not available
me-west1
Not available
Vertex AI Feature Store
This section lists the quotas and limits for the following:
Vertex AI Feature Store
The following quotas apply to a given project and region. For example, in a
single project, you can have 200 online serving nodes in us-central1
and
another 50 nodes in us-east4
.
Quota
Value
Online serving requests per minute
300,000
Maximum number of FeatureOnlineStore
instances
10
Maximum number of search requests per minute
6,000,000
Maximum number of online serving nodes across all Optimized FeatureOnlineStore
instances in the project
80
Maximum number of FeatureView
instances across all FeatureOnlineStore
instances
30
Vertex AI Feature Store also has the following limits. You can't request an increase to any of the limits in the following table:
Limit
Value
Maximum number of FeatureGroup
resources in a project and location
250
Maximum number of Feature
resources within a FeatureGroup
10,000
Maximum size of feature data per entity
5 MB
Number of entity IDs per online serving request (FetchFeatureValues
)
1
Vertex AI Feature Store (Legacy)
The following quotas apply to a given project and region. For example, in a
single project, you can have 75 concurrent batch jobs in us-central1
and
another 75 jobs in europe-west4
.
Quota
Value
Online serving requests per minute
300,000
Streaming ingestion requests per minute
60,000
Streaming ingestion write throughput per minute
1.2 GB
Feature creation requests per minute
100
Online serving nodes across all featurestores
30
Concurrent batch jobs (ingestion, serving, and delete feature values combined)
75
Concurrent requests to delete feature values
1
Entity types across all featurestores
75
Vertex AI Feature Store (Legacy) also has the following limits. You can't request an increase to any of the limits in the following table:
Limit
Value
Storage limit for an online serving node
5 TB
Total data in the offline store
Unlimited
Features per entity type
5,000
Number of create, update, and delete featurestore requests per day per
project per region
500
For streaming ingestion, the size per request
1 MB
For streaming read, the number of entities that can be included per
request
100
For batch import, the number of files that can be included per
request
5,000 for Avro or 500 for CSV
For batch serving and exports, the number of features you can
request
5,000
For batch ingestion and streaming ingestion, the oldest timestamp for which feature data can be ingested
4,000 days from current date
The data retention limit in Vertex AI Feature Store (Legacy) has the following default value, which you can override:
Data retention defaults
Default value
Data retention in offline store (oldest feature value timestamp after which the values
are deleted)
4,000 days from the current date
Data retention in online store (oldest feature value timestamp after which the values
are deleted)
4,000 days from the current date
You can override the data retention limit in the following ways:
To override the data retention limit for the online store, set the online_storage_ttl_days
parameter while creating or updating a featurestore.
To override the data retention limit for the offline store, set the offline_storage_ttl_days
parameter while creating or updating an entity type.
Vector Search
The following quotas apply to Vector Search for a given
project in each region.
Quota
Value
Concurrent index creation operations
5
Concurrent index update operations
5
Number of deployed index nodes
50
Number of deployed index N2D nodes
5
Number of Index
100
Streaming Update requests per minute
6,000
Streaming Update throughput(in KB) per minute
120,000
Vertex AI Pipelines
The following quotas and limits apply to Vertex AI Pipelines for a
given project in each region.
Quota
Value
Running pipeline tasks in parallel*
600
Concurrent pipeline runs*
300
* Pipeline run / task requests beyond this limit are queued until resources are available.
Vertex AI Pipelines have the following limits. Note that,
unlike quotas, you can't request an increase to a limit.
Limit
Value
Number of pipeline tasks per job
10,000
Input and output artifacts per pipeline task
100
Input and output artifacts per pipeline job
10,000
Maximum size of JSON payload containing output parameters and artifacts per pipeline task
131,072 bytes
Vertex AI Decision Optimization
The following quotas and limits apply to Vertex AI Decision Optimization for a
given project in each region.
Quota
Value
Solve request per minute
30
Colab Enterprise quotas and limits
Colab Enterprise quotas and limits are listed separately. See
Colab Enterprise quotas and limits .
Quota increases
If you want to increase any of your quotas for Vertex AI, you can
use the Google Cloud console to request a quota increase.
For more information about submitting a quota increase request, see the
following sections of Working with quotas :
What's next
Send feedback
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License , and code samples are licensed under the Apache 2.0 License . For details, see the Google Developers Site Policies . Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-04-24 UTC.
[{
"type": "thumb-down",
"id": "hardToUnderstand",
"label":"Hard to understand"
},{
"type": "thumb-down",
"id": "incorrectInformationOrSampleCode",
"label":"Incorrect information or sample code"
},{
"type": "thumb-down",
"id": "missingTheInformationSamplesINeed",
"label":"Missing the information/samples I need"
},{
"type": "thumb-down",
"id": "otherDown",
"label":"Other"
}]
[{
"type": "thumb-up",
"id": "easyToUnderstand",
"label":"Easy to understand"
},{
"type": "thumb-up",
"id": "solvedMyProblem",
"label":"Solved my problem"
},{
"type": "thumb-up",
"id": "otherUp",
"label":"Other"
}]
Need to tell us more?