Stay organized with collections
Save and categorize content based on your preferences.
Vertex AI pricing
Prices are listed in US Dollars (USD).
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
Vertex AI pricing compared to legacy product pricing
The costs for Vertex AI remain the same as they are for the legacy
AI Platform and AutoML products that Vertex AI
supersedes, with the following exceptions:
Legacy AI Platform Prediction and AutoML Tables
predictions supported lower-cost, lower-performance machine types that aren't
supported for Vertex AI Prediction and AutoML tabular.
Legacy AI Platform Prediction supported scale-to-zero, which
isn't supported for Vertex AI Prediction.
Vertex AI also offers more ways to optimize costs, such as the following:
For Vertex AI AutoML models, you pay for three main activities:
Training the model
Deploying the model to an endpoint
Using the model to make predictions
Vertex AI uses predefined machine configurations for Vertex AutoML models,
and the hourly rate for these activities reflects the resource usage.
The time required to train your model depends on the size and complexity
of your training data. Models must be deployed before they can provide online
predictions or online explanations.
You pay for each model deployed to an endpoint, even if no prediction is made.
You must undeploy your model to stop incurring further charges.
Models that are not deployed or have failed to deploy are not charged.
You pay only for compute hours used; if training fails for any reason
other than a user-initiated cancellation, you are not billed for the
time. You are charged for training time if you cancel the operation.
Select a model type below for pricing information.
Compute associated with Vertex Explainable AI is charged at same rate as prediction.
However, explanations take longer to process than normal predictions, so heavy
usage of Vertex Explainable AI along with auto-scaling could result in more nodes being
started, which would increase prediction charges.
Vertex AI Forecast
AutoML
Stage
Pricing
Prediction
$0.2 per 1K data points* (0-1M points) $0.1 per 1K data points* (1M-50M points) $0.02 per 1K data points* (>50M points)
* A prediction data point is one time point in the forecast horizon. For example, with daily granularity a 7-day horizon is 7 points per each time series.
Up to 5 prediction quantiles can be included at no additional cost.
The number of data points consumed per tier is refreshed monthly.
$250.00 per TB x Number of Candidate Models x Number of Backtesting Windows*
Explainable AI
Explainability with time series decomposition does not add any additional cost. Explainability using Shapley values is not supported.
Refer to the BigQuery ML pricing page for additional details. Each training and prediction job incurs the cost of 1 managed pipeline run, as described in Vertex AI pricing.
* A backtesting window is created for each period in the test set. The AUTO_ARIMA_MAX_ORDER used determines the number of candidate models. It ranges from 6-42 for models with multiple time series.
Custom-trained models
Training
The tables below provide the approximate price per hour of various training
configurations. You can choose a custom configuration of selected
machine types. To calculate pricing,
sum the costs of the virtual machines you use.
If you use Compute Engine machine types and attach
accelerators, the cost of the accelerators is separate. To calculate this cost,
multiply the prices in the table of accelerators below by how many
machine hours of each type of accelerator you use.
Machine types
You can use Spot VMs with Vertex AI custom training.
Spot VMs are billed according to
Compute Engine Spot VMs pricing. There are Vertex AI custom
training management fees in addition to your infrastructure usage, captured in the
following tables.
You can use Compute Engine reservations with Vertex AI custom training. When using
Compute Engine reservations, you're billed according to
Compute Engine Pricing, including any applicable committed
use discounts (CUDs). There are Vertex AI custom training management fees in
addition to your infrastructure usage, captured in the following tables.
*This amount includes GPU price, since this instance type always requires a fixed number of GPU accelerators.
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
Accelerators
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
* The price for training using a Cloud TPU Pod is based on the
number of cores in the Pod. The number of cores in a pod is always a multiple
of 32. To determine the price of training on a Pod that has more than 32 cores,
take the price for a 32-core Pod, and multiply it by the number of cores,
divided by 32. For example, for a 128-core Pod, the price is
(32-core Pod price) * (128/32). For information about which Cloud TPU
Pods are available for a specific region, see System Architecture
in the Cloud TPU documentation.
Disks
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
You are required to store your data and program files in
Google Cloud Storage buckets during the Vertex AI lifecycle.
See more about Cloud Storage usage.
You are charged for training your models from the moment when resources are
provisioned for a job until the job finishes.
Scale tiers for predefined configurations (AI Platform Training)
You can control the type of processing cluster to use when training your model.
The simplest way is to choose from one of the predefined configurations called
scale tiers. Read more about
scale tiers.
Machine types for custom configurations
If you use Vertex AI or select CUSTOM as your scale tier for
AI Platform Training, you have control over the number and
type of virtual machines to use for the cluster's master, worker and parameter
servers. Read more about
machine types for Vertex AI and
machine types for AI Platform Training.
The cost of training with a custom processing cluster is the sum of all the
machines you specify. You are charged for the total time of the job, not for the
active processing time of individual machines.
Gen AI Evaluation Service
Vertex AI Gen AI Evaluation Service charges string input and output fields by every 1,000 characters. One character is defined as one Unicode character. White space is excluded from the count. Failed evaluation request, including filtered response, will not be charged for input nor output. At the end of each billing cycle, fractions of one cent ($0.01) are rounded to one cent.
Gen AI Evaluation Service is generally available (GA). Pricing took effect
on September 27, 2024.
Metric
Pricing
Pointwise
Input: $0.005 per 1k characters Output: $0.015 per 1k characters
Pairwise
Input: $0.005 per 1k characters Output: $0.015 per 1k characters
Computation-based metrics are charged at $0.00003 per 1k characters for input and $0.00009 per 1k characters for output. They are referred to as Automatic Metric in SKU.
Metric Name
Type
Exact Match
Computation-based
Bleu
Computation-based
Rouge
Computation-based
Tool Call Valid
Computation-based
Tool Name Match
Computation-based
Tool Parameter Key Match
Computation-based
Tool Parameter KV Match
Computation-based
Prices are listed in US Dollars (USD).
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
Legacy model-based metrics are charged at $0.005 per 1k characters for input and $0.015 per 1k characters for output.
Metric Name
Type
Coherence
Pointwise
Fluency
Pointwise
Fulfillment
Pointwise
Safety
Pointwise
Groundedness
Pointwise
Summarization Quality
Pointwise
Summarization Helpfulness
Pointwise
Summarization Verbosity
Pointwise
Question Answering Quality
Pointwise
Question Answering Relevance
Pointwise
Question Answering Helpfulness
Pointwise
Question Answering Correctness
Pointwise
Pairwise Summarization Quality
Pairwise
Pairwise Question Answering Quality
Pairwise
Prices are listed in US Dollars (USD).
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
Ray on Vertex AI
Training
The tables below provide the approximate price per hour of various training
configurations. You can choose a custom configuration of selected
machine types. To calculate pricing,
sum the costs of the virtual machines you use.
If you use Compute Engine machine types and attach
accelerators, the cost of the accelerators is separate. To calculate this cost,
multiply the prices in the table of accelerators below by how many
machine hours of each type of accelerator you use.
Machine types
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
Accelerators
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
* The price for training using a Cloud TPU Pod is based on the
number of cores in the Pod. The number of cores in a pod is always a multiple
of 32. To determine the price of training on a Pod that has more than 32 cores,
take the price for a 32-core Pod, and multiply it by the number of cores,
divided by 32. For example, for a 128-core Pod, the price is
(32-core Pod price) * (128/32). For information about which Cloud TPU
Pods are available for a specific region, see System Architecture
in the Cloud TPU documentation.
Disks
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
You are required to store your data and program files in
Google Cloud Storage buckets during the Vertex AI lifecycle.
See more about Cloud Storage usage.
You are charged for training your models from the moment when resources are
provisioned for a job until the job finishes.
Prediction and explanation
The following tables provide the prices of batch prediction, online prediction,
and online explanation per node hour. A node hour represents the time a virtual
machine spends running your prediction job or waiting in an active state (an
endpoint with one or more models deployed) to handle prediction or explanation
requests.
You can use Spot VMs with Vertex AI custom training.
Spot VMs are billed according to
Compute Engine Spot VMs pricing. There are Vertex AI custom
training management fees in addition to your infrastructure usage, captured in the
following tables.
You can use Compute Engine reservations with Vertex AI custom training. When using
Compute Engine reservations, you're billed according to
Compute Engine Pricing, including any applicable committed
use discounts (CUDs). There are Vertex AI custom training management fees in
addition to your infrastructure usage, captured in the following tables.
The following tables provide the price per node hour for each machine type.
E2 Series
e2-standard-2approximations:
us-west2
$0.0926
us-west4
$0.0868
us-east4
$0.0868
northamerica-northeast1
$0.0848
northamerica-northeast2
$0.0848
southamerica-east1
$0.1223
Other Americas regions
$0.0771
e2-standard-4approximations:
us-west2
$0.1851
us-west4
$0.1736
us-east4
$0.1736
northamerica-northeast1
$0.1697
northamerica-northeast2
$0.1697
southamerica-east1
$0.2446
Other Americas regions
$0.1541
e2-standard-8approximations:
us-west2
$0.3702
us-west4
$0.3471
us-east4
$0.3471
northamerica-northeast1
$0.3393
northamerica-northeast2
$0.3393
southamerica-east1
$0.4893
Other Americas regions
$0.3082
e2-standard-16approximations:
us-west2
$0.7405
us-west4
$0.6942
us-east4
$0.6942
northamerica-northeast1
$0.6787
northamerica-northeast2
$0.6787
southamerica-east1
$0.9786
Other Americas regions
$0.6165
e2-standard-32approximations:
us-west2
$1.4809
us-west4
$1.3885
us-east4
$1.3885
northamerica-northeast1
$1.3574
northamerica-northeast2
$1.3574
southamerica-east1
$1.9572
Other Americas regions
$1.2329
e2-highmem-2approximations:
us-west2
$0.1249
us-west4
$0.1171
us-east4
$0.1171
northamerica-northeast1
$0.1144
northamerica-northeast2
$0.1144
southamerica-east1
$0.165
Other Americas regions
$0.1039
e2-highmem-4approximations:
us-west2
$0.2497
us-west4
$0.2341
us-east4
$0.2341
northamerica-northeast1
$0.2289
northamerica-northeast2
$0.2289
southamerica-east1
$0.33
Other Americas regions
$0.2079
e2-highmem-8approximations:
us-west2
$0.4994
us-west4
$0.4682
us-east4
$0.4682
northamerica-northeast1
$0.4578
northamerica-northeast2
$0.4578
southamerica-east1
$0.66
Other Americas regions
$0.4158
e2-highmem-16approximations:
us-west2
$0.9989
us-west4
$0.9365
us-east4
$0.9365
northamerica-northeast1
$0.9155
northamerica-northeast2
$0.9155
southamerica-east1
$1.3201
Other Americas regions
$0.8316
e2-highcpu-2approximations:
us-west2
$0.0683
us-west4
$0.0641
us-east4
$0.0641
northamerica-northeast1
$0.0626
northamerica-northeast2
$0.0626
southamerica-east1
$0.0903
Other Americas regions
$0.0569
e2-highcpu-4approximations:
us-west2
$0.1367
us-west4
$0.1281
us-east4
$0.1281
northamerica-northeast1
$0.1253
northamerica-northeast2
$0.1253
southamerica-east1
$0.1806
Other Americas regions
$0.1138
e2-highcpu-8approximations:
us-west2
$0.2733
us-west4
$0.2563
us-east4
$0.2563
northamerica-northeast1
$0.2505
northamerica-northeast2
$0.2505
southamerica-east1
$0.3612
Other Americas regions
$0.2276
e2-highcpu-16approximations:
us-west2
$0.5467
us-west4
$0.5126
us-east4
$0.5126
northamerica-northeast1
$0.501
northamerica-northeast2
$0.501
southamerica-east1
$0.7225
Other Americas regions
$0.4551
e2-highcpu-32approximations:
us-west2
$1.0933
us-west4
$1.0252
us-east4
$1.0252
northamerica-northeast1
$1.0021
northamerica-northeast2
$1.0021
southamerica-east1
$1.4449
Other Americas regions
$0.9102
N1 Series
n1-standard-2approximations:
us-east4
$0.123
northamerica-northeast1
$0.1203
Other Americas regions
$0.1093
n1-standard-4approximations:
us-east4
$0.2461
northamerica-northeast1
$0.2405
Other Americas regions
$0.2186
n1-standard-8approximations:
us-east4
$0.4922
northamerica-northeast1
$0.4811
Other Americas regions
$0.4372
n1-standard-16approximations:
us-east4
$0.9843
northamerica-northeast1
$0.9622
Other Americas regions
$0.8744
n1-standard-32approximations:
us-east4
$1.9687
northamerica-northeast1
$1.9243
Other Americas regions
$1.7488
n1-highmem-2approximations:
us-east4
$0.1532
northamerica-northeast1
$0.1498
Other Americas regions
$0.1361
n1-highmem-4approximations:
us-east4
$0.3064
northamerica-northeast1
$0.2995
Other Americas regions
$0.2723
n1-highmem-8approximations:
us-east4
$0.6129
northamerica-northeast1
$0.5991
Other Americas regions
$0.5445
n1-highmem-16approximations:
us-east4
$1.2257
northamerica-northeast1
$1.1982
Other Americas regions
$1.089
n1-highcpu-2approximations:
us-east4
$0.0918
northamerica-northeast1
$0.0897
Other Americas regions
$0.0815
n1-highcpu-4approximations:
us-east4
$0.1835
northamerica-northeast1
$0.1794
Other Americas regions
$0.163
n1-highcpu-8approximations:
us-east4
$0.3671
northamerica-northeast1
$0.3588
Other Americas regions
$0.326
n1-highcpu-16approximations:
us-east4
$0.7341
northamerica-northeast1
$0.7176
Other Americas regions
$0.6519
n1-highcpu-32approximations:
us-east4
$1.4683
northamerica-northeast1
$1.4352
Other Americas regions
$1.3039
N2 Series
n2-standard-2approximations:
northamerica_northeast1
$0.123
northamerica_northeast2
$0.123
southamerica_east1
$0.1773
us_central1
$0.1117
us_east1
$0.1117
us_east4
$0.1258
us_south1
$0.1318
us_west1
$0.1117
us_west2
$0.1341
us_west3
$0.1341
us_west4
$0.1258
n2-standard-4approximations:
northamerica_northeast1
$0.2459
northamerica_northeast2
$0.2459
southamerica_east1
$0.3546
us_central1
$0.2234
us_east1
$0.2234
us_east4
$0.2516
us_south1
$0.2636
us_west1
$0.2234
us_west2
$0.2683
us_west3
$0.2683
us_west4
$0.2516
n2-standard-8approximations:
northamerica_northeast1
$0.4918
northamerica_northeast2
$0.4918
southamerica_east1
$0.7091
us_central1
$0.4467
us_east1
$0.4467
us_east4
$0.5031
us_south1
$0.5272
us_west1
$0.4467
us_west2
$0.5366
us_west3
$0.5366
us_west4
$0.5031
n2-standard-16approximations:
northamerica_northeast1
$0.9836
northamerica_northeast2
$0.9836
southamerica_east1
$1.4183
us_central1
$0.8935
us_east1
$0.8935
us_east4
$1.0063
us_south1
$1.0543
us_west1
$0.8935
us_west2
$1.0732
us_west3
$1.0732
us_west4
$1.0062
n2-standard-32approximations:
northamerica_northeast1
$1.9673
northamerica_northeast2
$1.9673
southamerica_east1
$2.8365
us_central1
$1.787
us_east1
$1.787
us_east4
$2.0126
us_south1
$2.1087
us_west1
$1.787
us_west2
$2.1464
us_west3
$2.1464
us_west4
$2.0125
n2-highmem-2approximations:
northamerica_northeast1
$0.1659
northamerica_northeast2
$0.1659
southamerica_east1
$0.2392
us_central1
$0.1507
us_east1
$0.1507
us_east4
$0.1697
us_south1
$0.1778
us_west1
$0.1507
us_west2
$0.181
us_west3
$0.181
us_west4
$0.1697
n2-highmem-4approximations:
northamerica_northeast1
$0.3317
northamerica_northeast2
$0.3317
southamerica_east1
$0.4783
us_central1
$0.3013
us_east1
$0.3013
us_east4
$0.3394
us_south1
$0.3556
us_west1
$0.3013
us_west2
$0.3619
us_west3
$0.3619
us_west4
$0.3393
n2-highmem-8approximations:
northamerica_northeast1
$0.6634
northamerica_northeast2
$0.6634
southamerica_east1
$0.9566
us_central1
$0.6027
us_east1
$0.6027
us_east4
$0.6787
us_south1
$0.7112
us_west1
$0.6027
us_west2
$0.7239
us_west3
$0.7239
us_west4
$0.6787
n2-highmem-16approximations:
northamerica_northeast1
$1.3269
northamerica_northeast2
$1.3269
southamerica_east1
$1.9132
us_central1
$1.2053
us_east1
$1.2053
us_east4
$1.3574
us_south1
$1.4223
us_west1
$1.2053
us_west2
$1.4477
us_west3
$1.4477
us_west4
$1.3574
n2-highcpu-2approximations:
northamerica_northeast1
$0.0908
northamerica_northeast2
$0.0908
southamerica_east1
$0.1309
us_central1
$0.0825
us_east1
$0.0825
us_east4
$0.0929
us_south1
$0.0973
us_west1
$0.0825
us_west2
$0.099
us_west3
$0.099
us_west4
$0.0929
n2-highcpu-4approximations:
northamerica_northeast1
$0.1815
northamerica_northeast2
$0.1815
southamerica_east1
$0.2618
us_central1
$0.1649
us_east1
$0.1649
us_east4
$0.1857
us_south1
$0.1946
us_west1
$0.1649
us_west2
$0.1981
us_west3
$0.1981
us_west4
$0.1857
n2-highcpu-8approximations:
northamerica_northeast1
$0.3631
northamerica_northeast2
$0.3631
southamerica_east1
$0.5235
us_central1
$0.3298
us_east1
$0.3298
us_east4
$0.3715
us_south1
$0.3892
us_west1
$0.3298
us_west2
$0.3961
us_west3
$0.3961
us_west4
$0.3714
n2-highcpu-16approximations:
northamerica_northeast1
$0.7262
northamerica_northeast2
$0.7262
southamerica_east1
$1.0471
us_central1
$0.6596
us_east1
$0.6596
us_east4
$0.7429
us_south1
$0.7783
us_west1
$0.6596
us_west2
$0.7923
us_west3
$0.7923
us_west4
$0.7429
n2-highcpu-32approximations:
northamerica_northeast1
$1.4523
northamerica_northeast2
$1.4523
southamerica_east1
$2.0941
us_central1
$1.3192
us_east1
$1.3192
us_east4
$1.4858
us_south1
$1.5567
us_west1
$1.3192
us_west2
$1.5846
us_west3
$1.5846
us_west4
$1.4858
N2D Series
n2d-standard-2approximations:
northamerica_northeast1
$0.107
southamerica_east1
$0.1542
us_central1
$0.0972
us_east1
$0.0972
us_east4
$0.1094
us_west1
$0.0972
us_west2
$0.1167
us_west4
$0.1094
n2d-standard-4approximations:
northamerica_northeast1
$0.2139
southamerica_east1
$0.3085
us_central1
$0.1943
us_east1
$0.1943
us_east4
$0.2189
us_west1
$0.1943
us_west2
$0.2334
us_west4
$0.2189
n2d-standard-8approximations:
northamerica_northeast1
$0.4279
southamerica_east1
$0.617
us_central1
$0.3887
us_east1
$0.3887
us_east4
$0.4377
us_west1
$0.3887
us_west2
$0.4668
us_west4
$0.4377
n2d-standard-16approximations:
northamerica_northeast1
$0.8558
southamerica_east1
$1.2339
us_central1
$0.7773
us_east1
$0.7773
us_east4
$0.8755
us_west1
$0.7773
us_west2
$0.9336
us_west4
$0.8755
n2d-standard-32approximations:
northamerica_northeast1
$1.7116
southamerica_east1
$2.4678
us_central1
$1.5547
us_east1
$1.5547
us_east4
$1.7509
us_west1
$1.5547
us_west2
$1.8673
us_west4
$1.7509
n2d-highmem-2approximations:
northamerica_northeast1
$0.1443
southamerica_east1
$0.2081
us_central1
$0.1311
us_east1
$0.1311
us_east4
$0.1476
us_west1
$0.1311
us_west2
$0.1574
us_west4
$0.1476
n2d-highmem-4approximations:
northamerica_northeast1
$0.2886
southamerica_east1
$0.4161
us_central1
$0.2622
us_east1
$0.2622
us_east4
$0.2952
us_west1
$0.2622
us_west2
$0.3149
us_west4
$0.2952
n2d-highmem-8approximations:
northamerica_northeast1
$0.5772
southamerica_east1
$0.8323
us_central1
$0.5243
us_east1
$0.5243
us_east4
$0.5905
us_west1
$0.5243
us_west2
$0.6297
us_west4
$0.5905
n2d-highmem-16approximations:
northamerica_northeast1
$1.1545
southamerica_east1
$1.6646
us_central1
$1.0486
us_east1
$1.0486
us_east4
$1.181
us_west1
$1.0486
us_west2
$1.2595
us_west4
$1.181
n2d-highcpu-2approximations:
northamerica_northeast1
$0.079
southamerica_east1
$0.1139
us_central1
$0.0717
us_east1
$0.0717
us_east4
$0.0808
us_west1
$0.0717
us_west2
$0.0862
us_west4
$0.0808
n2d-highcpu-4approximations:
northamerica_northeast1
$0.1579
southamerica_east1
$0.2277
us_central1
$0.1435
us_east1
$0.1435
us_east4
$0.1616
us_west1
$0.1435
us_west2
$0.1723
us_west4
$0.1616
n2d-highcpu-8approximations:
northamerica_northeast1
$0.3159
southamerica_east1
$0.4555
us_central1
$0.2869
us_east1
$0.2869
us_east4
$0.3232
us_west1
$0.2869
us_west2
$0.3446
us_west4
$0.3232
n2d-highcpu-16approximations:
northamerica_northeast1
$0.6318
southamerica_east1
$0.9109
us_central1
$0.5739
us_east1
$0.5739
us_east4
$0.6463
us_west1
$0.5739
us_west2
$0.6893
us_west4
$0.6463
n2d-highcpu-32approximations:
northamerica_northeast1
$1.2636
southamerica_east1
$1.8219
us_central1
$1.1477
us_east1
$1.1477
us_east4
$1.2927
us_west1
$1.1477
us_west2
$1.3786
us_west4
$1.2927
C2 Series
c2-standard-4approximations:
northamerica_northeast1
$0.264
southamerica_east1
$0.3812
us_central1
$0.24
us_east1
$0.24
us_east4
$0.2702
us_west1
$0.24
us_west2
$0.2884
us_west3
$0.2889
us_west4
$0.2702
c2-standard-8approximations:
northamerica_northeast1
$0.5281
southamerica_east1
$0.7623
us_central1
$0.4801
us_east1
$0.4801
us_east4
$0.5405
us_west1
$0.4801
us_west2
$0.5768
us_west3
$0.5778
us_west4
$0.5405
c2-standard-16approximations:
northamerica_northeast1
$1.0562
southamerica_east1
$1.5246
us_central1
$0.9601
us_east1
$0.9601
us_east4
$1.081
us_west1
$0.9601
us_west2
$1.1537
us_west3
$1.1555
us_west4
$1.081
c2-standard-30approximations:
northamerica_northeast1
$1.9803
southamerica_east1
$2.8587
us_central1
$1.8002
us_east1
$1.8002
us_east4
$2.0269
us_west1
$1.8002
us_west2
$2.1631
us_west3
$2.1666
us_west4
$2.0269
c2-standard-60approximations:
northamerica_northeast1
$3.9606
southamerica_east1
$5.7173
us_central1
$3.6004
us_east1
$3.6004
us_east4
$4.0537
us_west1
$3.6004
us_west2
$4.3263
us_west3
$4.3332
us_west4
$4.0537
C2D Series
c2d-standard-2approximations:
us_central1
$0.1044
us_east1
$0.1044
us_east4
$0.1176
us_west1
$0.1044
us_west4
$0.1176
c2d-standard-4approximations:
us_central1
$0.2088
us_east1
$0.2088
us_east4
$0.2352
us_west1
$0.2088
us_west4
$0.2352
c2d-standard-8approximations:
us_central1
$0.4177
us_east1
$0.4177
us_east4
$0.4704
us_west1
$0.4177
us_west4
$0.4704
c2d-standard-16approximations:
us_central1
$0.8353
us_east1
$0.8353
us_east4
$0.9408
us_west1
$0.8353
us_west4
$0.9408
c2d-standard-32approximations:
us_central1
$1.6707
us_east1
$1.6707
us_east4
$1.8815
us_west1
$1.6707
us_west4
$1.8815
c2d-standard-56approximations:
us_central1
$2.9237
us_east1
$2.9237
us_east4
$3.2926
us_west1
$2.9237
us_west4
$3.2926
c2d-standard-112approximations:
us_central1
$5.8474
us_east1
$5.8474
us_east4
$6.5853
us_west1
$5.8474
us_west4
$6.5853
c2d-highmem-2approximations:
us_central1
$0.1408
us_east1
$0.1408
us_east4
$0.1586
us_west1
$0.1408
us_west4
$0.1586
c2d-highmem-4approximations:
us_central1
$0.2817
us_east1
$0.2817
us_east4
$0.3172
us_west1
$0.2817
us_west4
$0.3172
c2d-highmem-8approximations:
us_central1
$0.5634
us_east1
$0.5634
us_east4
$0.6344
us_west1
$0.5634
us_west4
$0.6344
c2d-highmem-16approximations:
us_central1
$1.1267
us_east1
$1.1267
us_east4
$1.2689
us_west1
$1.1267
us_west4
$1.2689
c2d-highmem-32approximations:
us_central1
$2.2534
us_east1
$2.2534
us_east4
$2.5377
us_west1
$2.2534
us_west4
$2.5377
c2d-highmem-56approximations:
us_central1
$3.9435
us_east1
$3.9435
us_east4
$4.441
us_west1
$3.9435
us_west4
$4.441
c2d-highmem-112approximations:
us_central1
$7.887
us_east1
$7.887
us_east4
$8.882
us_west1
$7.887
us_west4
$8.882
c2d-highcpu-2approximations:
us_central1
$0.0862
us_east1
$0.0862
us_east4
$0.0971
us_west1
$0.0862
us_west4
$0.0971
c2d-highcpu-4approximations:
us_central1
$0.1724
us_east1
$0.1724
us_east4
$0.1942
us_west1
$0.1724
us_west4
$0.1942
c2d-highcpu-8approximations:
us_central1
$0.3448
us_east1
$0.3448
us_east4
$0.3884
us_west1
$0.3448
us_west4
$0.3884
c2d-highcpu-16approximations:
us_central1
$0.6896
us_east1
$0.6896
us_east4
$0.7767
us_west1
$0.6896
us_west4
$0.7767
c2d-highcpu-32approximations:
us_central1
$1.3793
us_east1
$1.3793
us_east4
$1.5534
us_west1
$1.3793
us_west4
$1.5534
c2d-highcpu-56approximations:
us_central1
$2.4138
us_east1
$2.4138
us_east4
$2.7185
us_west1
$2.4138
us_west4
$2.7185
c2d-highcpu-112approximations:
us_central1
$4.8275
us_east1
$4.8275
us_east4
$5.4369
us_west1
$4.8275
us_west4
$5.4369
C3 Series
c3-highcpu-4approximations:
us_central1
$0.1982
us_east1
$0.1982
us_east4
$0.2232
c3-highcpu-8approximations:
us_central1
$0.3965
us_east1
$0.3965
us_east4
$0.4465
c3-highcpu-22approximations:
us_central1
$1.0903
us_east1
$1.0903
us_east4
$1.2278
c3-highcpu-44approximations:
us_central1
$2.1806
us_east1
$2.1806
us_east4
$2.4556
c3-highcpu-88approximations:
us_central1
$4.3613
us_east1
$4.3613
us_east4
$4.9113
c3-highcpu-176approximations:
us_central1
$8.7226
us_east1
$8.7226
us_east4
$9.8226
A2 Series*
a2-highgpu-1gapproximations:
us-central1
$4.2245
a2-highgpu-2gapproximations:
us-central1
$8.449
a2-highgpu-4gapproximations:
us-central1
$16.898
a2-highgpu-8gapproximations:
us-central1
$33.796
a2-megagpu-16gapproximations:
us-central1
$64.1021
a2-ultragpu-1gapproximations:
us-central1
$5.7818
us-east4
$6.3524
a2-ultragpu-2gapproximations:
us-central1
$11.5637
us-east4
$12.7048
a2-ultragpu-4gapproximations:
us-central1
$23.1274
us-east4
$25.4095
a2-ultragpu-8gapproximations:
us-central1
$46.2548
us-east4
$50.8191
A3 Series*
a3-highgpu-2gapproximations:
us-west1
$25.2518
a3-highgpu-4gapproximations:
us-west1
$50.5037
a3-highgpu-8gapproximations:
us-central1
$101.0074
us-east4
$101.0074
us-west1
$101.0074
G2 Series*
g2-standard-4approximations:
us-central1
$0.8129
g2-standard-8approximations:
us-central1
$0.9818
g2-standard-12approximations:
us-central1
$1.1507
g2-standard-16approximations:
us-central1
$1.3196
g2-standard-24approximations:
us-central1
$2.3014
g2-standard-32approximations:
us-central1
$1.9951
g2-standard-48approximations:
us-central1
$4.6028
g2-standard-96approximations:
us-central1
$9.2055
*When consuming from a reservation or spot capacity, billing is spread
across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and
the Vertex AI Management Fee SKU. This enables you to use your Committed Use
Discounts (CUDs) in Vertex AI.
TPU v5e
ct5lp-hightpu-1t
Approximations:
us-west1
$1.38
ct5lp-hightpu-4t
Approximations:
us-west1
$5.52
ct5lp-hightpu-8t
Approximations:
us-west1
$11.04
Pricing for Europe
The following tables provide the price per node hour for each machine type.
E2 Series
e2-standard-2approximations:
europe-west1
$0.0848
europe-west2
$0.0993
europe-west3
$0.0993
europe-west4
$0.0848
europe-west6
$0.1078
europe-west9
$0.1079
e2-standard-4approximations:
europe-west1
$0.1695
europe-west2
$0.1986
europe-west3
$0.1986
europe-west4
$0.1697
europe-west6
$0.2156
europe-west9
$0.2158
e2-standard-8approximations:
europe-west1
$0.3391
europe-west2
$0.3971
europe-west3
$0.3971
europe-west4
$0.3393
europe-west6
$0.4313
europe-west9
$0.4316
e2-standard-16approximations:
europe-west1
$0.6782
europe-west2
$0.7943
europe-west3
$0.7943
europe-west4
$0.6787
europe-west6
$0.8626
europe-west9
$0.8631
e2-standard-32approximations:
europe-west1
$1.3563
europe-west2
$1.5885
europe-west3
$1.5885
europe-west4
$1.3574
europe-west6
$1.7251
europe-west9
$1.7262
e2-highmem-2approximations:
europe-west1
$0.1144
europe-west2
$0.1339
europe-west3
$0.1339
europe-west4
$0.1144
europe-west6
$0.1454
europe-west9
$0.1455
e2-highmem-4approximations:
europe-west1
$0.2287
europe-west2
$0.2679
europe-west3
$0.2679
europe-west4
$0.2289
europe-west6
$0.2909
europe-west9
$0.2911
e2-highmem-8approximations:
europe-west1
$0.4574
europe-west2
$0.5357
europe-west3
$0.5357
europe-west4
$0.4578
europe-west6
$0.5818
europe-west9
$0.5822
e2-highmem-16approximations:
europe-west1
$0.9149
europe-west2
$1.0714
europe-west3
$1.0714
europe-west4
$0.9155
europe-west6
$1.1636
europe-west9
$1.1643
e2-highcpu-2approximations:
europe-west1
$0.0626
europe-west2
$0.0733
europe-west3
$0.0733
europe-west4
$0.0626
europe-west6
$0.0796
europe-west9
$0.0796
e2-highcpu-4approximations:
europe-west1
$0.1252
europe-west2
$0.1466
europe-west3
$0.1466
europe-west4
$0.1253
europe-west6
$0.1592
europe-west9
$0.1593
e2-highcpu-8approximations:
europe-west1
$0.2503
europe-west2
$0.2932
europe-west3
$0.2932
europe-west4
$0.2505
europe-west6
$0.3184
europe-west9
$0.3186
e2-highcpu-16approximations:
europe-west1
$0.5006
europe-west2
$0.5864
europe-west3
$0.5864
europe-west4
$0.501
europe-west6
$0.6368
europe-west9
$0.6372
e2-highcpu-32approximations:
europe-west1
$1.0013
europe-west2
$1.1728
europe-west3
$1.1728
europe-west4
$1.0021
europe-west6
$1.2736
europe-west9
$1.2743
N1 Series
n1-standard-2approximations:
europe-west2
$0.1408
Other Europe regions
$0.1265
n1-standard-4approximations:
europe-west2
$0.2815
Other Europe regions
$0.2531
n1-standard-8approximations:
europe-west2
$0.563
Other Europe regions
$0.5061
n1-standard-16approximations:
europe-west2
$1.126
Other Europe regions
$1.0123
n1-standard-32approximations:
europe-west2
$2.2521
Other Europe regions
$2.0245
n1-highmem-2approximations:
europe-west2
$0.1753
Other Europe regions
$0.1575
n1-highmem-4approximations:
europe-west2
$0.3506
Other Europe regions
$0.3151
n1-highmem-8approximations:
europe-west2
$0.7011
Other Europe regions
$0.6302
n1-highmem-16approximations:
europe-west2
$1.4022
Other Europe regions
$1.2603
n1-highcpu-2approximations:
europe-west2
$0.105
Other Europe regions
$0.0944
n1-highcpu-4approximations:
europe-west2
$0.21
Other Europe regions
$0.1888
n1-highcpu-8approximations:
europe-west2
$0.4199
Other Europe regions
$0.3776
n1-highcpu-16approximations:
europe-west2
$0.8398
Other Europe regions
$0.7552
n1-highcpu-32approximations:
europe-west2
$1.6796
Other Europe regions
$1.5104
N2 Series
n2-standard-2approximations:
europe_central2
$0.1439
europe_west1
$0.1229
europe_west2
$0.1439
europe_west3
$0.1439
europe_west4
$0.1229
europe_west6
$0.1564
europe_west9
$0.1296
n2-standard-4approximations:
europe_central2
$0.2878
europe_west1
$0.2457
europe_west2
$0.2878
europe_west3
$0.2878
europe_west4
$0.2457
europe_west6
$0.3127
europe_west9
$0.2591
n2-standard-8approximations:
europe_central2
$0.5756
europe_west1
$0.4914
europe_west2
$0.5756
europe_west3
$0.5756
europe_west4
$0.4914
europe_west6
$0.6254
europe_west9
$0.5182
n2-standard-16approximations:
europe_central2
$1.1511
europe_west1
$0.9829
europe_west2
$1.1511
europe_west3
$1.1511
europe_west4
$0.9828
europe_west6
$1.2508
europe_west9
$1.0364
n2-standard-32approximations:
europe_central2
$2.3023
europe_west1
$1.9658
europe_west2
$2.3023
europe_west3
$2.3023
europe_west4
$1.9657
europe_west6
$2.5017
europe_west9
$2.0729
n2-highmem-2approximations:
europe_central2
$0.1941
europe_west1
$0.1657
europe_west2
$0.1941
europe_west3
$0.1941
europe_west4
$0.1657
europe_west6
$0.2109
europe_west9
$0.1748
n2-highmem-4approximations:
europe_central2
$0.3882
europe_west1
$0.3315
europe_west2
$0.3882
europe_west3
$0.3882
europe_west4
$0.3315
europe_west6
$0.4218
europe_west9
$0.3495
n2-highmem-8approximations:
europe_central2
$0.7764
europe_west1
$0.663
europe_west2
$0.7764
europe_west3
$0.7764
europe_west4
$0.6629
europe_west6
$0.8436
europe_west9
$0.6991
n2-highmem-16approximations:
europe_central2
$1.5528
europe_west1
$1.3259
europe_west2
$1.5528
europe_west3
$1.5528
europe_west4
$1.3259
europe_west6
$1.6873
europe_west9
$1.3982
n2-highcpu-2approximations:
europe_central2
$0.1062
europe_west1
$0.0907
europe_west2
$0.1062
europe_west3
$0.1062
europe_west4
$0.0907
europe_west6
$0.1154
europe_west9
$0.0956
n2-highcpu-4approximations:
europe_central2
$0.2125
europe_west1
$0.1814
europe_west2
$0.2125
europe_west3
$0.2125
europe_west4
$0.1814
europe_west6
$0.2309
europe_west9
$0.1913
n2-highcpu-8approximations:
europe_central2
$0.4249
europe_west1
$0.3628
europe_west2
$0.4249
europe_west3
$0.4249
europe_west4
$0.3628
europe_west6
$0.4617
europe_west9
$0.3826
n2-highcpu-16approximations:
europe_central2
$0.8499
europe_west1
$0.7256
europe_west2
$0.8499
europe_west3
$0.8499
europe_west4
$0.7256
europe_west6
$0.9235
europe_west9
$0.7651
n2-highcpu-32approximations:
europe_central2
$1.6997
europe_west1
$1.4512
europe_west2
$1.6997
europe_west3
$1.6997
europe_west4
$1.4511
europe_west6
$1.847
europe_west9
$1.5303
N2D Series
n2d-standard-2approximations:
europe_west1
$0.1069
europe_west2
$0.1252
europe_west3
$0.1252
europe_west4
$0.107
europe_west9
$0.1127
n2d-standard-4approximations:
europe_west1
$0.2138
europe_west2
$0.2504
europe_west3
$0.2504
europe_west4
$0.2139
europe_west9
$0.2254
n2d-standard-8approximations:
europe_west1
$0.4275
europe_west2
$0.5007
europe_west3
$0.5007
europe_west4
$0.4279
europe_west9
$0.4509
n2d-standard-16approximations:
europe_west1
$0.8551
europe_west2
$1.0015
europe_west3
$1.0015
europe_west4
$0.8558
europe_west9
$0.9017
n2d-standard-32approximations:
europe_west1
$1.7102
europe_west2
$2.0029
europe_west3
$2.0029
europe_west4
$1.7116
europe_west9
$1.8034
n2d-highmem-2approximations:
europe_west1
$0.1442
europe_west2
$0.1689
europe_west3
$0.1689
europe_west4
$0.1443
europe_west9
$0.1521
n2d-highmem-4approximations:
europe_west1
$0.2884
europe_west2
$0.3377
europe_west3
$0.3377
europe_west4
$0.2886
europe_west9
$0.3041
n2d-highmem-8approximations:
europe_west1
$0.5768
europe_west2
$0.6755
europe_west3
$0.6755
europe_west4
$0.5772
europe_west9
$0.6082
n2d-highmem-16approximations:
europe_west1
$1.1535
europe_west2
$1.3509
europe_west3
$1.3509
europe_west4
$1.1545
europe_west9
$1.2164
n2d-highcpu-2approximations:
europe_west1
$0.0789
europe_west2
$0.0924
europe_west3
$0.0924
europe_west4
$0.079
europe_west9
$0.0832
n2d-highcpu-4approximations:
europe_west1
$0.1578
europe_west2
$0.1848
europe_west3
$0.1848
europe_west4
$0.1579
europe_west9
$0.1664
n2d-highcpu-8approximations:
europe_west1
$0.3156
europe_west2
$0.3697
europe_west3
$0.3697
europe_west4
$0.3159
europe_west9
$0.3328
n2d-highcpu-16approximations:
europe_west1
$0.6313
europe_west2
$0.7394
europe_west3
$0.7394
europe_west4
$0.6318
europe_west9
$0.6657
n2d-highcpu-32approximations:
europe_west1
$1.2625
europe_west2
$1.4787
europe_west3
$1.4787
europe_west4
$1.2636
europe_west9
$1.3314
C2 Series
c2-standard-4approximations:
europe_west1
$0.2641
europe_west2
$0.3094
europe_west3
$0.3092
europe_west4
$0.2643
europe_west6
$0.3362
c2-standard-8approximations:
europe_west1
$0.5283
europe_west2
$0.6187
europe_west3
$0.6184
europe_west4
$0.5285
europe_west6
$0.6724
c2-standard-16approximations:
europe_west1
$1.0565
europe_west2
$1.2375
europe_west3
$1.2368
europe_west4
$1.0571
europe_west6
$1.3449
c2-standard-30approximations:
europe_west1
$1.981
europe_west2
$2.3202
europe_west3
$2.3191
europe_west4
$1.982
europe_west6
$2.5216
c2-standard-60approximations:
europe_west1
$3.962
europe_west2
$4.6404
europe_west3
$4.6382
europe_west4
$3.964
europe_west6
$5.0432
C2D Series
c2d-standard-2approximations:
europe_west1
$0.115
europe_west2
$0.1345
europe_west3
$0.1345
europe_west4
$0.115
c2d-standard-4approximations:
europe_west1
$0.2299
europe_west2
$0.269
europe_west3
$0.269
europe_west4
$0.2299
c2d-standard-8approximations:
europe_west1
$0.4599
europe_west2
$0.5381
europe_west3
$0.5381
europe_west4
$0.4599
c2d-standard-16approximations:
europe_west1
$0.9198
europe_west2
$1.0762
europe_west3
$1.0762
europe_west4
$0.9198
c2d-standard-32approximations:
europe_west1
$1.8395
europe_west2
$2.1524
europe_west3
$2.1524
europe_west4
$1.8395
c2d-standard-56approximations:
europe_west1
$3.2191
europe_west2
$3.7666
europe_west3
$3.7666
europe_west4
$3.2191
c2d-standard-112approximations:
europe_west1
$6.4383
europe_west2
$7.5333
europe_west3
$7.5333
europe_west4
$6.4383
c2d-highmem-2approximations:
europe_west1
$0.1551
europe_west2
$0.1814
europe_west3
$0.1814
europe_west4
$0.1551
c2d-highmem-4approximations:
europe_west1
$0.3101
europe_west2
$0.3629
europe_west3
$0.3629
europe_west4
$0.3101
c2d-highmem-8approximations:
europe_west1
$0.6203
europe_west2
$0.7258
europe_west3
$0.7258
europe_west4
$0.6203
c2d-highmem-16approximations:
europe_west1
$1.2406
europe_west2
$1.4515
europe_west3
$1.4515
europe_west4
$1.2406
c2d-highmem-32approximations:
europe_west1
$2.4812
europe_west2
$2.9031
europe_west3
$2.9031
europe_west4
$2.4812
c2d-highmem-56approximations:
europe_west1
$4.342
europe_west2
$5.0804
europe_west3
$5.0804
europe_west4
$4.342
c2d-highmem-112approximations:
europe_west1
$8.684
europe_west2
$10.1608
europe_west3
$10.1608
europe_west4
$8.684
c2d-highcpu-2approximations:
europe_west1
$0.0949
europe_west2
$0.1111
europe_west3
$0.1111
europe_west4
$0.0949
c2d-highcpu-4approximations:
europe_west1
$0.1898
europe_west2
$0.2221
europe_west3
$0.2221
europe_west4
$0.1898
c2d-highcpu-8approximations:
europe_west1
$0.3797
europe_west2
$0.4442
europe_west3
$0.4442
europe_west4
$0.3797
c2d-highcpu-16approximations:
europe_west1
$0.7593
europe_west2
$0.8885
europe_west3
$0.8885
europe_west4
$0.7593
c2d-highcpu-32approximations:
europe_west1
$1.5187
europe_west2
$1.777
europe_west3
$1.777
europe_west4
$1.5187
c2d-highcpu-56approximations:
europe_west1
$2.6577
europe_west2
$3.1097
europe_west3
$3.1097
europe_west4
$2.6577
c2d-highcpu-112approximations:
europe_west1
$5.3154
europe_west2
$6.2195
europe_west3
$6.2195
europe_west4
$5.3154
C3 Series
c3-highcpu-4approximations:
europe_west1
$0.218
europe_west4
$0.2182
c3-highcpu-8approximations:
europe_west1
$0.4361
europe_west4
$0.4365
c3-highcpu-22approximations:
europe_west1
$1.1992
europe_west4
$1.2003
c3-highcpu-44approximations:
europe_west1
$2.3984
europe_west4
$2.4006
c3-highcpu-88approximations:
europe_west1
$4.7969
europe_west4
$4.8013
c3-highcpu-176approximations:
europe_west1
$9.5938
europe_west4
$9.6026
A2 Series*
a2-highgpu-1gapproximations:
europe-west4
$4.3103
a2-highgpu-2gapproximations:
europe-west4
$8.6205
a2-highgpu-4gapproximations:
europe-west4
$17.2411
a2-highgpu-8gapproximations:
europe-west4
$34.4822
a2-megagpu-16gapproximations:
europe-west4
$65.1222
a2-ultragpu-1gapproximations:
europe-west4
$6.3661
a2-ultragpu-2gapproximations:
europe-west4
$12.7321
a2-ultragpu-4gapproximations:
europe-west4
$25.4643
a2-ultragpu-8gapproximations:
europe-west4
$50.9286
A3 Series*
a3-highgpu-2gapproximations:
europe-west4
$32.147
a3-highgpu-4gapproximations:
europe-west4
$64.294
a3-highgpu-8gapproximations:
europe-west4
$128.588
G2 Series*
g2-standard-4approximations:
europe-west4
$0.8951
g2-standard-8approximations:
europe-west4
$1.081
g2-standard-12approximations:
europe-west4
$1.2669
g2-standard-16approximations:
europe-west4
$1.4528
g2-standard-24approximations:
europe-west4
$2.5338
g2-standard-32approximations:
europe-west4
$2.1965
g2-standard-48approximations:
europe-west4
$5.0677
g2-standard-96approximations:
europe-west4
$10.1354
*When consuming from a reservation or spot capacity, billing is spread
across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and
the Vertex AI Management Fee SKU. This enables you to use your Committed Use
Discounts (CUDs) in Vertex AI.
Pricing for Asia Pacific
The following tables provide the price per node hour for each machine type.
E2 Series
e2-standard-2approximations:
asia-east1
$0.0892
asia-east2
$0.1078
asia-northeast1
$0.0989
asia-northeast3
$0.0989
asia-south1
$0.0926
asia-southeast1
$0.0951
australia-southeast1
$0.1093
e2-standard-4approximations:
asia-east1
$0.1785
asia-east2
$0.2156
asia-northeast1
$0.1977
asia-northeast3
$0.1977
asia-south1
$0.1851
asia-southeast1
$0.1901
australia-southeast1
$0.2187
e2-standard-8approximations:
asia-east1
$0.3569
asia-east2
$0.4313
asia-northeast1
$0.3954
asia-northeast3
$0.3954
asia-south1
$0.3702
asia-southeast1
$0.3802
australia-southeast1
$0.4373
e2-standard-16approximations:
asia-east1
$0.7138
asia-east2
$0.8626
asia-northeast1
$0.7909
asia-northeast3
$0.7909
asia-south1
$0.7405
asia-southeast1
$0.7605
australia-southeast1
$0.8747
e2-standard-32approximations:
asia-east1
$1.4276
asia-east2
$1.7251
asia-northeast1
$1.5817
asia-northeast3
$1.5817
asia-south1
$1.4809
asia-southeast1
$1.5209
australia-southeast1
$1.7494
e2-highmem-2approximations:
asia-east1
$0.1204
asia-east2
$0.1454
asia-northeast1
$0.1333
asia-northeast3
$0.1333
asia-south1
$0.1249
asia-southeast1
$0.1282
australia-southeast1
$0.1475
e2-highmem-4approximations:
asia-east1
$0.2407
asia-east2
$0.2909
asia-northeast1
$0.2665
asia-northeast3
$0.2665
asia-south1
$0.2497
asia-southeast1
$0.2564
australia-southeast1
$0.295
e2-highmem-8approximations:
asia-east1
$0.4815
asia-east2
$0.5818
asia-northeast1
$0.533
asia-northeast3
$0.533
asia-south1
$0.4994
asia-southeast1
$0.5129
australia-southeast1
$0.59
e2-highmem-16approximations:
asia-east1
$0.963
asia-east2
$1.1636
asia-northeast1
$1.0661
asia-northeast3
$1.0661
asia-south1
$0.9989
asia-southeast1
$1.0258
australia-southeast1
$1.1799
e2-highcpu-2approximations:
asia-east1
$0.0659
asia-east2
$0.0796
asia-northeast1
$0.0731
asia-northeast3
$0.0731
asia-south1
$0.0683
asia-southeast1
$0.0702
australia-southeast1
$0.0807
e2-highcpu-4approximations:
asia-east1
$0.1317
asia-east2
$0.1592
asia-northeast1
$0.1461
asia-northeast3
$0.1461
asia-south1
$0.1367
asia-southeast1
$0.1404
australia-southeast1
$0.1614
e2-highcpu-8approximations:
asia-east1
$0.2635
asia-east2
$0.3184
asia-northeast1
$0.2922
asia-northeast3
$0.2922
asia-south1
$0.2733
asia-southeast1
$0.2807
australia-southeast1
$0.3229
e2-highcpu-16approximations:
asia-east1
$0.527
asia-east2
$0.6368
asia-northeast1
$0.5845
asia-northeast3
$0.5845
asia-south1
$0.5467
asia-southeast1
$0.5615
australia-southeast1
$0.6458
e2-highcpu-32approximations:
asia-east1
$1.0539
asia-east2
$1.2736
asia-northeast1
$1.169
asia-northeast3
$1.169
asia-south1
$1.0933
asia-southeast1
$1.1229
australia-southeast1
$1.2916
N1 Series
n1-standard-2approximations:
asia-northeast1
$0.1402
asia-southeast1
$0.1348
australia-southeast1
$0.155
Other Asia Pacific regions
$0.1265
n1-standard-4approximations:
asia-northeast1
$0.2803
asia-southeast1
$0.2695
australia-southeast1
$0.31
Other Asia Pacific regions
$0.2531
n1-standard-8approximations:
asia-northeast1
$0.5606
asia-southeast1
$0.5391
australia-southeast1
$0.6201
Other Asia Pacific regions
$0.5061
n1-standard-16approximations:
asia-northeast1
$1.1213
asia-southeast1
$1.0782
australia-southeast1
$1.2401
Other Asia Pacific regions
$1.0123
n1-standard-32approximations:
asia-northeast1
$2.2426
asia-southeast1
$2.1564
australia-southeast1
$2.4802
Other Asia Pacific regions
$2.0245
n1-highmem-2approximations:
asia-northeast1
$0.1744
asia-southeast1
$0.1678
australia-southeast1
$0.193
Other Asia Pacific regions
$0.1575
n1-highmem-4approximations:
asia-northeast1
$0.3489
asia-southeast1
$0.3357
australia-southeast1
$0.3861
Other Asia Pacific regions
$0.3151
n1-highmem-8approximations:
asia-northeast1
$0.6977
asia-southeast1
$0.6713
australia-southeast1
$0.7721
Other Asia Pacific regions
$0.6302
n1-highmem-16approximations:
asia-northeast1
$1.3955
asia-southeast1
$1.3426
australia-southeast1
$1.5443
Other Asia Pacific regions
$1.2603
n1-highcpu-2approximations:
asia-northeast1
$0.1046
asia-southeast1
$0.1005
australia-southeast1
$0.1156
Other Asia Pacific regions
$0.0944
n1-highcpu-4approximations:
asia-northeast1
$0.2093
asia-southeast1
$0.201
australia-southeast1
$0.2312
Other Asia Pacific regions
$0.1888
n1-highcpu-8approximations:
asia-northeast1
$0.4186
asia-southeast1
$0.4021
australia-southeast1
$0.4624
Other Asia Pacific regions
$0.3776
n1-highcpu-16approximations:
asia-northeast1
$0.8371
asia-southeast1
$0.8041
australia-southeast1
$0.9249
Other Asia Pacific regions
$0.7552
n1-highcpu-32approximations:
asia-northeast1
$1.6742
asia-southeast1
$1.6082
australia-southeast1
$1.8498
Other Asia Pacific regions
$1.5104
N2 Series
n2-standard-2approximations:
asia_east1
$0.1293
asia_east2
$0.1563
asia_northeast1
$0.1433
asia_northeast3
$0.1433
asia_south1
$0.1341
asia_southeast1
$0.1378
asia_southeast2
$0.1502
australia_southeast1
$0.1585
n2-standard-4approximations:
asia_east1
$0.2586
asia_east2
$0.3125
asia_northeast1
$0.2866
asia_northeast3
$0.2866
asia_south1
$0.2683
asia_southeast1
$0.2756
asia_southeast2
$0.3003
australia_southeast1
$0.3169
n2-standard-8approximations:
asia_east1
$0.5173
asia_east2
$0.6251
asia_northeast1
$0.5731
asia_northeast3
$0.5731
asia_south1
$0.5366
asia_southeast1
$0.5511
asia_southeast2
$0.6007
australia_southeast1
$0.6339
n2-standard-16approximations:
asia_east1
$1.0346
asia_east2
$1.2502
asia_northeast1
$1.1462
asia_northeast3
$1.1462
asia_south1
$1.0731
asia_southeast1
$1.1022
asia_southeast2
$1.2014
australia_southeast1
$1.2678
n2-standard-32approximations:
asia_east1
$2.0691
asia_east2
$2.5003
asia_northeast1
$2.2924
asia_northeast3
$2.2924
asia_south1
$2.1462
asia_southeast1
$2.2044
asia_southeast2
$2.4028
australia_southeast1
$2.5355
n2-highmem-2approximations:
asia_east1
$0.1745
asia_east2
$0.2108
asia_northeast1
$0.1931
asia_northeast3
$0.1931
asia_south1
$0.181
asia_southeast1
$0.1859
asia_southeast2
$0.2026
australia_southeast1
$0.2138
n2-highmem-4approximations:
asia_east1
$0.3489
asia_east2
$0.4216
asia_northeast1
$0.3863
asia_northeast3
$0.3863
asia_south1
$0.3619
asia_southeast1
$0.3717
asia_southeast2
$0.4052
australia_southeast1
$0.4275
n2-highmem-8approximations:
asia_east1
$0.6978
asia_east2
$0.8432
asia_northeast1
$0.7725
asia_northeast3
$0.7725
asia_south1
$0.7238
asia_southeast1
$0.7434
asia_southeast2
$0.8103
australia_southeast1
$0.8551
n2-highmem-16approximations:
asia_east1
$1.3956
asia_east2
$1.6865
asia_northeast1
$1.545
asia_northeast3
$1.545
asia_south1
$1.4476
asia_southeast1
$1.4868
asia_southeast2
$1.6206
australia_southeast1
$1.7102
n2-highcpu-2approximations:
asia_east1
$0.0955
asia_east2
$0.1154
asia_northeast1
$0.1059
asia_northeast3
$0.1059
asia_south1
$0.099
asia_southeast1
$0.1017
asia_southeast2
$0.1109
australia_southeast1
$0.117
n2-highcpu-4approximations:
asia_east1
$0.1909
asia_east2
$0.2307
asia_northeast1
$0.2118
asia_northeast3
$0.2118
asia_south1
$0.1981
asia_southeast1
$0.2034
asia_southeast2
$0.2217
australia_southeast1
$0.234
n2-highcpu-8approximations:
asia_east1
$0.3819
asia_east2
$0.4615
asia_northeast1
$0.4235
asia_northeast3
$0.4235
asia_south1
$0.3961
asia_southeast1
$0.4069
asia_southeast2
$0.4435
australia_southeast1
$0.468
n2-highcpu-16approximations:
asia_east1
$0.7637
asia_east2
$0.9229
asia_northeast1
$0.8471
asia_northeast3
$0.8471
asia_south1
$0.7923
asia_southeast1
$0.8137
asia_southeast2
$0.887
australia_southeast1
$0.936
n2-highcpu-32approximations:
asia_east1
$1.5275
asia_east2
$1.8458
asia_northeast1
$1.6942
asia_northeast3
$1.6942
asia_south1
$1.5845
asia_southeast1
$1.6275
asia_southeast2
$1.7739
australia_southeast1
$1.8719
N2D Series
n2d-standard-2approximations:
asia_east1
$0.1125
asia_east2
$0.136
asia_northeast1
$0.1247
asia_south1
$0.0641
asia_southeast1
$0.1199
australia_southeast1
$0.1379
n2d-standard-4approximations:
asia_east1
$0.225
asia_east2
$0.2719
asia_northeast1
$0.2493
asia_south1
$0.1283
asia_southeast1
$0.2397
australia_southeast1
$0.2757
n2d-standard-8approximations:
asia_east1
$0.45
asia_east2
$0.5438
asia_northeast1
$0.4986
asia_south1
$0.2565
asia_southeast1
$0.4795
australia_southeast1
$0.5515
n2d-standard-16approximations:
asia_east1
$0.9001
asia_east2
$1.0876
asia_northeast1
$0.9972
asia_south1
$0.513
asia_southeast1
$0.959
australia_southeast1
$1.103
n2d-standard-32approximations:
asia_east1
$1.8001
asia_east2
$2.1752
asia_northeast1
$1.9945
asia_south1
$1.0261
asia_southeast1
$1.9179
australia_southeast1
$2.206
n2d-highmem-2approximations:
asia_east1
$0.1518
asia_east2
$0.1834
asia_northeast1
$0.168
asia_south1
$0.0865
asia_southeast1
$0.1617
australia_southeast1
$0.186
n2d-highmem-4approximations:
asia_east1
$0.3035
asia_east2
$0.3668
asia_northeast1
$0.3361
asia_south1
$0.173
asia_southeast1
$0.3234
australia_southeast1
$0.372
n2d-highmem-8approximations:
asia_east1
$0.6071
asia_east2
$0.7336
asia_northeast1
$0.6721
asia_south1
$0.346
asia_southeast1
$0.6468
australia_southeast1
$0.744
n2d-highmem-16approximations:
asia_east1
$1.2142
asia_east2
$1.4672
asia_northeast1
$1.3443
asia_south1
$0.6921
asia_southeast1
$1.2936
australia_southeast1
$1.4879
n2d-highcpu-2approximations:
asia_east1
$0.0831
asia_east2
$0.1004
asia_northeast1
$0.0921
asia_south1
$0.0473
asia_southeast1
$0.0885
australia_southeast1
$0.1018
n2d-highcpu-4approximations:
asia_east1
$0.1661
asia_east2
$0.2007
asia_northeast1
$0.1842
asia_south1
$0.0947
asia_southeast1
$0.177
australia_southeast1
$0.2036
n2d-highcpu-8approximations:
asia_east1
$0.3322
asia_east2
$0.4015
asia_northeast1
$0.3685
asia_south1
$0.1894
asia_southeast1
$0.354
australia_southeast1
$0.4071
n2d-highcpu-16approximations:
asia_east1
$0.6645
asia_east2
$0.8029
asia_northeast1
$0.737
asia_south1
$0.3787
asia_southeast1
$0.708
australia_southeast1
$0.8143
n2d-highcpu-32approximations:
asia_east1
$1.3289
asia_east2
$1.6059
asia_northeast1
$1.4739
asia_south1
$0.7575
asia_southeast1
$1.4159
australia_southeast1
$1.6286
C2 Series
c2-standard-4approximations:
asia_east1
$0.278
asia_east2
$0.336
asia_northeast1
$0.308
asia_northeast3
$0.308
asia_south1
$0.2884
asia_southeast1
$0.2962
australia_southeast1
$0.3407
c2-standard-8approximations:
asia_east1
$0.5561
asia_east2
$0.672
asia_northeast1
$0.6161
asia_northeast3
$0.6161
asia_south1
$0.5768
asia_southeast1
$0.5924
australia_southeast1
$0.6814
c2-standard-16approximations:
asia_east1
$1.1122
asia_east2
$1.3439
asia_northeast1
$1.2321
asia_northeast3
$1.2321
asia_south1
$1.1536
asia_southeast1
$1.1849
australia_southeast1
$1.3629
c2-standard-30approximations:
asia_east1
$2.0853
asia_east2
$2.5199
asia_northeast1
$2.3103
asia_northeast3
$2.3103
asia_south1
$2.1631
asia_southeast1
$2.2217
australia_southeast1
$2.5553
c2-standard-60approximations:
asia_east1
$4.1706
asia_east2
$5.0397
asia_northeast1
$4.6205
asia_northeast3
$4.6205
asia_south1
$4.3262
asia_southeast1
$4.4433
australia_southeast1
$5.1107
C2D Series
c2d-standard-2approximations:
asia_east1
$0.1209
asia_south1
$0.0689
asia_southeast1
$0.1288
c2d-standard-4approximations:
asia_east1
$0.2418
asia_south1
$0.1378
asia_southeast1
$0.2576
c2d-standard-8approximations:
asia_east1
$0.4836
asia_south1
$0.2757
asia_southeast1
$0.5153
c2d-standard-16approximations:
asia_east1
$0.9672
asia_south1
$0.5513
asia_southeast1
$1.0305
c2d-standard-32approximations:
asia_east1
$1.9345
asia_south1
$1.1027
asia_southeast1
$2.0611
c2d-standard-56approximations:
asia_east1
$3.3853
asia_south1
$1.9297
asia_southeast1
$3.6069
c2d-standard-112approximations:
asia_east1
$6.7706
asia_south1
$3.8593
asia_southeast1
$7.2137
c2d-highmem-2approximations:
asia_east1
$0.1631
asia_south1
$0.093
asia_southeast1
$0.1737
c2d-highmem-4approximations:
asia_east1
$0.3262
asia_south1
$0.1859
asia_southeast1
$0.3475
c2d-highmem-8approximations:
asia_east1
$0.6523
asia_south1
$0.3718
asia_southeast1
$0.695
c2d-highmem-16approximations:
asia_east1
$1.3046
asia_south1
$0.7436
asia_southeast1
$1.39
c2d-highmem-32approximations:
asia_east1
$2.6092
asia_south1
$1.4873
asia_southeast1
$2.78
c2d-highmem-56approximations:
asia_east1
$4.5662
asia_south1
$2.6028
asia_southeast1
$4.865
c2d-highmem-112approximations:
asia_east1
$9.1323
asia_south1
$5.2055
asia_southeast1
$9.7299
c2d-highcpu-2approximations:
asia_east1
$0.0998
asia_south1
$0.0569
asia_southeast1
$0.1063
c2d-highcpu-4approximations:
asia_east1
$0.1996
asia_south1
$0.1138
asia_southeast1
$0.2127
c2d-highcpu-8approximations:
asia_east1
$0.3993
asia_south1
$0.2276
asia_southeast1
$0.4254
c2d-highcpu-16approximations:
asia_east1
$0.7985
asia_south1
$0.4552
asia_southeast1
$0.8508
c2d-highcpu-32approximations:
asia_east1
$1.5971
asia_south1
$0.9104
asia_southeast1
$1.7016
c2d-highcpu-56approximations:
asia_east1
$2.7949
asia_south1
$1.5931
asia_southeast1
$2.9778
c2d-highcpu-112approximations:
asia_east1
$5.5898
asia_south1
$3.1862
asia_southeast1
$5.9556
C3 Series
c3-highcpu-4approximations:
asia_southeast1
$0.2445
c3-highcpu-8approximations:
asia_southeast1
$0.489
c3-highcpu-22approximations:
asia_southeast1
$1.3449
c3-highcpu-44approximations:
asia_southeast1
$2.6897
c3-highcpu-88approximations:
asia_southeast1
$5.3794
c3-highcpu-176approximations:
asia_southeast1
$10.7589
A2 Series*
a2-highgpu-1gapproximations:
asia-northeast1
$4.6575
asia-northeast3
$4.6575
asia-southeast1
$4.6163
a2-highgpu-2gapproximations:
asia-northeast1
$9.3151
asia-northeast3
$9.3151
asia-southeast1
$9.2327
a2-highgpu-4gapproximations:
asia-northeast1
$18.6301
asia-northeast3
$18.6301
asia-southeast1
$18.4653
a2-highgpu-8gapproximations:
asia-northeast1
$37.2603
asia-northeast3
$37.2603
asia-southeast1
$36.9306
a2-megagpu-16gapproximations:
asia-northeast1
$70.0363
asia-northeast3
$70.0363
asia-southeast1
$69.5557
a2-ultragpu-1gapproximations:
asia-southeast1
$7.1328
a2-ultragpu-2gapproximations:
asia-southeast1
$14.2657
a2-ultragpu-4gapproximations:
asia-southeast1
$28.5314
a2-ultragpu-8gapproximations:
asia-southeast1
$57.0628
A3 Series*
a3-highgpu-2gapproximations:
asia-southeast1
$32.6472
a3-highgpu-4gapproximations:
asia-southeast1
$65.2945
a3-highgpu-8gapproximations:
asia-southeast1
$130.589
*When consuming from a reservation or spot capacity, billing is spread
across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and
the Vertex AI Management Fee SKU. This enables you to use your Committed Use
Discounts (CUDs) in Vertex AI.
Pricing for the Middle East
N2 Series
n2-standard-2approximations:
me_west1
$0.1229
n2-standard-4approximations:
me_west1
$0.2457
n2-standard-8approximations:
me_west1
$0.4914
n2-standard-16approximations:
me_west1
$0.9828
n2-standard-32approximations:
me_west1
$1.9657
n2-highmem-2approximations:
me_west1
$0.1657
n2-highmem-4approximations:
me_west1
$0.3315
n2-highmem-8approximations:
me_west1
$0.6629
n2-highmem-16approximations:
me_west1
$1.3259
n2-highcpu-2approximations:
me_west1
$0.0907
n2-highcpu-4approximations:
me_west1
$0.1814
n2-highcpu-8approximations:
me_west1
$0.3628
n2-highcpu-16approximations:
me_west1
$0.7256
n2-highcpu-32approximations:
me_west1
$1.4511
N2D Series
n2d-standard-2approximations:
me_west1
$0.1069
n2d-standard-4approximations:
me_west1
$0.2138
n2d-standard-8approximations:
me_west1
$0.4275
n2d-standard-16approximations:
me_west1
$0.8551
n2d-standard-32approximations:
me_west1
$1.7101
n2d-highmem-2approximations:
me_west1
$0.1442
n2d-highmem-4approximations:
me_west1
$0.2884
n2d-highmem-8approximations:
me_west1
$0.5767
n2d-highmem-16approximations:
me_west1
$1.1535
n2d-highcpu-2approximations:
me_west1
$0.0789
n2d-highcpu-4approximations:
me_west1
$0.1578
n2d-highcpu-8approximations:
me_west1
$0.3156
n2d-highcpu-16approximations:
me_west1
$0.6312
n2d-highcpu-32approximations:
me_west1
$1.2625
Each machine type
is charged as the following SKUs on your Google Cloud bill:
vCPU cost: measured in vCPU hours
RAM cost: measured in GB hours
GPU cost: if either built into the machine or optionally configured,
measured in GPU hours
The prices for machine types are used to approximate the
total hourly cost for each prediction node of a model version using that machine
type.
For example, a machine type of n1-highcpu-32 includes 32 vCPUs and 32 GB of RAM.
Therefore, the hourly pricing equals 32 vCPU hours + 32 GB hours.
The SKU pricing table is available by region. Each table shows the vCPU, RAM,
and built-in GPU pricing for prediction machine types, which more precisely
reflect the SKUs charged.
To view the SKU pricing per region, choose a region to view its pricing table:
*When consuming from a reservation or spot capacity, billing is spread
across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and
the Vertex AI Management Fee SKU. This enables you to use your Committed Use
Discounts (CUDs) in Vertex AI.
SKU pricing for Europe
E2 Series
vCPU
Location
Price per hour
Belgium (europe-west1)
$0.0275919 per vCPU hour
London (europe-west2)
$0.0323184 per vCPU hour
Frankfurt (europe-west3)
$0.0323184 per vCPU hour
Netherlands (europe-west4)
$0.0276149 per vCPU hour
Zurich (europe-west6)
$0.0350968 per vCPU hour
Paris (europe-west9)
$0.0351164 per vCPU hour
RAM
Location
Price per hour
Belgium (europe-west1)
$0.0036984 per GB hour
London (europe-west2)
$0.0043309 per GB hour
Frankfurt (europe-west3)
$0.0043309 per GB hour
Netherlands (europe-west4)
$0.0037007 per GB hour
Zurich (europe-west6)
$0.0047035 per GB hour
Paris (europe-west9)
$0.0047069 per GB hour
N1 Series
vCPU
Location
Price per hour
London (europe-west2)
$0.0468395 per vCPU hour
Other Europe regions
$0.0421268 per vCPU hour
RAM
Location
Price per hour
London (europe-west2)
$0.0062767 per GB hour
Other Europe regions
$0.0056373 per GB hour
N2 Series
vCPU
Location
Price per hour
Warsaw (europe_central2)
$0.0468395 per vCPU hour
Belgium (europe_west1)
$0.0399889 per vCPU hour
London (europe_west2)
$0.0468395 per vCPU hour
Frankfurt (europe_west3)
$0.0468395 per vCPU hour
Netherlands (europe_west4)
$0.0399879 per vCPU hour
Zurich (europe_west6)
$0.050899 per vCPU hour
Paris (europe_west9)
$0.0421693 per vCPU hour
RAM
Location
Price per hour
Warsaw (europe_central2)
$0.0062767 per GB hour
Belgium (europe_west1)
$0.0053602 per GB hour
London (europe_west2)
$0.0062767 per GB hour
Frankfurt (europe_west3)
$0.0062767 per GB hour
Netherlands (europe_west4)
$0.0053598 per GB hour
Zurich (europe_west6)
$0.0068195 per GB hour
Paris (europe_west9)
$0.0056522 per GB hour
N2D Series
vCPU
Location
Price per hour
Belgium (europe_west1)
$0.0347909 per vCPU hour
London (europe_west2)
$0.0407502 per vCPU hour
Frankfurt (europe_west3)
$0.0407502 per vCPU hour
Netherlands (europe_west4)
$0.0348197 per vCPU hour
Paris (europe_west9)
$0.0366873 per vCPU hour
RAM
Location
Price per hour
Belgium (europe_west1)
$0.0046632 per GB hour
London (europe_west2)
$0.0054602 per GB hour
Frankfurt (europe_west3)
$0.0054602 per GB hour
Netherlands (europe_west4)
$0.0046667 per GB hour
Paris (europe_west9)
$0.0049174 per GB hour
C2 Series
vCPU
Location
Price per hour
Belgium (europe_west1)
$0.042987 per vCPU hour
London (europe_west2)
$0.0503527 per vCPU hour
Frankfurt (europe_west3)
$0.050347 per vCPU hour
Netherlands (europe_west4)
$0.0430215 per vCPU hour
Zurich (europe_west6)
$0.0547055 per vCPU hour
RAM
Location
Price per hour
Belgium (europe_west1)
$0.0057615 per GB hour
London (europe_west2)
$0.006747 per GB hour
Frankfurt (europe_west3)
$0.006739 per GB hour
Netherlands (europe_west4)
$0.0057615 per GB hour
Zurich (europe_west6)
$0.007337 per GB hour
C2D Series
vCPU
Location
Price per hour
London (europe_west2)
$0.0438012 per vCPU hour
Netherlands (europe_west4)
$0.0374336 per vCPU hour
RAM
Location
Price per hour
London (europe_west2)
$0.005865 per GB hour
Netherlands (europe_west4)
$0.0050128 per GB hour
C3 Series
vCPU
Location
Price per hour
London (europe_west1)
$0.04299 per vCPU hour
Netherlands (europe_west4)
$0.04302 per vCPU hour
RAM
Location
Price per hour
London (europe_west1)
$0.00576 per GB hour
Netherlands (europe_west4)
$0.00577 per GB hour
A2 Series*
vCPU
Location
Price per hour
Netherlands (europe-west4)
$0.0400223 per vCPU hour
RAM
Location
Price per hour
Netherlands (europe-west4)
$0.0053636 per GB hour
GPU
Location
Price per hour
Netherlands (europe-west4)
$3.3741 per GPU hour (A100 40GB)
Netherlands (europe-west4)
$4.97399 per GPU hour (A100 80GB)
G2 Series*
vCPU
Location
Price per hour
Netherlands (europe-west4)
$0.03164 per vCPU hour
RAM
Location
Price per hour
Netherlands (europe-west4)
$0.00371 per GB hour
GPU
Location
Price per hour
Netherlands (europe-west4)
$0.70916 per GPU hour
*When consuming from a reservation or spot capacity, billing is spread
across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and
the Vertex AI Management Fee SKU. This enables you to use your Committed Use
Discounts (CUDs) in Vertex AI.
SKU pricing for Asia Pacific
E2 Series
E2 prediction machine type SKUs
vCPU
Location
Price per hour
Taiwan (asia-east1)
$0.0290432 per vCPU hour
Hong Kong (asia-east2)
$0.0350968 per vCPU hour
Tokyo (asia-northeast1)
$0.0322299 per vCPU hour
Seoul (asia-northeast3)
$0.0322299 per vCPU hour
Mumbai (asia-south1)
$0.0301288 per vCPU hour
Singapore (asia-southeast1)
$0.0309453 per vCPU hour
Sydney (australia-southeast1)
$0.0355925 per vCPU hour
RAM
Location
Price per hour
Taiwan (asia-east1)
$0.0038927 per GB hour
Hong Kong (asia-east2)
$0.0047035 per GB hour
Tokyo (asia-northeast1)
$0.0042999 per GB hour
Seoul (asia-northeast3)
$0.0042999 per GB hour
Mumbai (asia-south1)
$0.0040376 per GB hour
Singapore (asia-southeast1)
$0.0041458 per GB hour
Sydney (australia-southeast1)
$0.004769 per GB hour
N1 Series
N1 Prediction machine type SKUs
vCPU
Location
Price per hour
Tokyo (asia-northeast1)
$0.0467107 per vCPU hour
Singapore (asia-southeast1)
$0.04484885 per vCPU hour
Sydney (australia-southeast1)
$0.0515844 per vCPU hour
Other Asia Pacific regions
$0.0421268 per vCPU hour
RAM
Location
Price per hour
Tokyo (asia-northeast1)
$0.00623185 per GB hour
Singapore (asia-southeast1)
$0.0060099 per GB hour
Sydney (australia-southeast1)
$0.00691265 per GB hour
Other Asia Pacific regions
$0.0056373 per GB hour
N2 Series
vCPU
Location
Price per hour
Taiwan (asia_east1)
$0.0420923 per vCPU hour
Hong Kong (asia_east2)
$0.0508656 per vCPU hour
Tokyo (asia_northeast1)
$0.0467107 per vCPU hour
Seoul (asia_northeast3)
$0.0467107 per vCPU hour
Mumbai (asia_south1)
$0.0436655 per vCPU hour
Singapore (asia_southeast1)
$0.0448488 per vCPU hour
Jakarta (asia_southeast2)
$0.0488853 per vCPU hour
Sydney (australia_southeast1)
$0.0515844 per vCPU hour
RAM
Location
Price per hour
Taiwan (asia_east1)
$0.0056419 per GB hour
Hong Kong (asia_east2)
$0.0068172 per GB hour
Tokyo (asia_northeast1)
$0.0062318 per GB hour
Seoul (asia_northeast3)
$0.0062318 per GB hour
Mumbai (asia_south1)
$0.0058512 per GB hour
Singapore (asia_southeast1)
$0.0060099 per GB hour
Jakarta (asia_southeast2)
$0.0065504 per GB hour
Sydney (australia_southeast1)
$0.0069126 per GB hour
N2D Series
vCPU
Location
Price per hour
Taiwan (asia_east1)
$0.0366206 per vCPU hour
Hong Kong (asia_east2)
$0.0442531 per vCPU hour
Tokyo (asia_northeast1)
$0.0406387 per vCPU hour
Mumbai (asia_south1)
$0.0208725 per vCPU hour
Singapore (asia_southeast1)
$0.0390184 per vCPU hour
Sydney (australia_southeast1)
$0.0448787 per vCPU hour
RAM
Location
Price per hour
Taiwan (asia_east1)
$0.0049082 per GB hour
Hong Kong (asia_east2)
$0.0059305 per GB hour
Tokyo (asia_northeast1)
$0.0054222 per GB hour
Mumbai (asia_south1)
$0.0027979 per GB hour
Singapore (asia_southeast1)
$0.005229 per GB hour
Sydney (australia_southeast1)
$0.0060145 per GB hour
C2 Series
vCPU
Location
Price per hour
Taiwan (asia_east1)
$0.045249 per vCPU hour
Hong Kong (asia_east2)
$0.0546802 per vCPU hour
Tokyo (asia_northeast1)
$0.0502136 per vCPU hour
Seoul (asia_northeast3)
$0.0502136 per vCPU hour
Mumbai (asia_south1)
$0.0469407 per vCPU hour
Singapore (asia_southeast1)
$0.0482126 per vCPU hour
Sydney (australia_southeast1)
$0.055453 per vCPU hour
RAM
Location
Price per hour
Taiwan (asia_east1)
$0.0060651 per GB hour
Hong Kong (asia_east2)
$0.0073289 per GB hour
Tokyo (asia_northeast1)
$0.0066987 per GB hour
Seoul (asia_northeast3)
$0.0066987 per GB hour
Mumbai (asia_south1)
$0.0062905 per GB hour
Singapore (asia_southeast1)
$0.0064607 per GB hour
Sydney (australia_southeast1)
$0.0074313 per GB hour
C2D Series
vCPU
Location
Price per hour
Taiwan (asia_east1)
$0.0393656 per vCPU hour
Singapore (asia_southeast1)
$0.0419417 per vCPU hour
RAM
Location
Price per hour
Taiwan (asia_east1)
$0.0052716 per GB hour
Singapore (asia_southeast1)
$0.0056166 per GB hour
C3 Series
vCPU
Location
Price per hour
Singapore (asia_southeast1)
$0.04821 per vCPU hour
RAM
Location
Price per hour
Singapore (asia_southeast1)
$0.00646 per GB hour
A2 Series*
A2 Prediction machine type SKUs
vCPU
Location
Price per hour
Tokyo (asia-northeast1)
$0.0467107 per vCPU hour
Seoul (asia-northeast3)
$0.0467107 per vCPU hour
Singapore (asia-southeast1)
$0.0448488 per vCPU hour
RAM
Location
Price per hour
Tokyo (asia-northeast1)
$0.00623185 per GB hour
Seoul (asia-northeast3)
$0.0062318 per GB hour
Singapore (asia-southeast1)
$0.0060099 per GB hour
GPU
Location
Price per hour
Tokyo (asia-northeast1)
$3.5673 per GPU hour (A100 40GB)
Seoul (asia-northeast3)
$3.5673 per GPU hour (A100 40GB)
Singapore (asia-southeast1)
$3.5673 per GPU hour (A100 40GB)
Singapore (asia-southeast1)
$5.57298 per GPU hour (A100 80GB)
*When consuming from a reservation or spot capacity, billing is spread
across two SKUs: the GCE SKU with the label 'vertex-ai-online-prediction' and
the Vertex AI Management Fee SKU. This enables you to use your Committed Use
Discounts (CUDs) in Vertex AI.
SKU pricing for Middle East
N2 Series
vCPU
Location
Price per hour
Tel Aviv (me_west1)
$0.0399879 per vCPU hour
RAM
Location
Price per hour
Tel Aviv (me_west1)
$0.0053598 per GB hour
N2D Series
vCPU
Location
Price per hour
Tel Aviv (me_west1)
$0.03479 per vCPU hour
RAM
Location
Price per hour
Tel Aviv (me_west1)
$0.0046628 per GB hour
Some machine types allow you to add optional
GPU
accelerators for prediction. Optional GPUs incur an additional charge,
separate from those described in the previous table. View each pricing table,
which describes the pricing for each type of optional GPU.
The Americas
Accelerators - price per hour
NVIDIA_TESLA_P4
Iowa (us-central1)
$0.6900
N. Virginia (us-east4)
$0.6900
Montréal (northamerica-northeast1)
$0.7475
NVIDIA_TESLA_P100
Oregon (us-west1)
$1.6790
Iowa (us-central1)
$1.6790
South Carolina (us-east1)
$1.6790
NVIDIA_TESLA_T4
Oregon (us-west1)
$0.4025
Iowa (us-central1)
$0.4025
South Carolina (us-east1)
$0.4025
NVIDIA_TESLA_V100
Oregon (us-west1)
$2.8520
Iowa (us-central1)
$2.8520
Europe
Accelerators - price per hour
NVIDIA_TESLA_P4
Netherlands (europe-west4)
$0.7475
NVIDIA_TESLA_P100
Belgium (europe-west1)
$1.8400
NVIDIA_TESLA_T4
London (europe-west2)
$0.4715
Netherlands (europe-west4)
$0.4370
NVIDIA_TESLA_V100
Netherlands (europe-west4)
$2.9325
Asia Pacific
Accelerators - price per hour
NVIDIA_TESLA_P4
Singapore (asia-southeast1)
$0.7475
Sydney (australia-southeast1)
$0.7475
NVIDIA_TESLA_P100
Taiwan (asia-east1)
$1.8400
NVIDIA_TESLA_T4
Tokyo (asia-northeast1)
$0.4255
Singapore (asia-southeast1)
$0.4255
Seoul (asia-northeast3)
$0.4485
NVIDIA_TESLA_V100
Taiwan (asia-east1)
$2.932
Pricing is per GPU. If you use multiple GPUs per prediction node (or if your
version scales to use multiple nodes),the costs scale accordingly.
AI Platform Prediction serves predictions from your model by running a number of
virtual machines ("nodes"). By default, Vertex AI automatically
scales the number of nodes running at any time. For online prediction, the
number of nodes scales to meet demand. Each node can respond to multiple
prediction requests. For batch prediction, the number of nodes scales to reduce
the total time it takes to run a job. You can customize how prediction nodes
scale.
You are charged for the time that each node runs for your model, including:
When the node is processing a batch prediction job.
When the node is processing an online prediction request.
When the node is in a ready state for serving online predictions.
The cost of one node running for one hour is a node hour. The table of
prediction prices describes the
price of a node hour, which varies across regions and between online prediction
and batch prediction.
You can consume node hours in fractional increments. For example, one node
running for 30 minutes costs 0.5 node hours.
Cost calculations for Compute Engine (N1) machine types
The running time of a node is billed in 30-second increments. This means that
every 30 seconds, your project is billed for 30 seconds worth of whatever
vCPU, RAM, and GPU resources that your node is using at that moment.
More about automatic scaling of prediction nodes
Online prediction
Batch prediction
The priority of the scaling is to reduce the latency of individual
requests. The service keeps your model in a ready state for a few idle
minutes after servicing a request.
The priority of the scaling is to reduce the total elapsed time of
the job.
Scaling affects your total charges each month: the more numerous and
frequent your requests, the more nodes will be used.
Scaling should have little effect on the price of your job, though
there is some overhead involved in bringing up a new node.
You can choose to let the service scale in response to traffic
(automatic scaling) or you can specify a number of nodes to run
constantly to avoid latency (manual scaling).
If you choose automatic scaling, the number of nodes scales
automatically. For AI Platform Prediction legacy (MLS1) machine type
deployments, the number of nodes can scale down to zero for
no-traffic durations. Vertex AI deployments and
other types of AI Platform Prediction deployments cannot
scale down to zero nodes.
If you choose manual scaling, you specify a number of nodes to
keep running all the time. You are charged for all of the time
that these nodes are running, starting at the time of deployment
and persisting until you delete the model version.
You can affect scaling by setting a maximum number of nodes to use for
a batch prediction job, and by setting the number of nodes to keep
running for a model when you deploy it.
Batch prediction jobs are charged after job completion
Batch prediction jobs are charged after job completion, not incrementally during
the job. Any Cloud Billing budget alerts that you have configured aren't
triggered while a job is running. Before starting a large job, consider
running some cost benchmark jobs with small input data first.
Example of a prediction calculation
A real-estate company in an Americas region runs a weekly prediction of
housing values in areas it serves. In one month, it runs predictions for
four weeks in batches of 3920, 4277, 3849, and 3961. Jobs are
limited to one node and each instance takes an average of 0.72
seconds of processing.
First calculate the length of time that each job ran:
This example assumed jobs ran on a single node and took a consistent amount of
time per input instance. In real usage, make sure to account for multiple nodes
and use the actual amount of time each node spends running for your
calculations.
Charges for Vertex Explainable AI
Feature-based explanations
Feature-based explanations
come at no extra charge to prediction prices. However, explanations take longer
to process than normal predictions, so heavy usage of Vertex Explainable AI along with
auto-scaling could result in more nodes being started, which would increase
prediction charges.
When you upload a model or update a model's dataset, you are billed:
per node hour for the batch prediction job that is used to generate the
latent space representations of examples. This is billed at the same
rate as prediction.
a cost for building or updating indexes. This cost is the same as the
indexing costs for Vector Search,
which is number of examples * number of dimensions * 4 bytes per float * $3.00 per GB.
For example, if you have 1 million examples and 1,000 dimension latent
space, the cost is $12 (1,000,000 * 1,000 * 4 * 3.00 / 1,000,000,000).
When you deploy to an endpoint, you are billed per node hour for each node
in your endpoint. All compute associated with the endpoint is charged at
same rate as prediction. However,
because Example-based explanations require additional compute resources to
serve the Vector Search index, this results in more nodes being started
which increases prediction charges.
Vertex AI Neural Architecture Search
The following tables summarize the pricing in each region where
Neural Architecture Search is available.
Prices
The following tables provide the price per hour of various configurations.
You can choose a predefined scale tier or a custom configuration of selected
machine types. If you choose a custom
configuration, sum the costs of the virtual machines you use.
Accelerator-enabled legacy machine types include the cost of the accelerators in
their pricing. If you use Compute Engine machine types and attach
accelerators, the cost of the accelerators is separate. To calculate this cost,
multiply the prices in the following table of accelerators by the number of each type
of accelerator you use.
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
Notes:
All use is subject to the Neural Architecture Search quota policy.
You are required to store your data and program files in
Cloud Storage buckets during the Neural Architecture Search lifecycle.
See more about Cloud Storage usage.
The disk price is only charged when you configure the disk
size of each VM to be larger than 100 GB. There is no charge for the first
100 GB (the default disk size) of disk for each VM. For example, if you
configure each VM to have 105 GB of disk, then you are charged for 5 GB of
disk for each VM.
Required use of Cloud Storage
In addition to the costs described in this document, you are required to store
data and program files in Cloud Storage buckets during the
Neural Architecture Search lifecycle. This storage is subject to the
Cloud Storage pricing policy.
Required use of Cloud Storage includes:
Staging your training application package.
Storing your training input data.
Storing the output of your jobs.
Neural Architecture Search doesn't require long-term storage of these items.
You can remove the files as soon as the operation is complete.
Free operations for managing your resources
The resource management operations provided by Neural Architecture Search are
available free of charge. The Neural Architecture Search quota policy does limit
some of these operations.
Resource
Free operations
jobs
get, list, cancel
operations
get, list, cancel, delete
Vertex AI Pipelines
Vertex AI Pipelines charges a run execution fee of $0.03 per
Pipeline Run. You are not charged the execution fee during the Preview release.
You also pay for Google Cloud resources you use with
Vertex AI Pipelines, such as Compute Engine resources consumed by
pipeline components (charged at the same rate as for
Vertex AI training). Finally, you are responsible for the
cost of any services (such as Dataflow) called by your pipeline.
Vertex AI Feature Store
Vertex AI Feature Store is Generally Available (GA) since November 2023. For
information on the previous version of the product go to Vertex AI Feature Store (Legacy).
New Vertex AI Feature Store
The new Vertex AI Feature Store supports functionality across 2 types of operations:
Offline operations are operations to transfer, store,
retrieve and transform data in the offline store (BigQuery)
Online operations are operations to transfer data into the
online store(s) and operations on data while it is in the online store(s).
Offline Operations Pricing
Since BigQuery is used for offline operations, please refer to BigQuery pricing for functionality such as ingestion to the offline store, querying the offline
store, and offline storage.
Online Operations Pricing
For online operations, Vertex AI Feature Store charges for any GA features to
transfer data into the online store, serve data or store data. A node-hour
represents the time a virtual machine spends to complete an operation, charged
to the minute.
Optimized online serving and Bigtable online serving use different architectures, so their nodes are not comparable.
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
Online Operations Workload Estimates
Consider the following guidelines when estimating your workloads. The number of
nodes required for a given workload may differ across each serving approach.
Data processing:
Ingestion - One node can ingest approximately a minimum of 100 MB of data per hour into a Bigtable Online Store or an Optimized Online Store if no analytical functions are used.
Bigtable online serving: Each node can support approximately 15,000 QPS and up to 5TB of storage.
Optimized online serving: Performance is based on the machine type and replicas, which are automatically configured to minimize costs subject to the workload. Each node can have
a minimum of 2 and a maximum of 6 replicas for high availability and autoscaling. You're charged for the number of replicas accordingly. For more details, see the example monthly scenarios.
For non embeddings-related workloads, each node can support approximately 500 QPS and up to 200GB of storage.
For embeddings-related workloads, each node can support approximately 500 QPS and up to 4 GB of storage of 512 dimensional data.
You can view the number of nodes (with replicas) in the Metric Explorer:
Example Monthly Scenarios (assuming us-central1)
Data streaming workload - Bigtable online serving with 2.5TB of data
(1GB refreshed daily) and 1200QPS
Prices for Vertex AI Feature Store (Legacy) are based on the amount of feature
data in online and offline storage as well as the availability of online
serving. A node per hour represents the time a virtual machine spends serving
feature data or waiting in a ready state to handle feature data requests.
If you pay in a currency other than USD, the prices listed in your currency on
Cloud Platform SKUs
apply.
When you enable feature value monitoring, billing includes applicable charges above in addition to applicable charges that follow:
$3.50 per GB for all data analyzed. With snapshot analysis enabled, snapshots taken for data in Vertex AI Feature Store (Legacy) are included. With import feature analysis enabled, batches of ingested data are included.
Additional charges for other Vertex AI Feature Store (Legacy) operations used with feature value monitoring include the following:
The snapshot analysis feature periodically takes a snapshot of the feature values based on your configuration for the monitoring interval.
The charge for a snapshot export is the same as a regular batch export operation.
Snapshot Analysis Example
A data scientist enables feature value monitoring for their Vertex AI Feature Store (Legacy) and turns on monitoring for a daily snapshot analysis.
A pipeline runs daily for the entity types monitoring. The pipeline scans 2GB of data in Vertex AI Feature Store (Legacy) and exports a snapshot containing 0.1GB of data.
The total charge for one day's analysis is:
(0.1 GB * $3.50) + (2 GB * $0.005) = $0.36
Ingestion Analysis Example
A data scientist enables feature value monitoring for their Vertex AI Feature Store (Legacy) and turns on monitoring for ingestion operations.
An ingestion operation imports 1GB of data into Vertex AI Feature Store (Legacy).
The total charge for feature value monitoring is:
(1 GB * $3.50) = $3.50
Vertex ML Metadata
Metadata storage is measured in binary gigabytes (GiB), where 1 GiB is
1,073,741,824 bytes. This unit of measurement is also known as a gibibyte.
Vertex ML Metadata charges $10 per gibibyte (GiB) per month for
metadata storage. Prices are pro-rated per megabyte (MB). For example, if
you store 10 MB of metadata, you are charged $0.10 per month for that 10 MB
of metadata.
Prices are the same in all regions where Vertex ML Metadata is
supported.
Vertex AI TensorBoard
To use Vertex AI TensorBoard, request that the IAM administrator of the
project assign you to the role
"Vertex AI TensorBoard Web App User". The Vertex AI Administrator role
also has access.
Beginning in August 2023, Vertex AI TensorBoard pricing changed
from a per-user monthly license of $300/month to $10 GiB/month for data storage
of logs and metrics. This means no more subscription fees. You will only pay for the
storage you’ve used. See the
Vertex AI TensorBoard: Delete Outdated TensorBoard Experiments
tutorial for how to manage storage.
Vertex AI Vizier
Vertex AI Vizier is a black-box optimization service inside Vertex AI.
The Vertex AI Vizier pricing model consists of the following:
The first 100 Vertex AI Vizier trials per calendar month are available at no charge (trials using RANDOM_SEARCH and GRID_SEARCH do not count against this total).
After 100 Vertex AI Vizier trials, subsequent trials during the same calendar month are charged at $1 per trial (trials that use RANDOM_SEARCH or GRID_SEARCH incur no charges).
Vector Search
Pricing for Vector Search Approximate Nearest Neighbor service consists of:
Per node hour pricing for each VM used to host a deployed index.
A cost for building new indexes, updating existing indexes, and using streaming index updates.
Data processed during building and updating indexes is measured in binary
gigabytes (GiB), where 1 GiB is 1,073,741,824 bytes. This unit of measurement
is also known as a gibibyte.
Vector Search charges $3.00 per gibibyte (GiB) of data
processed in all regions. Vector Search charges $0.45/GiB ingested
for Streaming Update inserts.
The following tables summarize the pricing of an index serving in each region where
the Vector Search is available. The price corresponds to the machine type,
by region, and is charged per node hour.
The Americas
Region
e2-standard-2
e2-standard-16
e2-highmem-16
n2d-standard-32
n1-standard-16
n1-standard-32
us_central1
0.094
0.75
1.012
1.893
1.064
2.128
us_east1
0.094
0.75
1.012
1.893
1.064
2.128
us_east4
0.10
0.845
1.14
2.132
1.198
2.397
us_west1
0.094
0.75
1.012
1.893
1.064
2.128
us_west2
0.113
0.901
1.216
2.273
1.279
2.558
us_west3
0.113
0.901
1.216
N/A
1.279
2.558
us_west4
0.106
0.845
1.14
2.132
1.198
2.397
us_south1
0.111
0.886
1.195
N/A
N/A
N/A
northamerica_northeast1
0.103
0.826
1.115
2.084
1.172
2.343
northamerica_northeast2
0.103
0.826
1.115
N/A
N/A
N/A
southamerica_east1
0.149
1.191
1.607
3.004
1.69
3.38
Europe
Region
e2-standard-2
e2-standard-16
e2-highmem-16
n2d-standard-32
n1-standard-16
n1-standard-32
europe_central2
0.121
0.967
1.304
N/A
N/A
N/A
europe_north1
0.103
0.826
1.115
2.084
1.172
2.343
europe_west1
0.103
0.826
1.114
2.082
1.171
2.343
europe_west2
0.121
0.967
1.304
2.438
1.371
2.742
europe_west3
0.121
0.967
1.304
2.438
1.371
2.742
europe_west4
0.103
0.826
1.115
2.084
1.172
2.343
europe_west6
0.131
1.050
1.417
N/A
1.489
2.978
europe_west9
0.131
1.051
1.417
2.195
N/A
N/A
Asia Pacific
Region
e2-standard-2
e2-standard-16
e2-highmem-16
n2d-standard-32
n1-standard-16
n1-standard-32
asia_east1
0.109
0.869
1.172
2.191
1.232
2.464
asia_east2
0.131
1.050
1.417
2.648
1.489
2.978
asia_south1
0.113
0.901
1.216
1.249
1.278
2.556
asia_southeast1
0.116
0.926
1.249
2.335
1.313
2.625
asia_southeast2
0.126
1.009
1.361
N/A
N/A
N/A
asia_northeast1
0.12
0.963
1.298
2.428
1.366
2.733
asia_northeast2
0.12
0.963
1.298
2.428
1.366
2.733
asia_northeast3
0.12
0.963
1.298
N/A
1.367
2.733
australia_southeast1
0.133
1.065
1.436
2.686
1.51
3.02
Middle East
Region
e2-standard-2
e2-standard-16
e2-highmem-16
n2d-standard-32
n1-standard-16
n1-standard-32
me_west1
0.103
0.826
1.114
2.082
N/A
N/A
Vector Search pricing examples
Vector Search pricing is determined by the size of your data, the
amount of queries per second (QPS) you want to run, and the number of nodes you use.
To get your estimated serving cost, you need to calculate your total data size.
Your data size is the number of your embeddings/vectors* the number of dimensions
you have* 4 bytes per dimension. After you have the size of your data you can calculate
the serving cost and the building cost. The serving cost plus the building cost
equals your monthly total cost.
Building cost: data size(in GiB) * $3/GiB * # of updates/month
Streaming update: Vector Search uses heuristics-based metrics to determine when to trigger compaction. If the oldest uncompacted data is five days old, compaction is always triggered. You are billed for the cost of rebuilding the index at the same rate of a batch update, in addition to the streaming update costs.
Number of embeddings/vectors
Number of dimensions
Queries per second (QPS)
Machine Type
Nodes
Estimated monthly serving cost
2 million
128
100
e2-standard-2
1
$68
20 million
256
1,000
e2-standard-16
1
$547
20 million
256
3,000
e2-standard-16
3
$1,642
100 million
256
500
e2-highmem-16
2
$1,477
1 billion
100
500
e2-highmem-16
8
$5,910
All examples are based on machine types in us-central1.
The cost you incur will vary with recall rate and latency requirements. The estimated monthly
serving cost is directly related to the number of nodes used in the console.
To learn more about configuration parameters that affect cost, see
Configuration parameters which affect recall and latency.
If you have high queries per second (QPS), batching these queries can
reduce total costs up to 30%-40%.
Vertex AI Model Registry
The Vertex AI Model Registry is a central repository which tracks and lists your
models and model versions. You can import models into Vertex AI and they appear in
the Vertex AI Model Registry. There is no cost associated with having your models in the
Model Registry. Cost is only incurred when you deploy the model to an endpoint or
perform a batch prediction on the model. This cost is determined by the type of model you are deploying.
To learn more about pricing for deploying custom models from the Vertex AI Model Registry, see
Custom-trained models. To learn more about pricing for
deploying AutoML models, see Pricing for AutoML models.
Vertex AI Model Monitoring
Vertex AI enables you to monitor the continued effectiveness of your
model after you deploy it to production. For more information, see
Introduction to Vertex AI Model Monitoring.
When you use Vertex AI Model Monitoring, you are billed for the following:
$3.50 per GB for all data analyzed, including
the training data provided and prediction data logged in a BigQuery table.
Charges for other Google Cloud products that you use with Model Monitoring, such as BigQuery
storage or Batch Explain when attribution monitoring is enabled.
Vertex AI Model Monitoring is supported in the following regions: us-central1,
europe-west4, asia-east1, and asia-southeast1. Prices are the same for all
regions.
Data sizes are measured after they are converted to TfRecord format.
Training datasets incur a one-time charge when you set up a
Vertex AI Model Monitoring job.
Prediction Datasets consist of logs collected from the Online
Prediction service. As prediction requests arrive during different time windows,
the data for each time window is collected and the sum of the
data analyzed for each prediction window is used to calculate the charge.
Example:
A data scientist runs model monitoring on the prediction traffic belonging to
their model.
The model is trained from a BigQuery dataset. The data size after converting to
TfRecord is 1.5GB.
Prediction data logged between 1:00 - 2:00 p.m. is 0.1 GB,
between 3:00 - 4:00 p.m. is 0.2 GB.
The total price for setting up the model monitoring job is:
In addition to the costs mentioned previously,
you also pay for any Google Cloud resources that you use.
For example:
Data analysis services: You incur BigQuery costs when you
issue SQL queries within a notebook (see
BigQuery pricing).
Customer-managed encryption keys: You incur costs when you use
customer-managed encryption keys. Each time
your managed notebooks or
user-managed notebooks instance
uses a Cloud Key Management Service key, that operation is billed at the rate
of Cloud KMS key operations
(see Cloud Key Management Service pricing).
Deep Learning Containers, Deep Learning VM, and AI Platform Pipelines
For Deep Learning Containers, Deep Learning VM Images,
and AI Platform Pipelines,
pricing is calculated based on the compute and storage resources that you use.
These resources are charged at the same rate you currently
pay for Compute Engine and
Cloud Storage.
In addition to the compute and storage costs,
you also pay for any Google Cloud resources that you use.
For example:
Data analysis services: You incur BigQuery costs when you
issue SQL queries within a notebook (see
BigQuery pricing).
Customer-managed encryption keys: You incur costs when you use
customer-managed encryption keys. Each time
your managed notebooks or
user-managed notebooks instance
uses a Cloud Key Management Service key, that operation is billed at the rate
of Cloud KMS key operations
(see Cloud Key Management Service pricing).
Data labeling
Vertex AI enables you to request human labeling for a collection
of data that you plan to use to train a custom machine learning model.
Prices for the service are computed based on the type of labeling task.
For regular labeling tasks, the prices are determined by the number of
annotation units.
For an image classification task, units are determined the number of
images and the number of human labelers. For example, an image with
3 human labelers counts for 1 * 3 = 3 units. The price for single-label
and multi-label classification are the same.
For an image bounding box task, units are determined by the number of
bounding boxes identified in the images and the number of human labelers.
For example, if an image with 2 bounding boxes and 3 human labelers
counts for 2 * 3 = 6 units. Images without bounding boxes will not be
charged.
For an image segmentation/rotated box/polyline/polygon task, units are
determined in the same way as a image bounding box task.
For a video classification task, units are determined by the video length
(every 5 seconds is a price unit) and the number of human labelers. For
example, a 25 seconds video with 3 human labelers counts for 25 / 5 * 3 =
15 units. The price for single-label and multi-label classification are
the same.
For a video object tracking task, unit are determined by the number of
objects identified in the video and the number of human labelers. For
example, for a video with 2 objects and 3 human labelers, it counts for
2 * 3 = 6 units. Video without objects will not be charged.
For a video action recognition task, units are determined in the same way as a video
object tracking task.
For a text classification task, units are determined by text length
(every 50 words is a price unit) and the number of human labelers. For
example, one piece of text with 100 words and 3 human labelers counts for
100 / 50 * 3 = 6 units. The price for single-label and multi-label
classification is the same.
For a text sentiment task, units are determined in the same way as a text
classification task.
For a text entity extraction task, units are determined by text length
(every 50 words is a price unit), the number of entities identified, and
the number of human labelers. For example, a piece of text with 100 words,
2 entities identified, and 3 human labelers counts for 100 / 50 * 2 * 3 =
12 units. Text without entities will not be charged.
For image/video/text classification and text sentiment tasks, human labelers
may lose track of classes if the label set size is too large. As a result, we
send at most 20 classes to the human labelers at a time. For example, if
the label set size of a labeling task is 40, each data item will be sent for
human review 40 / 20 = 2 times, and we will charge 2 times of the price
(calculated above) accordingly.
For a labeling task that enables the custom labeler feature, each data item is
counted as 1 custom labeler unit.
For an active learning labeling task for data items with annotations that
are generated by models (without a human labeler's help), each data item is
counted as 1 active learning unit.
For an active learning labeling task for data items with annotations that are
generated by human labelers, each data item is counted as a regular labeling
task as described above.
The table below provides the price per 1,000 units per human labeler, based on
the unit listed for each objective. Tier 1 pricing applies to the first 50,000
units per month in each Google Cloud project; Tier 2 pricing applies to the next
950,000 units per month in the project, up to 1,000,000 units.
Contact us for pricing above 1,000,000
units per month.
Data type
Objective
Unit
Tier 1
Tier 2
Image
Classification
Image
$35
$25
Bounding box
Bounding box
$63
$49
Segmentation
Segment
$870
$850
Rotated box
Bounding box
$86
$60
Polygon/polyline
Polygon/Polyline
$257
$180
Video
Classification
5sec video
$86
$60
Object tracking
Bounding box
$86
$60
Action recognition
Event in 30sec video
$214
$150
Text
Classification
50 words
$129
$90
Sentiment
50 words
$200
$140
Entity extraction
Entity
$86
$60
Active Learning
All
Data item
$80
$56
Custom Labeler
All
Data item
$80
$56
Required use of Cloud Storage
In addition to the costs described in this document, you are required to store
data and program files in Cloud Storage buckets during the
Vertex AI lifecycle. This storage is subject to the
Cloud Storage pricing policy.
Required use of Cloud Storage includes:
Staging your training application package for custom-trained models.
Storing your training input data.
Storing the output of your training jobs.
Vertex AI does not require long-term storage of these items.
You can remove the files as soon as the operation is complete.
Free operations for managing your resources
The resource management operations provided by AI Platform are
available free of charge. The AI Platform quota policy does limit
some of these operations.
Resource
Free operations
models
create, get, list, delete
versions
create, get, list, delete, setDefault
jobs
get, list, cancel
operations
get, list, cancel, delete
Google Cloud costs
If you store images to be analyzed in Cloud Storage or use other
Google Cloud resources in tandem with Vertex AI, then
you will also be billed for the use of those services.
With Google Cloud's pay-as-you-go pricing, you only pay for the services you
use. Connect with our sales team to get a custom quote for your organization.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],[],[],[]]