GPU pricing

This page describes the pricing information for Compute Engine GPUs. This page does not cover disk and images, networking, sole-tenant nodes pricing or VM instance pricing.

Compute Engine charges for usage based on the following price sheet. A bill is sent out at the end of each billing cycle, providing a sum of Google Cloud charges. Prices on this page are listed in U.S. dollars (USD).

For Compute Engine, disk size, machine type memory, and network usage are calculated in JEDEC binary gigabytes (GB), or IEC gibibytes (GiB), where 1 GiB is 230 bytes. Similarly, 1 TiB is 240 bytes, or 1024 JEDEC GBs.

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.

You can also find pricing information with the following options:

Overview

You can attach one or more GPUs to your virtual machine (VM) instances to accelerate specific workloads or offload work from your vCPUs. Each GPU adds to the cost of your instance in addition to the cost of the machine type. GPUs are subject to the same billing policy as vCPUs and memory.

Eligible GPU devices that are attached to standard VM instances automatically receive any applicable sustained use discounts (SUDs), similar to vCPUs. GPUs attached to Spot VMs (or preemptible VMs), are charged at the Spot prices for GPUs but don't receive SUDs.

GPUs are also eligible for resource-based committed use discounts (CUDs). To receive resource-based CUDs for your GPUs, you must create and attach a reservation for your GPUs when you purchase your resource-based commitment. The attached reservation cannot be modified or deleted for the duration of the commitment. For more information, see Attach reservations to commitments. However, you can also reserve capacity for your GPU resources in a specific zone, even when you don't purchase a commitment. When you don't purchase a commitment for your GPUs, you pay normal on-demand prices; any applicable SUDs apply automatically.

GPU prices are listed by region. GPU devices are available only in specific zones within some regions. Read GPUs on Compute Engine to see a complete list of regions and zones where GPU devices are available.

You can also use the Google Cloud Pricing Calculator to help determine the total cost of your instances including both the cost of GPUs and machine type configurations.

GPUs that are attached to accelerator-optimized VMs

Each accelerator-optimized machine type has a specific model of NVIDIA GPUs attached.

  • For A3 accelerator-optimized machine types, NVIDIA H100 80GB GPUs are attached.
  • For A2 accelerator-optimized machine types, NVIDIA A100 GPUs are attached. These are available in both A100 40GB and A100 80GB options.
  • For G2 accelerator-optimized machine types, NVIDIA L4 GPUs are attached.

For GPUs that are attached to accelerator-optimized machine types, the total cost of running these machine types which includes the GPUs cost, are available in the Accelerator-optimized machine type family pricing documentation.

Other GPUs models


**For committed use discounts pricing on the A2 ultra machine series, connect with your sales account team.

*Spot prices are dynamic and can change up to once every 30 days, but provide discounts of 60-91% off of the corresponding on-demand price for most machine types and GPUs. Spot prices also provide smaller discounts for local SSDs and A3 machine types. For more information, see the Spot VMs documentation.

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.

Cloud TPU pricing

Information about billing for Cloud TPU resources is available on the Cloud TPU pricing page.

What's next

Request a custom quote

With Google Cloud's pay-as-you-go pricing, you only pay for the services you use. Connect with our sales team to get a custom quote for your organization.
Contact sales