Accelerate innovation on Google Cloud and NVIDIA

NVIDIA and Google Cloud provide accelerator-optimized solutions for demanding workloads, including generative AI, high-performance computing, data analytics, graphics, and gaming.

Engineering what's next

Google Distributed Cloud runs AI models on-premises

Learn how Google Cloud and NVIDIA collaborate to bring Gemini and generative AI to the edge and to regulated environments with Google Distributed Cloud.

NVIDIA-accelerated computing on Google Cloud

Recapping NVIDIA GTC 2026 in San Jose. See the highlights:


“We are moving from training AI to producing intelligence. These data centers are no longer just storing information. They are factories generating tokens, generating intelligence... Our expanded collaboration with Google Cloud will help developers accelerate their work with infrastructure that supercharges energy efficiency and reduces costs.”

Jensen Huang, GTC 2026 keynote

High-performance GPUs on Google Cloud

Speed up machine learning, scientific computing, and generative AI with high-performance GPUs on Google Cloud.

Key benefits:

  • Run workloads (generative AI, 3D visualization, HPC) with advanced AI hardware/software
  • Access diverse GPUs for varied performance and pricing
  • Optimize workloads with flexible pricing and machine customizations

Key features

  • Diverse GPU Offerings: Compute Engine offers the following NVIDIA GPUs: RTX PRO 6000, GB300, GB200, B200, H200, H100, L4, P100, P4, T4, V100, A100. These options cover a range of cost and performance needs.
  • Adaptable Performance: Balance processor, memory, high-performance disk, and up to 8 GPUs per instance to fit each workload. Benefit from per-second billing.
  • Use Google Cloud Advantages: Run GPU workloads on Google Cloud and access industry-leading storage, networking, and data analytics.
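
As a rough illustration of what per-second billing can mean for a short job, the Python sketch below compares a per-second charge with a charge rounded up to whole hours. It is a sketch under stated assumptions, not a pricing calculator: the function names are hypothetical, the hourly rate of 2.00 per GPU is a made-up placeholder rather than a Google Cloud price, and the calculation ignores minimum charges, discounts, and the cost of CPU, memory, and disk.

```python
import math

MAX_GPUS_PER_INSTANCE = 8  # upper bound quoted in the feature list above


def per_second_cost(seconds_used: int, gpus: int, hourly_rate_per_gpu: float) -> float:
    """Charge only for the seconds actually used (hypothetical rate)."""
    if not 1 <= gpus <= MAX_GPUS_PER_INSTANCE:
        raise ValueError("this sketch assumes 1 to 8 GPUs per instance")
    return seconds_used * gpus * hourly_rate_per_gpu / 3600


def hourly_rounded_cost(seconds_used: int, gpus: int, hourly_rate_per_gpu: float) -> float:
    """Charge for whole hours, rounding partial hours up (for comparison only)."""
    if not 1 <= gpus <= MAX_GPUS_PER_INSTANCE:
        raise ValueError("this sketch assumes 1 to 8 GPUs per instance")
    hours_billed = math.ceil(seconds_used / 3600)
    return hours_billed * gpus * hourly_rate_per_gpu


if __name__ == "__main__":
    # A 90-minute run on 8 GPUs at a placeholder rate of 2.00 per GPU-hour.
    seconds = 90 * 60
    print(per_second_cost(seconds, 8, 2.0))      # 24.0
    print(hourly_rounded_cost(seconds, 8, 2.0))  # 32.0
```

With these placeholder numbers, the 90-minute run is charged for 1.5 hours under per-second billing instead of 2 full hours, which is where the saving for short or irregular jobs comes from.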

NVIDIA technologies on Google Cloud

Google Kubernetes Engine (GKE)

Use GKE scalability, NVIDIA Multi-Instance GPU (MIG) support, and GPU time-sharing for efficient generative AI training, inference, and other compute-intensive workloads. Optimize resource utilization and minimize operational costs.

Vertex AI

Combine NVIDIA accelerated computing with Vertex AI, a unified MLOps platform. Use NVIDIA GPUs and AI software (such as Triton™ Inference Server) within Vertex AI Training, Prediction, Pipelines, and Notebooks to accelerate generative AI development and deployment without infrastructure complexities.

Cloud Run

Deploy generative AI faster with NVIDIA NIM on Cloud Run, a fully managed serverless platform. Cloud Run GPU support speeds up NIM to optimize performance and accelerate gen AI model deployment in a serverless environment.

Dynamic Workload Scheduler

Access NVIDIA GPU capacity on Google Cloud for short-duration AI workloads (training, fine-tuning, experimentation). Flexible scheduling and atomic provisioning enhance resource utilization and optimize costs across services like GKE, Vertex AI, and Batch.

Google Distributed Cloud

The NVIDIA Blackwell platform on Google Distributed Cloud enables secure, on-premises deployment of advanced agentic AI (including Google Gemini models). This offers improved AI performance and scalability for sensitive, regulated workloads, ensuring data privacy, sovereignty, and compliance.


Technical resources for deploying NVIDIA technologies on Google Cloud

Google Cloud basics

  • GPUs on Compute Engine: Compute Engine provides GPUs that you can add to your virtual machine instances. Learn more
  • Using GPUs for training models in the cloud: Run the training process for many deep learning models, like image classification, video analysis, and natural language processing. Learn more
  • Attaching GPUs to Dataproc clusters: Attach GPUs to the master and worker Compute Engine nodes in a Dataproc cluster to accelerate specific workloads, such as machine learning and data processing. Learn more
  • Using GPUs with Dataflow: Using GPUs in Dataflow jobs lets you speed up some of your machine learning and other compute-intensive data processing tasks. Learn more


Tutorials

  • Learn how to add or remove GPUs from a Compute Engine VM. Learn more
  • Installing GPU drivers: This guide shows ways to install NVIDIA proprietary drivers after you’ve created an instance with one or more GPUs. Learn more
  • GPUs on Google Kubernetes Engine: Learn how to use GPU hardware accelerators in your Google Kubernetes Engine cluster nodes. Learn more

View all product documentation


GTC Studio Insights featuring Google Cloud's Chelsie Czop

Watch the latest AI podcast: Tiffany Janzen sits down with Google Cloud's Chelsie Czop to discuss the partnership between NVIDIA and Google Cloud and the future of AI infrastructure.

Recorded at NVIDIA GTC Insights studio

Google Cloud and NVIDIA partnership

Customer stories

Learn how SandboxAQ accelerates scientific discovery with AI
Learn how PUMA built an AI jersey designer with Google Cloud and NVIDIA

Augment Code

Augment Code speeds up AI coding on Google Cloud and NVIDIA

Galileo

Galileo: De-risk LLMs and build reliable AI apps at scale with Gemini, NVIDIA, and Google Cloud

Baseten

How Baseten achieves 225% better cost-performance for AI inference with NVIDIA and Google Cloud

LiveX AI

LiveX AI reduces support costs by 85% using GKE and NVIDIA AI-powered agents

“Compared with another inference platform, running on GKE with NVIDIA NIM and GPUs delivered 6.1x acceleration in average answer/response generation speed for the Amazfit AI agent.”

Jia Li, Co-Founder and Chief AI Officer, LiveX AI

Read more

Take the next step

Tell us what you’re building. A Google Cloud NVIDIA expert can help you find the right solution.



