Overview of GPUs in Dataflow | Google Cloud

GPUs with Dataflow

Dataflow GPUs bring the benefits of hardware acceleration directly to your streaming or batch data processing pipelines. Use Dataflow to simplify getting data to the GPU and to take advantage of data locality. You also get all the benefits of the fully managed Dataflow system: host provisioning, autoscaling, fault tolerance, and more.

Dataflow support for GPUs

Run a Dataflow job with GPUs

Follow this tutorial to see how to build a custom container image and run a Dataflow pipeline with GPUs.
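As a rough illustration of what the tutorial sets up, the sketch below assembles the pipeline options that typically request GPUs for a Dataflow job: a custom SDK container image plus the `worker_accelerator` service option. The helper name `gpu_pipeline_args`, the project ID, the region, and the image path are hypothetical placeholders, and the default GPU type is only an example. Check the tutorial for the options your SDK version and region actually require.

```python
def gpu_pipeline_args(project, region, image,
                      gpu_type="nvidia-tesla-t4", gpu_count=1):
    """Build command-line style options for a Dataflow job that attaches GPUs.

    Hypothetical helper for illustration; all argument values are placeholders.
    """
    if gpu_count < 1:
        raise ValueError("gpu_count must be at least 1")

    # Ask the Dataflow service to attach GPUs to each worker and to install
    # the NVIDIA driver on the worker VMs.
    accelerator = f"type:{gpu_type};count:{gpu_count};install-nvidia-driver"

    return [
        "--runner=DataflowRunner",
        f"--project={project}",
        f"--region={region}",
        # Custom container image that bundles the GPU libraries the pipeline uses.
        f"--sdk_container_image={image}",
        f"--dataflow_service_options=worker_accelerator={accelerator}",
    ]


# Placeholder values; replace them with your own project, region, and image.
args = gpu_pipeline_args(
    project="my-project",
    region="us-central1",
    image="gcr.io/my-project/gpu-image:latest",
)
```

The resulting list can be passed to `apache_beam.options.pipeline_options.PipelineOptions(args)` when constructing the pipeline. The sketch deliberately avoids importing Apache Beam so that it stays dependency-free.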

Run a simple PyTorch pipeline with GPUs

Follow this tutorial to build a PyTorch pipeline and run it on Dataflow with GPUs.

Go to GitHub

Run a pipeline with NVIDIA L4 GPUs

The L4 GPU type is useful for running machine learning inference pipelines.

Use the L4 GPU type

Run a simple TensorFlow pipeline with GPUs

Follow this tutorial to build a TensorFlow pipeline and run it on Dataflow with GPUs.

Go to GitHub

Resources

Developer best practices

See an example of a developer workflow for building pipelines using GPUs.

About GPUs with Dataflow

Right fitting

Use right fitting with your batch jobs to customize worker resources and reduce costs.

Learn more

Troubleshoot jobs that use GPUs

If you run into problems running your Dataflow job with GPUs, follow these troubleshooting steps.

Troubleshoot

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-04-30 UTC.