Dataflow is a managed service for executing a wide variety of data processing patterns. The documentation on this site shows you how to deploy your batch and streaming data processing pipelines using Dataflow, including directions for using service features.

Apache Beam is an open source programming model, with SDKs in several languages, for developing both batch and streaming pipelines. You write a pipeline as an Apache Beam program and then run it on the Dataflow service. The Apache Beam documentation provides in-depth conceptual information and reference material for the programming model, the SDKs, and other runners.

Apache, Apache Beam, Beam, the Beam logo, and the Beam firefly mascot are trademarks of The Apache Software Foundation in the United States and/or other countries.

Use cases

Explore use cases, reference architectures, whitepapers, best practices, and industry solutions.

Videos