Datastream

Serverless and easy-to-use change data capture and replication service.

  • Access to streaming, low-latency data from MySQL, PostgreSQL, AlloyDB, and Oracle databases

  • Near real-time analytics in BigQuery with Datastream for BigQuery

  • Easy-to-use setup with built-in secure connectivity for faster time to value

  • Serverless platform that automatically scales, with no resources to provision or manage

  • Secure foundation for database synchronization and event-driven architectures

Benefits

Replicate and synchronize data with minimal latency

Synchronize data across heterogeneous databases, storage systems, and applications reliably, with low latency, while minimizing impact on source performance.

Scale up or down with a serverless architecture

Get up and running fast with a serverless, easy-to-use service that scales up or down seamlessly and has no infrastructure to manage.

Unmatched flexibility with Google Cloud services

Connect and integrate data across your organization with the best of Google Cloud services like BigQuery, Spanner, Dataflow, and Data Fusion.

Key features

Datastream's differentiated approach

Streaming data from relational databases

Datastream reads every change (insert, update, and delete) from your MySQL, PostgreSQL, AlloyDB, and Oracle databases and delivers it to BigQuery, Cloud SQL, Cloud Storage, and Spanner. Agentless and Google-native, it reliably streams every event as it happens. Datastream processes over half a trillion events per month.

Normalized data types across sources

Datastream normalizes every event’s data type from the source database’s type into a unified Datastream type. This lossless data type normalization across sources means easier downstream processing in a source-agnostic way, regardless of where the data originated.
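The idea of mapping source-specific types onto one unified type can be sketched as follows. This is a minimal illustration only: the mapping table, the unified type names (`INTEGER`, `STRING`, `TIMESTAMP`, `DECIMAL`), and the functions `normalize_type` and `normalize_schema` are hypothetical and are not Datastream's actual unified types or API.

```python
from typing import Dict, Tuple

# Hypothetical mapping from (source database, source type) to a unified type.
# Datastream's real unified types and mappings may differ.
UNIFIED_TYPES: Dict[Tuple[str, str], str] = {
    ("mysql", "int"): "INTEGER",
    ("mysql", "varchar"): "STRING",
    ("mysql", "datetime"): "TIMESTAMP",
    ("postgresql", "integer"): "INTEGER",
    ("postgresql", "text"): "STRING",
    ("postgresql", "timestamp"): "TIMESTAMP",
    ("oracle", "number"): "DECIMAL",
    ("oracle", "varchar2"): "STRING",
}


def normalize_type(source: str, source_type: str) -> str:
    """Return the unified type for one source column type (case-insensitive)."""
    key = (source.lower(), source_type.lower())
    try:
        return UNIFIED_TYPES[key]
    except KeyError:
        raise ValueError(f"no unified type for {source}:{source_type}") from None


def normalize_schema(source: str, columns: Dict[str, str]) -> Dict[str, str]:
    """Normalize every column of a source schema into unified types."""
    return {name: normalize_type(source, col_type) for name, col_type in columns.items()}


mysql_schema = normalize_schema("mysql", {"id": "INT", "name": "VARCHAR"})
postgres_schema = normalize_schema("postgresql", {"id": "integer", "name": "text"})

# Two different sources yield the same unified schema, so downstream
# code does not need to know where the data originated.
print(mysql_schema == postgres_schema)  # True
```

With a mapping like this, downstream processing can be written once against the unified types instead of once per source database.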

Schema drift resolution

As source schemas change, Datastream resolves schema drift quickly and seamlessly. On every schema change, Datastream rotates files, creating a new file in the destination bucket. Original source data types are just an API call away with an up-to-date, versioned Schema Registry.
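File rotation on schema change can be pictured with a small sketch. It assumes, purely for illustration, that each event is a dictionary and that the set of column names stands in for the schema; the function `rotate_on_schema_change` is hypothetical and does not reflect how Datastream detects schema changes or writes files.

```python
from typing import Any, Dict, Iterable, List


def rotate_on_schema_change(events: Iterable[Dict[str, Any]]) -> List[List[Dict[str, Any]]]:
    """Group events into 'files', starting a new file whenever the schema changes.

    The schema is approximated by the sorted tuple of column names (an
    assumption made for this sketch).
    """
    files: List[List[Dict[str, Any]]] = []
    current_schema = None
    for event in events:
        schema = tuple(sorted(event))  # column names act as a schema fingerprint
        if schema != current_schema:
            files.append([])  # rotate: open a new file for the new schema
            current_schema = schema
        files[-1].append(event)
    return files


events = [
    {"id": 1, "name": "Ada"},
    {"id": 2, "name": "Linus"},
    # An "email" column is added at the source, so a new file starts here.
    {"id": 3, "name": "Grace", "email": "grace@example.com"},
    {"id": 4, "name": "Alan", "email": "alan@example.com"},
]

files = rotate_on_schema_change(events)
print([len(f) for f in files])  # [2, 2]
```

Because every file then holds events of a single schema, a consumer can read each file without handling mixed column sets.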

Secure by design

Datastream supports multiple secure, private connectivity methods to protect data in transit. In addition, data is encrypted in transit and at rest so you can rest easy knowing your data is protected as it streams.

Datastream was instrumental in achieving high-quality, reconciled data replication from our various operational data sources into BigQuery. It is easy to use, serverless and highly scalable, allowing us to set up the first streams in a short timeframe. Speed was critical to us because the data ingestion was the foundation for our data platform migration.

Oleksandr Kaleniuk, Data Ingestion Lead at Just Eat Takeaway

Documentation

Find resources and documentation for Datastream

Google Cloud Basics

Datastream overview

Learn how Datastream helps you replicate and synchronize data across heterogeneous databases, storage systems, and applications.

Quickstart

Datastream quickstart using the Cloud Console

Learn how to use the Google Cloud Console as a visual interface to start streaming data.

Tutorial

Configure Datastream using the API

Learn how to use the API to configure Datastream to transfer data from a source Oracle database into Cloud Storage.

Google Cloud Basics

How to migrate from Oracle to Cloud SQL for PostgreSQL

Use Google’s Datastream-based, open-source toolkit to migrate from Oracle to Cloud SQL for PostgreSQL.

Tutorial

Replicate data into BigQuery with Datastream and Dataflow

Learn how to combine Datastream with Dataflow templates to replicate data from a relational database.

Tutorial

How to replicate data from an Oracle database into BigQuery

Watch this video to learn how to replicate data in real time from Oracle into BigQuery using Data Fusion’s replication accelerator, which is integrated with Datastream.

Use cases

Continuous change data capture: replicate every event as it happens

Use case
Real-time, anytime change streams

Change data capture integrates data by reading change events (inserts, updates, and deletes) from source databases and writing them to a data destination, where downstream systems can act on them. Datastream supports change streams from MySQL, PostgreSQL, AlloyDB, and Oracle databases into BigQuery, Cloud SQL, Cloud Storage, and Spanner, enabling real-time analytics, database replication, and other use cases. Additional sources and destinations are coming in the future.
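To make the mechanics concrete, here is a minimal sketch of applying a stream of change events to a replica. The `ChangeEvent` shape and the `apply_changes` function are hypothetical names chosen for this illustration; they are not Datastream's event format or API, and a real destination would be a database or warehouse table rather than an in-memory dictionary.

```python
from dataclasses import dataclass
from typing import Any, Dict, Iterable, Optional


@dataclass
class ChangeEvent:
    """Hypothetical change event; not Datastream's actual event format."""
    op: str                               # "insert", "update", or "delete"
    key: int                              # primary key of the affected row
    row: Optional[Dict[str, Any]] = None  # new row image; None for deletes


def apply_changes(
    replica: Dict[int, Dict[str, Any]], events: Iterable[ChangeEvent]
) -> Dict[int, Dict[str, Any]]:
    """Apply change events, in order, to a replica keyed by primary key."""
    for event in events:
        if event.op in ("insert", "update"):
            # Inserts and updates both write the latest row image (an upsert).
            replica[event.key] = dict(event.row or {})
        elif event.op == "delete":
            replica.pop(event.key, None)
        else:
            raise ValueError(f"unknown operation: {event.op}")
    return replica


events = [
    ChangeEvent("insert", 1, {"name": "Ada", "city": "London"}),
    ChangeEvent("insert", 2, {"name": "Linus", "city": "Helsinki"}),
    ChangeEvent("update", 1, {"name": "Ada", "city": "Paris"}),
    ChangeEvent("delete", 2),
]

replica = apply_changes({}, events)
print(replica)  # {1: {'name': 'Ada', 'city': 'Paris'}}
```

Applying events in source order is what keeps the replica consistent: the update to row 1 must land after its insert, and the delete of row 2 must land after its insert.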

Datastream diagram

Pricing

Datastream pricing details

Datastream pricing is based on the actual amount of data processed each month. Additional pricing details are available on the Datastream pricing page.

Additional resources, such as Cloud Storage, Dataflow, and BigQuery, are billed according to each service’s pricing.

Partners

Recommended partners

Google Cloud partners can help you get the most out of your data with Datastream.


ORACLE® is a registered trademark of Oracle Corporation.

Take the next step

Start building on Google Cloud with $300 in free credits and 20+ always free products.