Fully managed, cloud-native data integration at any scale.
New customers get $300 in free credits to spend on Data Fusion. All customers get the first 120 hours of pipeline development free per month, per account, not charged against your credits.
Visual point-and-click interface enabling code-free deployment of ETL/ELT data pipelines
Broad library of 150+ preconfigured connectors and transformations, at no additional cost
Natively integrated best-in-class Google Cloud services
End-to-end data lineage for root cause and impact analysis
Built with an open source core (CDAP) for pipeline portability
Benefits
Data Fusion’s intuitive drag-and-drop interface, pre-built connectors, and self-service model of code-free data integration remove technical expertise-based bottlenecks and accelerate time to insight.
A serverless approach leveraging the scalability and reliability of Google services like Dataproc means Data Fusion offers the best of data integration capabilities with a lower total cost of ownership.
With built-in features like end-to-end data lineage, integration metadata, and cloud-native security and data protection services, Data Fusion assists teams with root cause or impact analysis and compliance.
Key features
Data Fusion is built using open source project CDAP, and this open core ensures data pipeline portability for users. CDAP’s broad integration with on-premises and public cloud platforms gives Cloud Data Fusion users the ability to break down silos and deliver insights that were previously inaccessible.
Data Fusion’s integration with Google Cloud simplifies data security and ensures data is immediately available for analysis. Whether you’re curating a data lake with Cloud Storage and Dataproc, moving data into BigQuery for data warehousing, or transforming data to land it in a relational store like Cloud Spanner, Cloud Data Fusion’s integration makes development and iteration fast and easy.
Cloud Data Fusion offers pre-built transformations for both batch and real-time processing. It provides the ability to create an internal library of custom connections and transformations that can be validated, shared, and reused across teams. It lays the foundation of collaborative data engineering and improves productivity. That means less waiting for ETL developers and data engineers and, importantly, less sweating about code quality.
Customers
What's new
Sign up for Google Cloud newsletters to receive product updates, event information, special offers, and more.
Documentation
Use cases
Cloud Data Fusion helps users build scalable, distributed data lakes on Google Cloud by integrating data from siloed on-premises platforms. Customers can leverage the scale of the cloud to centralize data and drive more value out of their data as a result. The self-service capabilities of Cloud Data Fusion increase process visibility and lower the overall cost of operational support.
Cloud Data Fusion can help organizations better understand their customers by breaking down data silos and enabling development of agile, cloud-based data warehouse solutions in BigQuery. A trusted, unified view of customer engagement and behavior unlocks the ability to drive a better customer experience, which leads to higher retention and higher revenue per customer.
Many users today want to establish a unified analytics environment across a myriad of expensive, on-premises data marts. Employing a wide range of disconnected tools and stop-gap measures creates data quality and security challenges. Cloud Data Fusion’s vast variety of connectors, visual interfaces, and abstractions centered around business logic helps in lowering TCO, promoting self-service and standardization, and reducing repetitive work.
All features
Code-free self-service |