Break free from data silos with Dataplex’s intelligent data fabric that enables organizations to centrally discover, manage, monitor, and govern data across data lakes, data warehouses, and data marts with consistent controls, providing access to trusted data and powering analytics and AI at scale.
Single pane of glass for data management across data silos
Centralized security and governance enabling distributed ownership of data with global control
Unified search and data discovery, based on business context, across distributed data
Built-in data intelligence to enable trust in data and accelerate time to insights
An open platform with support for open source tools and a robust partner ecosystem
Benefits
Get the freedom to store data where you want for the best price and performance while choosing the best analytics tools, open source or cloud native, to accelerate the entire analytics lifecycle.
Automate data discovery, metadata harvesting, data lifecycle management, data profiling, data quality and lineage using Google’s best-in-class AI/ML capabilities to reduce management costs.
Enable standardization and unification of metadata, security policies, governance, and data classification for consistency across distributed data.
Key features
Automate data discovery, classification, and metadata enrichment of structured, semi-structured, and unstructured data, stored in Google Cloud and beyond, with built-in data intelligence. Manage technical, operational, and business metadata, for all your data, in a unified, flexible, and powerful Data Catalog. Easily search, find, and understand your data with built-in faceted-search interface using the same search technology as Gmail.
Logically organize your data that spans multiple storage services into business-specific domains using Dataplex lakes and data zones. Manage, curate, tier, and archive your data easily with one click.
Enable central policy management, monitoring, and auditing for data authorization and classification, across data silos. Facilitate distributed data ownership based on business domains with global monitoring and governance.
Automate data quality across distributed data and enable access to data you can trust. Use automatically captured data lineage to better understand your data, trace dependencies, and effectively troubleshoot data issues.
Easily understand where your data comes from and the transformations it goes through with end-to-end data lineage. Automatically processed for Google Cloud data sources and extendible to 3rd party data sources.
Customers
We have PBs of data stored in Google Cloud, accessed by 1,000s of internal users daily. Dataplex enables us to deliver a business domain-specific, self-service data platform across distributed data, with decentralized data ownership but centralized governance and visibility. We are very excited to adopt Dataplex as a central component for building a unified data mesh across our analytics data.
Saral Jain, Director of Engineering, Snap Inc