Data Catalog

A fully managed and highly scalable data discovery and metadata management service.

Discover, understand, and manage your data

Data Catalog is a fully managed, scalable metadata management service that helps organizations quickly discover, understand, and manage all their data. It offers an easy-to-use search interface for data discovery, a flexible cataloging system for capturing both technical and business metadata, and a security and compliance foundation built on integrations with Cloud Data Loss Prevention (DLP) and Cloud Identity and Access Management (IAM).

Simplifies data discovery at any scale

Data Catalog simplifies data discovery at any scale, with no infrastructure to set up or manage. The service is powered by the same Google search technology that supports Gmail and Drive, so customers can quickly find data assets wherever they are. Its easy-to-use search interface includes built-in access-level controls, so customers can start exploring data securely. Enterprises of all sizes can organize their business metadata as schematized tags, making data across the organization easier to find and supporting a culture of data-driven decision-making.

Offers a unified view of all datasets

Data Catalog offers a central, secure catalog across Google Cloud, giving organizations a unified view of all their data assets. The service automatically ingests technical metadata for BigQuery and Cloud Pub/Sub, and lets customers capture business metadata in a schematized format via tags, custom APIs, and the UI. We are building an ecosystem with strategic partners so customers can discover all their data assets wherever they are. With Data Catalog, organizations can promote knowledge sharing and collaboration, helping users generate more value from their data assets.

Provides a foundation for data governance

Data Catalog provides a foundation for governance through access control lists (ACLs) that extend to the data itself, so the right people find and access the right data. It also helps you discover your data and starts you on the path to transparency: to govern your data, you need to know what data you have and where it is. Integration with Cloud DLP provides automatic discovery and tagging of sensitive information, simplifying the process of finding and governing sensitive data.

Features

Serverless

Fully managed and scalable metadata management service; requires no infrastructure to set up or manage, allowing you to focus on your business.

Metadata-as-a-service

Metadata management service for cataloging data assets via custom APIs and the UI, thereby providing a unified view of data wherever it is.

Central catalog

A flexible and powerful cataloging system that captures both technical metadata (automatically) and business metadata (as tags) in a structured format.

Search and discovery

An easy-to-use UI with powerful structured search capabilities for quickly finding data assets, powered by the same Google search technology that supports Gmail and Drive.

Schematized metadata

Supports schematized tags (e.g., Enum, Bool, DateTime), not just simple text tags, giving organizations rich and organized business metadata.
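
To make the difference between a schematized tag and a free-text tag concrete, here is a minimal sketch in plain Python. It does not use the Data Catalog API; `FieldType`, `TemplateField`, `GOVERNANCE_TEMPLATE`, and `validate_tag` are hypothetical names introduced only for illustration. The idea is that a template declares typed fields, and a tag is accepted only if every value matches its declared type.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum


class FieldType(Enum):
    STRING = "string"
    BOOL = "bool"
    DATETIME = "datetime"
    ENUM = "enum"


@dataclass(frozen=True)
class TemplateField:
    name: str
    type: FieldType
    allowed_values: tuple = ()  # only consulted for ENUM fields


# Hypothetical template: the field names are illustrative assumptions.
GOVERNANCE_TEMPLATE = [
    TemplateField("has_pii", FieldType.BOOL),
    TemplateField("tier", FieldType.ENUM, ("gold", "silver")),
    TemplateField("last_reviewed", FieldType.DATETIME),
    TemplateField("owner", FieldType.STRING),
]


def validate_tag(template, tag):
    """Return a list of error strings; an empty list means the tag fits."""
    fields = {f.name: f for f in template}
    errors = []
    for name, value in tag.items():
        field = fields.get(name)
        if field is None:
            errors.append(f"{name}: not defined in template")
        elif field.type is FieldType.BOOL and not isinstance(value, bool):
            errors.append(f"{name}: expected bool")
        elif field.type is FieldType.DATETIME and not isinstance(value, datetime):
            errors.append(f"{name}: expected datetime")
        elif field.type is FieldType.ENUM and value not in field.allowed_values:
            errors.append(f"{name}: expected one of {list(field.allowed_values)}")
        elif field.type is FieldType.STRING and not isinstance(value, str):
            errors.append(f"{name}: expected str")
    return errors
```

With free-text tags, a value such as `"yes"` for `has_pii` would be stored as-is; under a schema like this sketch, it is rejected before it reaches the catalog, which is what keeps the metadata consistent enough to search and filter on.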

Cloud IAM integration

Provides access-level controls and honors source ACLs for reading, writing, and searching data assets, giving you enterprise-grade access control.
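
One way to picture what "honoring source ACLs for search" means is the sketch below. It is an illustrative model in plain Python, not how Data Catalog is implemented; `Asset`, its `readers` field, and `search` are hypothetical names. The point is that a search result is returned only if the caller could already read the underlying asset.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Asset:
    name: str
    description: str
    readers: frozenset  # principals allowed to read the source asset (assumed model)


def search(assets, query, principal):
    """Return names of assets that match the query AND are readable by principal."""
    needle = query.lower()
    return [
        asset.name
        for asset in assets
        if principal in asset.readers
        and (needle in asset.name.lower() or needle in asset.description.lower())
    ]
```

In this model, two users issuing the same query can see different result lists, because filtering by permission happens inside the search itself rather than after the user clicks through.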

On-prem connectors

Ingest technical metadata from non-Google Cloud data assets to Data Catalog for a unified view of all your data assets.

Governance

Offers a strong security and compliance foundation with Cloud DLP and Cloud IAM integrations.

Cloud DLP integration

Discovers and classifies sensitive data, providing intelligence and helping to simplify the process of governing your data.

Product integration overview

[Figure: Data Catalog product integration overview]

Partners and integrations

Integrations with strategic partners extend the Data Catalog ecosystem, giving customers a unified data discovery experience for hybrid cloud scenarios on their platform of choice.

Pricing

Pricing for Data Catalog is split between metadata storage and API calls, both billed on a consumption basis.

Data Catalog metadata storage includes any new metadata stored in Data Catalog, including:

  • Business metadata, such as Data Catalog tag templates and tags
  • Cloud Storage filesets
  • Schemas attached to Pub/Sub topics
  • Custom-type metadata stored in Data Catalog

Data Catalog metadata storage does not include technical metadata stored by other Google Cloud services, such as the dataset, table, and column names stored in BigQuery.

Detailed pricing and examples for both metadata storage and API calls may be found in the Data Catalog documentation.
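
The two-part, consumption-based structure described above can be sketched as simple arithmetic. The function below is only an illustration: the billing units (GiB stored, millions of API calls) and both rate parameters are assumptions, and no actual Data Catalog rates or free tiers are encoded here. Take the real figures from the pricing documentation.

```python
def estimate_monthly_cost(stored_gib, api_calls, rate_per_gib, rate_per_million_calls):
    """Estimate a monthly bill from the two consumption-based components.

    All rates are caller-supplied placeholders, not published prices.
    """
    storage_cost = stored_gib * rate_per_gib
    api_cost = (api_calls / 1_000_000) * rate_per_million_calls
    return {
        "storage": storage_cost,
        "api": api_cost,
        "total": storage_cost + api_cost,
    }
```

Because technical metadata held by other services is excluded, `stored_gib` in this sketch would count only metadata newly stored in Data Catalog, such as tags, tag templates, and filesets.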
