Data CatalogBeta

A fully managed and highly scalable data discovery and metadata management service.

Discover and manage your data

Discover and manage your data in Google Cloud

Data Catalog is a fully managed and scalable metadata management service that empowers organizations to quickly discover, manage, and understand all their data in Google Cloud. It offers a simple and easy-to-use search interface for data discovery, a flexible and powerful cataloging system for capturing both technical and business metadata, and a strong security and compliance foundation with Cloud Data Loss Prevention (DLP) and Cloud Identity and Access Management (IAM) integrations.

Simplifies data discovery

Simplifies data discovery at any scale

Data Catalog is a fully managed metadata management service that simplifies data discovery at any scale; there’s no infrastructure to set up or manage. The service is powered by Google search technology that supports Gmail and Drive so customers can quickly and easily find data assets wherever they are. It offers a powerful and easy-to-use search interface, with built-in access level controls, allowing customers to get started with data exploration in a seamless and more secure way. Enterprises of all sizes now have a powerful tool to organize their business metadata as schematized tags, enabling them to find any data within their organizations, thereby fostering a culture of data-driven decision-making.

Unified view of all datasets

Offers a unified view of all datasets

Data Catalog offers a central and more secure data catalog across Google Cloud, allowing organizations to have a unified view of all their data assets. The service automatically ingests technical metadata for BigQuery and Cloud Pub/Sub and allows customers to capture business metadata in schematized format via tags, custom APIs, and the UI, offering a simple and efficient way to catalog their data assets. We are building an ecosystem with strategic partners so customers can discover all their data assets wherever they are. With Data Catalog, organizations can promote knowledge sharing and collaboration across the organization, allowing users to generate more value from their data assets.

Data governance

Provides a foundation for data governance

Data Catalog provides a foundation for governance by offering a strong security and compliance foundation with access level controls (ACLs) that extend to govern the data, so the right people find and access the right data. Data Catalog also helps you discover your data and starts you on the path to transparency; in order to govern your data, you need to know what data you have and where it is. Integration with Cloud DLP provides auto discovery and tagging of sensitive information, thereby simplifying the process of finding and governing sensitive data.

Features

Serverless

Fully managed and scalable metadata management service; requires no infrastructure to set up or manage, allowing you to focus on your business.

Metadata-as-a-service

Metadata management service for cataloging data assets via custom APIs and the UI, thereby providing a unified view of data wherever it is.

Central catalog

A flexible and powerful cataloging system for capturing both technical metadata (automatically) as well as business metadata (tags) in a structured format.

Search and discovery

A simple and easy-to-use UI with powerful structured search capabilities to quickly and easily find data assets; powered by Google search technology that supports Gmail and Drive.

Schematized metadata

Supports schematized tags (e.g., Enum, Bool, DateTime) and not just simple text tags — providing organizations rich and organized business metadata.

Cloud DLP integration

Discovers and classifies sensitive data, providing intelligence and helping to simplify the process of governing your data.

Cloud IAM integration

Provides access-level controls and honors source ACLs for read, write, and search for the data assets; giving you enterprise-grade access control.

Governance

Offers a strong security and compliance foundation with Cloud DLP and Cloud IAM integrations.

Data Catalog’s fully managed and scalable service gives us the flexibility to use it as a back-end system, powering our custom solution. It offers a central catalog system for all our metadata, which allows us to quickly discover all our data assets in GCP.

Crystal Widjaja, SVP, Business Intelligence & Growth, Go-Jek

Pricing

Pricing for Data Catalog is broken down into metadata pricing as well as catalog API calls pricing.

Metadata pricing

Metadata

Business metadata as well as any on-premises stored metadata ingested by Data Catalog.

Checkmark No charge for the first 1 MB of metadata stored

Checkmark $100 per GB per month for stored metadata above 1 MB

Note: Business metadata refers to additional business context metadata that is valuable for a customer (e.g., has_pii, data_owner, delete_by_date, retain_till_date, business_logic, data_glossary_term, etc.).

Technical metadata refers to metadata that is already present in GCP services (e.g., table and column names or descriptions, date created or modified, etc.) This is available at no charge for all users, which simplifies discovery for data assets in GCP.

Catalog API Calls pricing

Catalog API Calls

Represents all read, write, and search API calls.

Checkmark No charge for the first 1 million Catalog API calls per month

Checkmark $10 per 100,000 API calls per month above 1 million

Partnerships and integrations

Data Catalog integrations with strategic partners builds a strong ecosystem and creates a foundation for long-term relationships, allowing customers to have a unified data discovery experience for hybrid cloud scenarios, using their platform of choice.

Informatica is partnering with Google Cloud to help customers discover and manage all their data assets wherever they are. As customers embark on this journey, interoperability between Google Cloud Data Catalog and Informatica’s Enterprise Data Catalog and Axon Data Governance offerings provides customers with a comprehensive view of their data assets and enables data users to find relevant and trusted data for their business needs.

Ronen Schwartz, Senior Vice President and General Manager, Big Data, Cloud and Data Integration, Informatica
Collibra logo Informatica logo Tableau logo Looker logo

Resources

Google Cloud

Get started

Learn and build

Get started with Data Catalog

Need more help?

Our experts will help you build the right solution or find the right partner for your needs.

Products listed on this page are in alpha, beta, or early access. For more information on our product launch stages, see here.

Pomohla vám tato stránka? Ohodnoťte ji prosím:

Typ zpětné vazby...