Stay organized with collections Save and categorize content based on your preferences.

Dataplex pricing

Dataplex pricing is based on pay-as-you-go usage. Dataplex currently charges based on the following Dataplex and Data Catalog SKUs:

  • Dataplex processing (standard and premium)
  • Dataplex shuffle storage
  • Data Catalog API calls
  • Data Catalog metadata storage

The following is a high-level overview of how each Dataplex key capability is billed:

Capability Dataplex processing Dataplex shuffle storage Data Catalog metadata storage
Cloud Storage metadata harvesting Standard N/A N/A
Data exploration workbench Premium Yes N/A
Data lineage * Premium N/A Yes
Data quality ** Premium N/A Yes - if published to Data Catalog
Data profiling ** Premium N/A Yes - if published to Data Catalog
Enrich metadata in Data Catalog N/A N/A Yes

In addition to the above, Data Catalog API and Data Lineage API calls are charged based on the Data Catalog API charges.

* The data lineage feature is currently free of charge in the preview phase. Billing will commence once this feature becomes generally available.

** The data quality and data profiling features are currently available at no charge during the preview phase. Billing will commence during the public preview and this page will be updated at least one month in advance. Customers previewing these features at that time will receive a notification about the changes.

Other usage

Data organization features in Dataplex (lake / zone / asset setup) and security policy application / propagation, are provided free of charge.

In addition, some Dataplex functionalities (including scheduled data quality and data ingestion tasks, and Dataplex managed connectors for ingesting metadata from CloudSQL and Looker) will trigger job execution via Dataproc Serverless, BigQuery, Dataflow, and Cloud Scheduler. Those usages will be charged according to the Dataproc, BigQuery, Dataflow, and Cloud Scheduler pricing models respectively, and charges will show up under Dataproc, BigQuery, and Dataflow instead of Dataplex.

Dataplex processing pricing

Dataplex standard and premium processing are metered by the Data Compute Unit (DCU). DCU-hour is an abstract billing unit for Dataplex and the actual metering depends on the individual features you use.

Dataplex standard processing pricing

Dataplex standard tier covers the data discovery functionality that discovers metadata across Dataplex managed data. Below are the prices as per the region of your choice.

Dataplex free tier

As part of the Google Cloud Free Tier, Dataplex offers some resources free of charge up to a specific limit. These free usage limits are available during and after the free trial period. If you go over these usage limits and are no longer in the free trial period, you will be charged according to the pricing as described in the sections above.

Resource Monthly free usage limits
Dataplex Processing 100 DCU-hour

Dataplex premium processing pricing

Dataplex premium processing tier covers the data exploration workbench, data lineage, data quality, and data profiling capabilities of Dataplex.

DCU charges for each feature is calculated as below:

  • For data exploration workbench, the DCU-hour is calculated based on the compute consumption of the session.

  • For data lineage, the DCU-hour is proportional to the processing involved to automatically parse lineage.

    For detailed examples on calculating the data lineage cost, see Estimate data lineage pricing.

  • For data profiling and data quality, the DCU-hour consumption is proportional to the processing involved in profiling the data and computing the data quality metrics.

Dataplex shuffle storage pricing

Shuffle storage pricing covers any disk storage specified in the environments configured for the data exploration workbench.

Data Catalog pricing

Data Catalog charges apply to metadata storage and API calls made to Data Catalog and Data Lineage API. Metadata storage and API call charges accrue daily. You can view unbilled usage on the Google Cloud console.

Metadata storage pricing

Metadata storage is measured in gibibytes (GiB), where 1 GiB is 1,073,741,824 bytes. Data Catalog measures the average amount of the stored metadata during a short time interval. For billing, these measurements are combined into a one-month average, which is multiplied by the monthly rate.

Monthly average storage Price per month
Up to 1 MiB No charge
Over 1 MiB $100 per GiB per month

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.

API charges

Data Catalog charges for API calls made to Data Catalog API and Data Lineage API.

API calls Price
1 million in a month No charge
Over 1 million in a month $10 per 100,000 API calls

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.

For detailed examples on calculating the Data Catalog cost, see Data Catalog pricing examples.

What's next

Request a custom quote

With Google Cloud's pay-as-you-go pricing, you only pay for the services you use. Connect with our sales team to get a custom quote for your organization.
Contact sales