Dataplex pricing
Dataplex pricing is based on pay-as-you-go usage. Dataplex currently charges based on the following Dataplex and Data Catalog SKUs:
- Dataplex processing (standard and premium)
- Dataplex shuffle storage
- Data Catalog API calls
- Data Catalog metadata storage
The following is a high-level overview of how each Dataplex key capability is billed:
Capability | Dataplex processing | Dataplex shuffle storage | Data Catalog metadata storage |
---|---|---|---|
Cloud Storage metadata harvesting | Standard | N/A | N/A |
Data exploration workbench | Premium | Yes | N/A |
Data lineage | Premium | N/A | Yes |
Data quality * | Premium | N/A | Yes - if published to Data Catalog |
Data profiling * | Premium | N/A | Yes - if published to Data Catalog |
Enrich metadata in Data Catalog | N/A | N/A | Yes |
In addition, calls to the Data Catalog API and the Data Lineage API are billed according to the Data Catalog API charges.
* Billing for data quality and data profiling features will begin on June 16, 2023. Until that date, these features are free of charge. To learn more, see auto data quality pricing and data profiling pricing.
Other usage
Data organization features in Dataplex (lake, zone, and asset setup), as well as security policy application and propagation, are provided free of charge.
In addition, some Dataplex functionality (including scheduled data quality and data ingestion tasks, and Dataplex managed connectors for ingesting metadata from Cloud SQL and Looker) triggers job execution through Dataproc Serverless, BigQuery, Dataflow, and Cloud Scheduler. That usage is charged according to the Dataproc, BigQuery, Dataflow, and Cloud Scheduler pricing models respectively, and the charges appear under Dataproc, BigQuery, and Dataflow instead of Dataplex.
Dataplex processing pricing
Dataplex standard and premium processing are metered in Data Compute Units (DCUs). The DCU-hour is an abstract billing unit for Dataplex; the actual metering depends on the individual features you use.
Dataplex standard processing pricing
The Dataplex standard tier covers data discovery, which discovers metadata across data managed by Dataplex. Prices depend on the region you choose.
Dataplex free tier
As part of the Google Cloud Free Tier, Dataplex offers some resources free of charge up to a specific limit. These free usage limits are available during and after the free trial period. If you exceed these limits and are no longer in the free trial period, you are charged according to the pricing described on this page.
Resource | Monthly free usage limits |
---|---|
Dataplex Processing | 100 DCU-hours |
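The free tier arithmetic can be sketched as follows. This is a minimal illustration, not an official calculator: the function names are hypothetical, `price_per_dcu_hour` is a placeholder for the rate in your region and tier (no rate is quoted here), and the sketch ignores any free trial credits.

```python
FREE_DCU_HOURS_PER_MONTH = 100.0  # monthly free usage limit from the table above


def billable_dcu_hours(used_dcu_hours: float) -> float:
    """Return the DCU-hours that remain billable after the free allowance."""
    if used_dcu_hours < 0:
        raise ValueError("usage cannot be negative")
    return max(0.0, used_dcu_hours - FREE_DCU_HOURS_PER_MONTH)


def processing_charge(used_dcu_hours: float, price_per_dcu_hour: float) -> float:
    """Estimate the monthly processing charge.

    price_per_dcu_hour is an assumed input: look up the actual rate
    for your region and processing tier.
    """
    return billable_dcu_hours(used_dcu_hours) * price_per_dcu_hour


# 250 DCU-hours used in a month leaves 150 billable DCU-hours.
print(billable_dcu_hours(250))
```

Usage below the limit, for example 80 DCU-hours in a month, yields zero billable DCU-hours under this sketch.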
Dataplex premium processing pricing
Dataplex premium processing tier covers the data exploration workbench, data lineage, data quality, and data profiling capabilities of Dataplex.
DCU charges for each feature are calculated as follows:
- For the data exploration workbench, DCU-hours are calculated based on the compute consumption of the session.
- For data lineage, DCU-hours are proportional to the processing involved in automatically parsing lineage. For detailed examples of calculating the data lineage cost, see Estimate data lineage pricing.
- For data profiling and data quality, DCU-hour consumption is proportional to the processing involved in profiling the data and computing the data quality metrics. This is billed per second, with a minimum of one minute.
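The per-second billing with a one-minute minimum for data profiling and data quality can be illustrated with a short sketch. The helper names and the `dcu_count` parameter are hypothetical, since the number of DCUs a scan consumes is determined by the service, and rounding partial seconds up is an assumption rather than a documented rule.

```python
import math

MINIMUM_BILLED_SECONDS = 60  # one-minute minimum stated above


def billed_seconds(run_seconds: float) -> int:
    """Apply per-second billing with a one-minute minimum.

    Rounding partial seconds up is an assumption of this sketch.
    """
    if run_seconds < 0:
        raise ValueError("run time cannot be negative")
    return max(MINIMUM_BILLED_SECONDS, math.ceil(run_seconds))


def dcu_hours(run_seconds: float, dcu_count: float) -> float:
    """Convert a billed duration into DCU-hours.

    dcu_count is a hypothetical input used only for illustration.
    """
    return dcu_count * billed_seconds(run_seconds) / 3600


# A 12-second scan is billed as a full minute.
print(billed_seconds(12))
```

Under these assumptions, very short scans all cost the same as a 60-second scan, so batching small checks into one scan may reduce the billed time.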
Dataplex shuffle storage pricing
Shuffle storage pricing covers any disk storage specified in the environments configured for the data exploration workbench.
Data Catalog pricing
Data Catalog charges apply to metadata storage and to API calls made to the Data Catalog API and the Data Lineage API. Metadata storage and API call charges accrue daily. You can view unbilled usage in the Google Cloud console.
Metadata storage pricing
Metadata storage is measured in gibibytes (GiB), where 1 GiB is 1,073,741,824 bytes. Data Catalog measures the average amount of stored metadata over short time intervals. For billing, these measurements are combined into a one-month average, which is multiplied by the monthly rate.
Monthly average storage | Price per month |
---|---|
Up to 1 MiB | No charge |
Over 1 MiB | $100 per GiB per month |
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
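A rough sketch of the storage calculation follows. It rests on two assumptions that are not confirmed above: the measurements are equally spaced, so a plain mean gives the monthly average, and only the portion of the average above 1 MiB is billed. The function names are hypothetical.

```python
BYTES_PER_MIB = 1_048_576
BYTES_PER_GIB = 1_073_741_824
FREE_BYTES = BYTES_PER_MIB           # "Up to 1 MiB: No charge"
PRICE_PER_GIB_MONTH_USD = 100.0      # "$100 per GiB per month"


def monthly_average_bytes(samples: list[float]) -> float:
    """Average equally spaced storage measurements (in bytes) over a month."""
    if not samples:
        raise ValueError("at least one measurement is required")
    return sum(samples) / len(samples)


def metadata_storage_charge(samples: list[float]) -> float:
    """Estimate the monthly metadata storage charge in USD.

    Assumes only the average above the free 1 MiB is billed.
    """
    average = monthly_average_bytes(samples)
    billable_bytes = max(0.0, average - FREE_BYTES)
    return billable_bytes / BYTES_PER_GIB * PRICE_PER_GIB_MONTH_USD


# Storing 2 MiB for half the month and nothing for the other half
# averages out to 1 MiB, which stays within the free amount.
print(metadata_storage_charge([0, 2 * BYTES_PER_MIB]))
```

Because billing uses the monthly average, a short spike in stored metadata costs less than the same amount held for the whole month.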
API charges
Data Catalog charges for API calls made to the Data Catalog API and the Data Lineage API.
API calls | Price |
---|---|
Up to 1 million in a month | No charge |
Over 1 million in a month | $10 per 100,000 API calls |
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
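The API charge can be estimated with a small sketch. It assumes that calls beyond the free million are prorated rather than rounded up to whole blocks of 100,000, which is not stated above. The function name is hypothetical.

```python
FREE_CALLS_PER_MONTH = 1_000_000   # first million calls in a month are free
PRICE_PER_100K_CALLS_USD = 10.0    # "$10 per 100,000 API calls"


def api_call_charge(calls_in_month: int) -> float:
    """Estimate the monthly API charge in USD.

    Counts Data Catalog API and Data Lineage API calls together and
    prorates partial blocks of 100,000 calls (an assumption).
    """
    if calls_in_month < 0:
        raise ValueError("call count cannot be negative")
    billable_calls = max(0, calls_in_month - FREE_CALLS_PER_MONTH)
    return billable_calls / 100_000 * PRICE_PER_100K_CALLS_USD


# 1.5 million calls leaves 500,000 billable calls.
print(api_call_charge(1_500_000))
```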
For detailed examples of calculating the Data Catalog cost, see Data Catalog pricing examples.
What's next
- Read the product documentation: Dataplex, Data Catalog.
- Get started with Dataplex.
- Try the Pricing calculator.
- Learn about Dataplex solutions and use cases.