Knowledge Catalog pricing

Knowledge Catalog pricing is based on pay-as-you-go usage. Knowledge Catalog currently charges based on the following SKUs:

  • Knowledge Catalog processing (standard and premium)
  • Metadata storage

The following is a high-level overview of how each key Knowledge Catalog capability is billed:

Capability | Processing SKU | Metadata storage charge
Cloud Storage metadata harvesting | Standard | N/A
Data lineage | Premium | Yes
Data quality | Premium | Yes - if published to Catalog
Data profiling | Premium | Yes - if published to Catalog
Enrich metadata in Knowledge Catalog | N/A | Yes

Gemini-powered features in Knowledge Catalog, including data insights and automated metadata generation, are billed as part of Gemini in BigQuery or Gemini Code Assist (link: https://cloud.google.com/products/gemini/pricing#gemini-in-bigquery-pricing).

Other usage

Data organization features in Knowledge Catalog (lake, zone, or asset setup), as well as security policy application and propagation, are provided free of charge.

In addition, some Knowledge Catalog functionality (including discovery scans, scheduled data quality and data ingestion tasks, and Knowledge Catalog managed connectors for ingesting metadata from Cloud SQL and Looker) triggers job execution using Cloud Storage (GCS), Dataproc Serverless, BigQuery, Dataflow, and Cloud Scheduler. That usage is charged according to the GCS, Dataproc, BigQuery, Dataflow, and Cloud Scheduler pricing models respectively, and the charges appear under GCS, Dataproc, BigQuery, and Dataflow instead of Knowledge Catalog.

Knowledge Catalog processing pricing

Knowledge Catalog standard and premium processing are metered by the Data Compute Unit (DCU). DCU-hour is an abstract billing unit for Knowledge Catalog and the actual metering depends on the individual features you use.

Knowledge Catalog standard processing pricing

The Knowledge Catalog standard tier covers the data discovery functionality that discovers metadata across Knowledge Catalog managed data. Prices depend on the region you choose. The standard tier is listed for the following regions:

  • Johannesburg (africa-south1)
  • Taiwan (asia-east1)
  • Hong Kong (asia-east2)
  • Tokyo (asia-northeast1)
  • Osaka (asia-northeast2)
  • Seoul (asia-northeast3)
  • Mumbai (asia-south1)
  • Singapore (asia-southeast1)
  • Jakarta (asia-southeast2)
  • Bangkok (asia-southeast3)
  • Sydney (australia-southeast1)
  • Melbourne (australia-southeast2)
  • Warsaw (europe-central2)
  • Finland (europe-north1)
  • Stockholm (europe-north2)
  • Madrid (europe-southwest1)
  • Belgium (europe-west1)
  • Berlin (europe-west10)
  • Turin (europe-west12)
  • London (europe-west2)
  • Frankfurt (europe-west3)
  • Netherlands (europe-west4)
  • Zurich (europe-west6)
  • Milan (europe-west8)
  • Paris (europe-west9)
  • Doha (me-central1)
  • Dammam (me-central2)
  • Tel Aviv (me-west1)
  • Montreal (northamerica-northeast1)
  • Toronto (northamerica-northeast2)
  • Mexico (northamerica-south1)
  • Sao Paulo (southamerica-east1)
  • Santiago (southamerica-west1)
  • Iowa (us-central1)
  • Oklahoma (us-central2)
  • South Carolina (us-east1)
  • Northern Virginia (us-east4)
  • Columbus (us-east5)
  • Dallas (us-south1)
  • Oregon (us-west1)
  • Los Angeles (us-west2)
  • Salt Lake City (us-west3)
  • Las Vegas (us-west4)

Item | Meter | Default* (USD) | BigQuery CUD - 1 Year* (USD) | BigQuery CUD - 3 Year* (USD)
Knowledge Catalog processing | per DCU per unit time | $0.06 | $0.054 | $0.048

* Each consumption model has a unique ID. You may need to opt in to be eligible for consumption model discounts.

Knowledge Catalog free tier

As part of the Google Cloud Free Tier, Knowledge Catalog offers some resources free of charge up to a specific limit. These free usage limits are available during and after the free trial period. If you go over these usage limits and are no longer in the free trial period, you will be charged according to the pricing as described in the sections above.

Note: The Knowledge Catalog free tier is only available for the Knowledge Catalog Standard Processing SKU, and is not available for the Knowledge Catalog Premium Processing SKU.

Resource | Monthly free usage limits
Knowledge Catalog Processing | 100 DCU-hour
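
The effect of the free tier on a monthly standard processing bill can be sketched as follows. This is an illustrative calculation only: it assumes the $0.06 default rate shown above is charged per DCU-hour, and that the 100 DCU-hour allowance is simply subtracted from the month's standard usage. The function name is hypothetical.

```python
# Values taken from the tables above. Treating the rate as "per DCU-hour"
# is an assumption about the "per DCU per unit time" meter.
STANDARD_RATE_USD_PER_DCU_HOUR = 0.06
FREE_TIER_DCU_HOURS = 100.0


def standard_processing_cost(dcu_hours: float) -> float:
    """Estimate the monthly standard processing charge in USD.

    Usage up to the free tier limit costs nothing; only the excess is billed.
    """
    billable_dcu_hours = max(0.0, dcu_hours - FREE_TIER_DCU_HOURS)
    return billable_dcu_hours * STANDARD_RATE_USD_PER_DCU_HOUR


if __name__ == "__main__":
    for usage in (40.0, 100.0, 250.0):
        print(f"{usage:.0f} DCU-hours -> ${standard_processing_cost(usage):.2f}")
```

The free tier does not apply to premium processing, so the same subtraction should not be used for data lineage, data quality, or data profiling usage.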

Knowledge Catalog premium processing pricing

The Knowledge Catalog premium processing tier covers data lineage, data quality, and data profiling.

  • Johannesburg (africa-south1)
  • Taiwan (asia-east1)
  • Hong Kong (asia-east2)
  • Tokyo (asia-northeast1)
  • Osaka (asia-northeast2)
  • Seoul (asia-northeast3)
  • Mumbai (asia-south1)
  • Delhi (asia-south2)
  • Singapore (asia-southeast1)
  • Jakarta (asia-southeast2)
  • Bangkok (asia-southeast3)
  • Sydney (australia-southeast1)
  • Melbourne (australia-southeast2)
  • Warsaw (europe-central2)
  • Finland (europe-north1)
  • Stockholm (europe-north2)
  • Madrid (europe-southwest1)
  • Belgium (europe-west1)
  • Berlin (europe-west10)
  • Turin (europe-west12)
  • London (europe-west2)
  • Frankfurt (europe-west3)
  • Netherlands (europe-west4)
  • Zurich (europe-west6)
  • Milan (europe-west8)
  • Paris (europe-west9)
  • Doha (me-central1)
  • Dammam (me-central2)
  • Tel Aviv (me-west1)
  • Montreal (northamerica-northeast1)
  • Toronto (northamerica-northeast2)
  • Mexico (northamerica-south1)
  • Sao Paulo (southamerica-east1)
  • Santiago (southamerica-west1)
  • Iowa (us-central1)
  • Oklahoma (us-central2)
  • South Carolina (us-east1)
  • Northern Virginia (us-east4)
  • Columbus (us-east5)
  • Alabama (us-east7)
  • Dallas (us-south1)
  • Oregon (us-west1)
  • Los Angeles (us-west2)
  • Salt Lake City (us-west3)
  • Las Vegas (us-west4)
  • Phoenix (us-west8)

Item | Meter | Default* (USD) | BigQuery CUD - 1 Year* (USD) | BigQuery CUD - 3 Year* (USD)
Knowledge Catalog premium processing | per DCU per unit time | $0.089 | $0.0801 | $0.0712

* Each consumption model has a unique ID. You may need to opt in to be eligible for consumption model discounts.

Calculation of DCU charges

DCU charges for each feature are calculated as follows:

1. Auto data quality scans:

  • The DCU-hour consumption is proportional to the processing involved in profiling the data and computing the data quality metrics. This is billed per second, with a minimum of one minute.
  • The charge depends on the number of rows, the number of columns, the amount of data scanned, the data quality rule configuration, the partitioning and clustering settings on the table, and the frequency of the scan.
  • For data quality anomaly detection scans, DCU charges do not apply. Instead, standard BigQuery pricing applies for compute, storage, and BQML model training, processing, and deployment.
  • Using a custom execution identity changes how you are billed for the scan. When you specify a custom execution identity, the compute and storage costs associated with the scan are billed directly to your BigQuery project, bypassing the standard Knowledge Catalog Premium SKUs.

Several options are available to reduce the cost of auto data quality scans.

To filter aggregate charges, use the following labels available in billing export in BigQuery:

  • goog-dataplex-datascan-data-source-dataplex-entity
  • goog-dataplex-datascan-data-source-dataplex-lake
  • goog-dataplex-datascan-data-source-dataplex-zone
  • goog-dataplex-datascan-data-source-project
  • goog-dataplex-datascan-data-source-region
  • goog-dataplex-datascan-id
  • goog-dataplex-datascan-job-id

2. Data profiling scans:

  • The DCU-hour consumption is proportional to the processing involved in profiling the data and computing the data quality metrics. This is billed per second, with a minimum of one minute.
  • The charge depends on the number of rows, the number of columns, the amount of data scanned, the partitioning and clustering settings on the table, and the frequency of the scan.
  • Using a custom execution identity changes how you are billed for the scan. When you specify a custom execution identity, the compute and storage costs associated with the scan are billed directly to your BigQuery project, bypassing the standard Knowledge Catalog Premium SKUs.

There are several options to reduce the cost of data profiling scans:

  • Sampling
  • Incremental scans
  • Column filtering
  • Row filtering

To separate data profiling charges from other charges in the Knowledge Catalog premium processing SKU, on the Cloud Billing report, use the label goog-dataplex-workload-type with value DATA_PROFILE.

To filter aggregate charges, use the following labels available in billing export in BigQuery:

  • goog-dataplex-datascan-data-source-dataplex-entity
  • goog-dataplex-datascan-data-source-dataplex-lake
  • goog-dataplex-datascan-data-source-dataplex-zone
  • goog-dataplex-datascan-data-source-project
  • goog-dataplex-datascan-data-source-region
  • goog-dataplex-datascan-id
  • goog-dataplex-datascan-job-id

3. Data lineage:

  • The DCU-hour consumption is proportional to the processing involved in automatically parsing lineage.
  • To separate data lineage charges from other charges in the Knowledge Catalog premium processing SKU, on the Cloud Billing report, use the label goog-dataplex-workload-type with value LINEAGE.
  • Calling the Data Lineage API with an Origin sourceType value other than CUSTOM incurs additional costs.
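
The per-second billing with a one-minute minimum described for data quality and data profiling scans can be illustrated with a small sketch. The model below is an assumption, not the actual metering logic: it treats a scan as using a constant number of DCUs for its whole runtime, rounds the runtime up to whole seconds, applies the one-minute minimum per scan job, and assumes the $0.089 default premium rate is charged per DCU-hour. All function and parameter names are hypothetical.

```python
import math

# Default premium processing rate from the table above, assumed to be per DCU-hour.
PREMIUM_RATE_USD_PER_DCU_HOUR = 0.089
MINIMUM_BILLED_SECONDS = 60  # "billed per second, with a minimum of one minute"


def billable_dcu_hours(dcus: float, runtime_seconds: float) -> float:
    """Convert a scan's runtime into billable DCU-hours.

    Assumes a constant DCU allocation and rounds up to whole seconds.
    """
    billed_seconds = max(math.ceil(runtime_seconds), MINIMUM_BILLED_SECONDS)
    return dcus * billed_seconds / 3600


def scan_cost(dcus: float, runtime_seconds: float) -> float:
    """Estimate the premium processing charge in USD for one scan job."""
    return billable_dcu_hours(dcus, runtime_seconds) * PREMIUM_RATE_USD_PER_DCU_HOUR


if __name__ == "__main__":
    # A 20-second scan is billed as if it ran for 60 seconds.
    print(f"20 s scan:  {billable_dcu_hours(2, 20):.4f} DCU-hours")
    print(f"90 s scan:  {billable_dcu_hours(2, 90):.4f} DCU-hours")
    print(f"1 h scan:   ${scan_cost(2, 3600):.3f}")
```

Under this model, many very short scans cost more than one longer scan covering the same data, which is one reason the scan frequency appears among the cost factors above.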

Data lineage pricing example

User A enables data lineage to track lineage for BigQuery in their project, which is in the us-central1 location. During one month, data lineage consumes 100 DCU-hours of Knowledge Catalog premium processing and generates 1 GiB of data lineage metadata. The cost is the sum of the premium processing charge for the 100 DCU-hours and the metadata storage charge for the 1 GiB of metadata.
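
A rough version of this calculation is sketched below. It is an estimate under stated assumptions rather than the page's official figure: it assumes the $0.089 default premium rate applies to us-central1 and is charged per DCU-hour, uses the $0.002739726 per GiB-hour metadata storage rate listed later on this page, and assumes a 730-hour billing month. The function name is hypothetical.

```python
# Rates as listed on this page; applying them to us-central1 is an assumption.
PREMIUM_RATE_USD_PER_DCU_HOUR = 0.089
STORAGE_RATE_USD_PER_GIB_HOUR = 0.002739726
HOURS_PER_MONTH = 730  # assumed average month length in hours


def lineage_monthly_cost(dcu_hours: float, metadata_gib: float) -> dict:
    """Estimate the monthly data lineage bill as processing plus metadata storage."""
    processing = dcu_hours * PREMIUM_RATE_USD_PER_DCU_HOUR
    storage = metadata_gib * STORAGE_RATE_USD_PER_GIB_HOUR * HOURS_PER_MONTH
    return {"processing": processing, "storage": storage, "total": processing + storage}


if __name__ == "__main__":
    cost = lineage_monthly_cost(dcu_hours=100, metadata_gib=1)
    for item, usd in cost.items():
        print(f"{item}: ${usd:.2f}")
```

With these assumptions, processing dominates the bill; the storage component only becomes significant when lineage metadata grows to many GiB.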

Knowledge Catalog metadata storage pricing

Metadata storage pricing

Knowledge Catalog uses the metadata storage SKU to charge for metadata storage. Metadata storage is measured in gibibytes (GiB), where 1 GiB is 1,073,741,824 bytes. Knowledge Catalog measures the average amount of the stored metadata during a short time interval. For billing, these measurements are combined into a one-month average, which is multiplied by the monthly rate.

Note: Metadata storage for automatically ingested Google Cloud technical metadata is offered at no charge.

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.

Knowledge Catalog storage pricing

Metadata storage charges (including those for entries and aspects) are billed to the project where the respective resource was created.

Monthly average storage | Price (USD)
Any | $0.002739726 / 1 gibibyte hour

When a Data Catalog resource is also made available in Knowledge Catalog, you are charged for only one active instance of that resource.

Knowledge Catalog API charges

API calls made while interacting with Knowledge Catalog are free of charge, including calls for:

  • Creating and managing Knowledge Catalog resources
  • Creating and managing lineage resources
  • Catalog search

Pricing example

This section provides examples of how to calculate the Knowledge Catalog cost.

Small aspects

  • User A creates and applies small aspects (1024 bytes each). For $10 per month, the user can store 5 GiB of metadata, which corresponds approximately to 5M aspects. Assuming one aspect per table, this amounts to a total of 5M tables with aspects.
  • User B creates 5M aspects of 1 KB each on the 10th of the month, and deletes the aspects on the 20th. The cost is $3.33, calculated as the $10 monthly cost of 5 GiB of data multiplied by one-third of a month.
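
User B's charge can be reproduced with the monthly-average method described in the metadata storage section. The sketch below is a simplification under stated assumptions: it uses one storage sample per day in a 30-day month (the real measurement interval is shorter), treats the aspects as stored on days 10 through 19, and uses a rate of $2 per GiB per month, which is implied by the "$10 for 5 GiB" figure above. The function name is hypothetical.

```python
# Monthly rate implied by the examples above: $10 per month for 5 GiB.
RATE_USD_PER_GIB_MONTH = 10.0 / 5.0


def monthly_storage_cost(samples_gib: list[float]) -> float:
    """Average the sampled storage over the month and multiply by the monthly rate."""
    average_gib = sum(samples_gib) / len(samples_gib)
    return average_gib * RATE_USD_PER_GIB_MONTH


# One sample per day in an assumed 30-day month:
# 5 GiB stored from the 10th up to (not including) the 20th, nothing otherwise.
user_b_samples = [5.0 if 10 <= day < 20 else 0.0 for day in range(1, 31)]

if __name__ == "__main__":
    print(f"User B: ${monthly_storage_cost(user_b_samples):.2f}")
```

Because the bill is based on the average, storing 5 GiB for a third of the month costs the same as storing about 1.67 GiB for the whole month.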

Large aspects

  • User C creates and applies large aspects (10 KB each). For $10 per month, the user can store 5 GiB of metadata, which corresponds to approximately 500k aspects. Assuming one aspect per table, it amounts to a total of 500k tables with aspects.
  • User D creates 10 aspect types (for example, ETL, data governance, and data quality), and applies large aspects (10 KB each) using each of the 10 aspect types. For $10 per month, the user can store 5 GiB of metadata, which corresponds to approximately 500k aspects. Assuming 10 aspects per table, this amounts to a total of 50k tables with aspects.
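
The capacity figures in the small and large aspect examples follow from simple division, as the sketch below shows. It assumes the $2 per GiB per month rate implied by "$10 for 5 GiB", takes 1 KB as 1,024 bytes and 10 KB as 10,240 bytes, and counts only the aspect payload (any per-entry overhead is ignored). The function names are hypothetical.

```python
GIB = 1024 ** 3  # 1 GiB = 1,073,741,824 bytes
RATE_USD_PER_GIB_MONTH = 2.0  # implied by "$10 per month for 5 GiB"


def aspects_for_budget(budget_usd: float, aspect_bytes: int) -> int:
    """Number of aspects of a given size that a monthly budget can store."""
    storage_bytes = budget_usd / RATE_USD_PER_GIB_MONTH * GIB
    return int(storage_bytes // aspect_bytes)


def tables_covered(aspect_count: int, aspects_per_table: int) -> int:
    """Number of tables that can each carry the given number of aspects."""
    return aspect_count // aspects_per_table


if __name__ == "__main__":
    small = aspects_for_budget(10.0, 1024)       # Users A and B
    large = aspects_for_budget(10.0, 10 * 1024)  # Users C and D
    print("small aspects:", small, "tables:", tables_covered(small, 1))
    print("large aspects:", large, "tables (1 each):", tables_covered(large, 1))
    print("large aspects, 10 per table:", tables_covered(large, 10))
```

The exact results (about 5.2M, 524k, and 52k) are slightly higher than the rounded 5M, 500k, and 50k quoted in the examples.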

Data Catalog pricing (Deprecated)

This section describes the pricing for Data Catalog, which is deprecated and is being migrated to Knowledge Catalog. For more information about the differences between Knowledge Catalog and Data Catalog, see Knowledge Catalog versus Data Catalog.

Data Catalog charges apply to metadata storage for Data Catalog and API calls made to the Data Catalog API.

Metadata storage and API call charges accrue daily. You can view unbilled usage on the Google Cloud console.

Note: Pricing models apply to accounts, not projects, unless specified otherwise.

Data Catalog storage pricing

Monthly average storage | Price (USD)
Up to 1 MiB | No charge
Over 1 MiB | $0.002739726 / 1 gibibyte hour

Data Catalog API charges

Data Catalog API calls are billed as described in the following table:

Note: Search queries performed on the Data Catalog page in the Google Cloud console are offered at no charge.

Item | Price (USD)
API calls, 0 count to 1,000,000 count | $0.00 (Free) / 100,000 count, per 1 month / account
API calls, 1,000,000 count and above | $10.00 / 100,000 count, per 1 month / account
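
The tiered API pricing above can be sketched as a small function. This is an approximation: it assumes calls beyond the first 1,000,000 in a month are charged proportionally at $10.00 per 100,000 calls, whereas actual billing may round or meter usage differently. The function name is hypothetical.

```python
FREE_CALLS_PER_MONTH = 1_000_000
PRICE_USD_PER_BLOCK = 10.00
CALLS_PER_BLOCK = 100_000


def data_catalog_api_cost(calls_in_month: int) -> float:
    """Estimate the monthly Data Catalog API charge in USD for one account.

    Assumes linear (prorated) pricing above the free allowance.
    """
    billable_calls = max(0, calls_in_month - FREE_CALLS_PER_MONTH)
    return billable_calls / CALLS_PER_BLOCK * PRICE_USD_PER_BLOCK


if __name__ == "__main__":
    for calls in (250_000, 1_000_000, 1_500_000):
        print(f"{calls:,} calls -> ${data_catalog_api_cost(calls):.2f}")
```

Search queries performed on the Data Catalog page in the Google Cloud console are not charged, so they should be excluded from the call count passed to this estimate.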

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
