Google Cloud's Lakehouse (formerly BigLake) is an open, cross-cloud lakehouse platform that empowers customers to build high-performance, governed and managed lakehouses in an open and interoperable manner whiling connecting Apache Iceberg data to Google Cloud's powerful engines like BigQuery and Managed Service for Apache Spark. It provides:
BigLake (Lakehouse) tables perform automatic table storage optimization, including adaptive file sizing, automatic clustering, garbage collection, as well as BigQuery metadata (CMETA) generation and refresh to optimize query performance. Your compute usage for these background table management will be charged accordingly:
Pricing heading | Billing for | List price per DCU-hour* |
|---|---|---|
BigLake (Lakehouse) table management | Storage optimization CMETA generation and refresh | $0.12 / 1 hour |
*The list prices depend on the region.
Lakehouse runtime catalog (formerly BigLake metastore)
Lakehouse runtime catalog (formerly BigLake) is a serverless and scalable runtime catalog that connects lakehouse data stored in Google Cloud to multiple runtimes, like Apache Spark and BigQuery. Configuration is done via the Iceberg REST catalog and Hive API.
Pricing heading | Billing for | List price* |
|---|---|---|
Class A Operations (writes and metadata operations)
| Metadata Access charges for writes, updates, list, create and config operations Specific Operations:
Note:
Metadata file > 1MB
| 0 count to 5,001 count Free per 1 month / account 5,001 count and above $6.00 / 1,000,000 count, per 1 month / account |
Class B Operations (reads and delete operations)
| Metadata Access charges for reads, get and delete operations Specific Operations:
Note: Metadata file > 1MB
| 0 count to 50,001 count Free per 1 month / account 50,001 count and above $0.90 / 1,000,000 count, per 1 month / account |
* The list prices depend on the region.
With Lakehouse, you can connect to your preferred engines, including BigQuery and Google Cloud Managed Service for Spark. Compute charges based on compute pricing for BigQuery and Managed Service for Apache Spark.
Your data remains in your Google Cloud Storage bucket, so you pay regular GCS rates directly for the storage space used. This cost depends entirely on the storage class you choose. Learn more about Google Cloud Storage pricing.