Dataproc Metastore documentation
Dataproc Metastore is a fully managed Apache Hive metastore (HMS) that runs on Google Cloud. An HMS is the established standard in the open source big data ecosystem for managing technical metadata. Dataproc Metastore is highly available, autohealing, and serverless. It helps you manage your data lake metadata and provides interoperability between the various data processing engines and tools you're using.
Start your next project with $300 in free credit
Build and test a proof of concept with the free trial credits and free monthly usage of 20+ products.
Documentation resources
Related videos
Data Analytics Deep Dives - Dataplex Explore
Provides an overview of Dataplex Explore for executing some Spark SQL against BigQuery internal tables, external tables and Hive tables. The demo also shows how you can use a notebook along with scheduling and sharing your artifacts. Everything is
Modernize your data lake to accelerate loan processing
Watch to learn how financial service companies can use Google Cloud's data lake solution to accelerate loan processing, allowing people and businesses alike to acquire the funds they need in a timely manner. In this video, we’ll demo how a financial
Democratizing Dataproc (Cloud Next '19)
dunnhumby uses Dataproc as a data platform on which our data scientist and product teams run ETL and machine learning routines. We encourage product teams to autonomously spin up clusters only when they need to and to use Apache Airflow to coordinate