You can install additional components when you create a Dataproc cluster using the Optional Components feature. This page describes the Hive WebHCat component.
The Hive WebHCat
component provides a REST API for HCatalog. The REST service is available on port
on the cluster's first master node.
Install the component
Install the component when you create a Dataproc cluster. Components can be added to clusters created with Dataproc version 1.3 and later.
See Supported Cloud Dataproc versions for the component version included in each Dataproc image release.
To create a Dataproc cluster that includes the Hive WebHCat component,
gcloud dataproc clusters create cluster-name
command with the
--optional-components flag (using image version
1.3 or later).
gcloud dataproc clusters create cluster-name \ --optional-components=HIVE_WEBHCAT \ --image-version=1.3 \ ... other flags
REST APIThe Hive WebHCat component can be specified through the Dataproc API using SoftwareConfig.Component as part of a clusters.create request.
In the Cloud Console, open the Dataproc Create a cluster page. Click "Advanced options" at the bottom of the page to view the Optional Components section.
Click "Select component" to open the Optional components selection panel. Select one or more components to install on your cluster.