You can install additional components when you create a Cloud Dataproc cluster using the Optional Components feature. This page describes the Zeppelin component.
The Zeppelin Notebook
component is a Web-based notebook for interactive data analytics. The Zeppelin
Web UI is available on port
8080 on the cluster's first master node.
By default, notebooks are saved in Cloud Storage
in the Cloud Dataproc staging bucket, which is specified by the user or
when the cluster is created. The location can be changed at cluster creation
time via the
Install the component
Install the component when you create a Cloud Dataproc cluster. Components can be added to clusters created with Cloud Dataproc version 1.3 and later.
See Supported Cloud Dataproc versions for the component version included in each Cloud Dataproc image release.
To create a Cloud Dataproc cluster that includes the Zeppelin component,
gcloud dataproc clusters create cluster-name
command with the
--optional-components flag (using image version
1.3 or later).
gcloud dataproc clusters create cluster-name \ --optional-components=ZEPPELIN \ --image-version=1.3 \ --enable-component-gateway \ ... other flags
REST APIThe Zeppelin component can be specified through the Cloud Dataproc API using SoftwareConfig.Component as part of a clusters.create request.
- Enable the component.
- In the GCP Console, open the Cloud Dataproc Create a cluster page. Click "Advanced options" at the bottom of the page to view the Optional Components section.
- Click "Select component" to open the Optional components selection panel. Select "Zeppelin Notebook" and other optional components to install on your cluster.
- Enable the Component Gateway (requires image version 1.3.29 or higher)
to enable easy access to the Zeppelin notebook and other
component web interfaces from the Google Cloud Platform Console (see
Viewing and Accessing Component Gateway URLs).
- Check the Component Gateway checkbox on the Create a cluster form.
Open the Zeppelin notebook
See Viewing and Accessing Component Gateway URLs to click Component Gateway links on the GCP Console to open the Zeppelin notebook UI running on the cluster's master node in your local browser.