Cloud Dataproc Druid Component

You can install additional components when you create a Cloud Dataproc cluster using the Optional Components feature. This page describes the Druid component.

The Apache Druid component is an open source distributed OLAP data store. The Druid component installs Druid services on the Cloud Dataproc cluster master (Coordinator, Broker, and Overlord) and worker (Historical, Realtime and MiddleManager) nodes.

Install the component

Install the component when you create a Cloud Dataproc cluster. Components can be added to clusters created with Cloud Dataproc version 1.3 and later. The Druid component requires the installation of the Zookeeper component (as shown in the gcloud command-line tool example, below).

See Supported Cloud Dataproc versions for the component version included in each Cloud Dataproc image release.

gcloud command

To create a Cloud Dataproc cluster that includes the Druid component, use the gcloud dataproc beta clusters create cluster-name command with the --optional-components flag (using image version 1.3 or later).

gcloud beta dataproc clusters create cluster-name \
    --optional-components=DRUID,ZOOKEEPER \
    --image-version=1.3 \
  ... other flags

REST API

The Druid component can be specified through the Cloud Dataproc API using SoftwareConfig.Component as part of a clusters.create request.

Console

Currently, the Druid Cloud Dataproc component is not supported in the Google Cloud Platform Console.

¿Te sirvió esta página? Envíanos tu opinión:

Enviar comentarios sobre…

Cloud Dataproc Documentation
¿Necesitas ayuda? Visita nuestra página de asistencia.