Using Stackdriver Monitoring for Dataflow Pipelines

Google Stackdriver provides powerful monitoring, logging, and diagnostics. Cloud Dataflow integration with Stackdriver Monitoring allows you to access Dataflow job metrics such as Job Status, Element Counts, System Lag (for streaming jobs), and User Counters from the Stackdriver dashboards. You can also employ Stackdriver alerting capabilities to be notified of a variety of conditions, such as long streaming system lag or failed jobs.

Before you begin

Follow one of the quickstarts to get your Dataflow project set up and construct and run your pipeline.

Explore Metrics

You can explore Dataflow metrics using Stackdriver. Follow the steps in this section to observe the several standard metrics provided for each of your Dataflow pipelines.

Note: Any aggregator you define in your Dataflow pipeline will be reported by Dataflow to Stackdriver as a custom metric. Dataflow will report incremental updates to Stackdriver approximately every 30 seconds. All user metrics will be exported as the “double” data type to avoid conflicts.

  1. Go to the Google Cloud Platform Console and select the Stackdriver Monitoring menu. Stackdriver Monitoring

  2. Follow the steps in the Google Cloud Platform Console to create a Stackdriver account and start a free trial for Stackdriver.

  3. Go to the Cloud Dataflow dashboard in Stackdriver and navigate to Resources > Metrics Explorer (Beta). Metrics Explorer

  4. In the Metrics Explorer, select the dataflow_job resource type. dataflow_job

  5. From the list that appears, select a metric you'd like to observe for one of your jobs.

    Choose metrics
    For example: The WindowedWordCount pipeline is a streaming pipeline that reads from a PubSub topic and writes to BigQuery. It has 5 steps, one of which is PubsubIO.Read. The image below displays the dataflow/job/element_count for the PubsubIO.Read step of the WindowedWordCount pipeline. Example

Create alerts and dashboards

Stackdriver does not only provide you with access to Dataflow-related metrics, but also allows you to create alerts and dashboards so you can chart time series of metrics and choose to be notified when these metrics reach specified values.

Create groups of resources

You can create resource groups that include multiple Dataflow pipelines so that you can easily set alerts and build dashboards.

  1. In the Cloud Dataflow dashboard in Stackdriver go to the Groups menu and select Create Group.

  2. Add filter criteria that define the Dataflow resources included in the group. For example, one of your filter criteria can be the name prefix of your pipelines. Create group

  3. After the group is created, you will be able to see the basic metrics related to resources in that group. Create group

Create alerts for Dataflow metrics

Stackdriver gives you the ability to create alerts and be notified when a certain metric crosses a specified threshold. For example, when System Lag of a streaming pipeline increases above a predefined value.

  1. In the Cloud Dataflow dashboard in Stackdriver, go to the Alerting menu and select Policies Overview.

  2. Click on Add Policy. Add policy

  3. In the Create new alerting policy page, you can define the alerting conditions and the channels of communication for alerts.
    For example, to set an alert on the System Lag for the WindowedWordCount Dataflow pipeline group, select “Dataflow Job” in the Resource Type dropdown, “Group” in the Applies To dropdown, and “System Lag” in the If Metric dropdown. Create alert

  4. After you've created an alert, you can review the events related to Dataflow by navigating to Alerting > Events. Every time an alert is triggered by a Metric Threshold condition, an Incident and a corresponding Event are created in Stackdriver. If you specified a notification mechanism in the alert (email, SMS, etc), you will also receive a notification. Incident alert

Build your own custom monitoring dashboard

You can build Stackdriver Monitoring dashboards with the most relevant Dataflow-related charts.

  1. In the Cloud Dataflow dashboard in Stackdriver, go to the Dashboards menu and select Create Dashboard.

  2. Click on Add Chart.

  3. In the Add Chart window, select "Dataflow Job" as the Resource Type, select a metric you want to chart in the Metric Type field, and select a group that contains Dataflow pipelines in the Filter panel. Add chart

You can add as many charts to the dashboard as you like.

What's next

To learn more, consider exploring these other resources:

Send feedback about...

Cloud Dataflow Documentation