Dataproc is a managed Apache Spark and Apache Hadoop service that lets you take advantage of open
source data tools for batch processing, querying, streaming, and machine learning.
Dataproc automation helps you create clusters quickly, manage them easily, and save
money by turning clusters off when you don't need them. With less time and money spent on
administration, you can focus on your jobs and your data.
This course features a combination of lectures, demos, and hands-on labs to implement logistic regression using a machine learning library for Apache Spark running on a Dataproc cluster to develop a model for data from a multivariable dataset.