Data processing with Hadoop and Spark

Run Hadoop and Spark in a simpler, more cost-effective way. Learn more about this solution.

[Architecture diagram: Data Processing with Apache Hadoop and Apache Spark. Components shown: Clients (laptop); Jobs API (Spark, PySpark, Spark SQL, Hadoop, Hive, Pig); Cloud Dataproc clusters; Data Sources/Sinks (BigQuery, Cloud Bigtable, Cloud Storage); Stackdriver logging and monitoring.]

Related products in this architecture

Cloud Dataproc

Rethink how you operate Hadoop and Spark with Cloud Dataproc. Cut operations that took days or hours down to minutes or seconds. Spin up clusters as needed in 90 seconds and tear them down when you're done, so you have more time to focus on gaining value from your data. Plus, Cloud Dataproc gives you the same per-second billing flexibility used throughout Google Cloud Platform.
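To make the per-second billing point concrete, here is a minimal arithmetic sketch in Python. It compares what a short-lived cluster run would cost when usage is rounded up to the second versus rounded up to the hour. The hourly rate, the job length, and the `billed_cost` helper are all hypothetical and chosen only for illustration. They are not actual Cloud Dataproc prices or APIs, and any minimum charge that may apply is modeled as an optional parameter.

```python
import math


def billed_cost(runtime_seconds, hourly_rate,
                granularity_seconds=1, minimum_seconds=0):
    """Cost of one cluster run when usage is rounded up to a billing unit.

    Hypothetical helper for illustration only; not part of any Google API.
    """
    # Apply an optional minimum charge (assumed parameter, defaults to none).
    billable = max(runtime_seconds, minimum_seconds)
    # Round usage up to the next whole billing unit.
    units = math.ceil(billable / granularity_seconds)
    return units * granularity_seconds * hourly_rate / 3600.0


# Assumed numbers, not real prices.
HOURLY_RATE = 2.00          # hypothetical cost of the whole cluster per hour
runtime = 90 + 10 * 60      # 90 s spin-up (from the text) plus an assumed 10-minute job

per_second = billed_cost(runtime, HOURLY_RATE, granularity_seconds=1)
per_hour = billed_cost(runtime, HOURLY_RATE, granularity_seconds=3600)

print(f"billed per second: ${per_second:.2f}")
print(f"billed per hour:   ${per_hour:.2f}")
```

Under these assumed numbers, the run billed per second is charged only for the time the cluster actually existed, while the same run billed per hour is charged for a full hour. That gap is why creating a cluster per job and deleting it afterward can be practical.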

Learn more
Google Cloud

Get started

Learn and build

New to GCP? Get started with any GCP product for free with a $300 credit.

Need more help?

Our experts will help you build the right solution or find the right partner for your needs.