Empowering businesses with Google Cloud AI
Machine learning has produced business and research breakthroughs ranging from network security to medical diagnoses. We built the Tensor Processing Unit (TPU) so that anyone can achieve similar breakthroughs. Cloud TPU is the custom-designed machine learning ASIC that powers Google products like Translate, Photos, Search, Assistant, and Gmail. Here’s how you can put the TPU and machine learning to work accelerating your company’s success, especially at scale.
Built for AI on Google Cloud
Cloud TPU is designed to run cutting-edge machine learning models with AI services on Google Cloud. Connected by a custom high-speed network, a single pod delivers up to 11.5 petaflops of performance, enough computational power to transform your business or create the next research breakthrough.
Iterate faster on your ML solutions
Training machine learning models is like compiling code: you need to update often, and you want to do so as efficiently as possible. ML models need to be trained over and over as apps are built, deployed, and refined. Cloud TPU’s robust performance and low cost make it ideal for machine learning teams looking to iterate quickly and frequently on their solutions.
Proven, state-of-the-art models
You can build your own machine learning-powered solutions for many real-world use cases. Just bring your data, download a Google-optimized reference model, and start training.
Cloud TPU offering
Cloud TPU v2
Cloud TPU v3
Cloud TPU v2 Pod (alpha)
Cloud TPU features
Get started immediately with our growing library of models optimized for Cloud TPU. These models are tuned for performance, accuracy, and quality in image classification, object detection, language modeling, speech recognition, and more.
Connect Cloud TPUs to custom machine types
You can connect to Cloud TPUs from custom AI Platform Deep Learning VM Image types, which can help you optimally balance processor speeds, memory, and high-performance storage resources for your workloads.
Fully integrated with Google Cloud Platform
At their core, Cloud TPUs and Google Cloud’s data and analytics services are fully integrated with other Google Cloud Platform offerings, like Google Kubernetes Engine (GKE). So when you run machine learning workloads on Cloud TPUs, you benefit from GCP’s industry-leading storage, networking, and data analytics technologies.
Preemptible Cloud TPU
You can save money by using preemptible Cloud TPUs for fault-tolerant machine learning workloads, such as long training runs with checkpointing or batch prediction on large datasets. Preemptible Cloud TPUs are 70% cheaper than on-demand instances, making everything from your first experiments to large-scale hyperparameter searches more affordable than ever.
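To make the "long training runs with checkpointing" idea concrete, here is a minimal sketch of a resumable training loop. It is not the Cloud TPU API: the function names (`save_checkpoint`, `load_checkpoint`, `train`), the JSON checkpoint file, and the `preempt_at` parameter are all hypothetical, and the "training step" is a stand-in that simply increments a number. The point is only the pattern: save progress periodically, and on restart pick up from the last saved step.

```python
import json
import os


def save_checkpoint(path, state):
    """Write the state to a temp file, then rename it over the old checkpoint.

    The rename keeps the previous checkpoint intact if the job is
    interrupted in the middle of a write.
    """
    tmp_path = path + ".tmp"
    with open(tmp_path, "w") as f:
        json.dump(state, f)
    os.replace(tmp_path, path)


def load_checkpoint(path):
    """Return the last saved state, or a fresh state if none exists."""
    if os.path.exists(path):
        with open(path) as f:
            return json.load(f)
    return {"step": 0, "weight": 0.0}


def train(path, total_steps, checkpoint_every, preempt_at=None):
    """Run (or resume) a toy training loop and return the final state.

    preempt_at is a hypothetical knob that simulates the device being
    reclaimed when the loop reaches that step.
    """
    state = load_checkpoint(path)
    while state["step"] < total_steps:
        if preempt_at is not None and state["step"] == preempt_at:
            raise RuntimeError("simulated preemption")
        state["weight"] += 0.5  # stand-in for one real training step
        state["step"] += 1
        if state["step"] % checkpoint_every == 0:
            save_checkpoint(path, state)
    save_checkpoint(path, state)
    return state
```

In this sketch, a run that is interrupted loses at most `checkpoint_every` steps of work; calling `train` again with the same path resumes from the last checkpoint rather than from step 0. A real workload would save model weights and optimizer state through its framework's own checkpointing mechanism, typically to durable storage.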
You are charged for the time you use on-demand (non-preemptible) and preemptible Cloud TPUs to train machine learning models. For detailed pricing information, please view the pricing guide.
Single Cloud TPU v2 device pricing
The following table shows the pricing per region for using a single Cloud TPU v2 device.
|Cloud TPU v2||$4.50 USD per TPU per hour.|
|Preemptible TPU v2||$1.35 USD per TPU per hour.|
|Cloud TPU v2||$4.95 USD per TPU per hour.|
|Preemptible TPU v2||$1.485 USD per TPU per hour.|
|Cloud TPU v2||$5.22 USD per TPU per hour.|
|Preemptible TPU v2||$1.566 USD per TPU per hour.|
Single Cloud TPU v3 device pricing
The following table shows the pricing per region for using a single Cloud TPU v3 device.
|Cloud TPU v3||$8.00 USD per TPU per hour.|
|Preemptible TPU v3||$2.40 USD per TPU per hour.|
|Cloud TPU v3||$8.80 USD per TPU per hour.|
|Preemptible TPU v3||$2.64 USD per TPU per hour.|
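As a rough illustration of how these hourly rates translate into the cost of a training run, the sketch below uses the first price pair from each table above ($4.50 and $1.35 for v2, $8.00 and $2.40 for v3). The function names and the 24-hour run length are assumptions made for the example, not part of any Cloud TPU tooling, and the calculation ignores any extra time a preempted job spends restarting.

```python
# Hourly prices in USD, copied from the first price pair in each table above.
ON_DEMAND = {"v2": 4.50, "v3": 8.00}
PREEMPTIBLE = {"v2": 1.35, "v3": 2.40}


def run_cost(hourly_rate, hours, devices=1):
    """Cost in USD of running `devices` TPU devices for `hours` hours."""
    return hourly_rate * hours * devices


def preemptible_discount(version):
    """Fraction saved by choosing preemptible over on-demand pricing."""
    return 1 - PREEMPTIBLE[version] / ON_DEMAND[version]


if __name__ == "__main__":
    hours = 24  # hypothetical run length
    for version in ("v2", "v3"):
        on_demand = run_cost(ON_DEMAND[version], hours)
        preemptible = run_cost(PREEMPTIBLE[version], hours)
        saving = preemptible_discount(version)
        print(f"{version}: on-demand ${on_demand:.2f}, "
              f"preemptible ${preemptible:.2f}, saving {saving:.0%}")
```

For a 24-hour run on a single v2 device, this works out to $108.00 on demand versus $32.40 preemptible, which matches the 70% saving quoted earlier. The same ratio holds for the v3 prices.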
Cloud TPU v2 Pod is in alpha. For more information, see the documentation on our product launch stages.