GKE 上的 AI/机器学习编排文档
使用 Google Kubernetes Engine (GKE) 平台编排功能运行经过优化的 AI/机器学习工作负载。借助 Google Kubernetes Engine (GKE),您可以实现一个可直接用于生产环境的强大 AI/机器学习平台,并具备托管式 Kubernetes 的所有优势和以下功能:
- 支持使用 GPU 和 TPU 大规模训练和服务工作负载的基础设施编排。
- 与分布式计算和数据处理框架灵活集成。
- 在同一基础设施上支持多个团队,以最大限度地提高资源利用率。
文档资源
相关资源
相关视频
Why GKE is perfect for running batch workloads
Kubernetes is the top container orchestration platform for batch workloads like data processing, machine learning, and scientific simulations. In this video, Mofi Rahman, Cloud Advocate at Google, discusses why Google Kubernetes Engine (GKE) is the
What's new in Kubernetes: Run batch and high performance computing in GKE
Learn best practices on how to run batch and high-performance computing workloads on Google Kubernetes Engine (GKE) and how PGS used these to replace their 260,000-core Cray supercomputers. Hear about the latest feature launches in the data
Intro to building large GKE clusters - part 2
In this episode of GKE Essentials, we continue our 2-part series on building large scale GKE clusters, this time exploring how to think about your usage of GKE and foundational Google Cloud resources you need when building a large GKE cluster. Watch
Intro to building large GKE clusters
Have you ever wanted to scale up GKE clusters? In this episode of GKE Essentials, Anthony shows why you might want to build larger GKE clusters and how you can design clusters to be able to scale. Be sure to stay tuned for part two coming soon!
Designing Google Kubernetes clusters for massive scale and performance
Being an industry leader in running massive scale container workloads is what Google is known for. Learn from us and our customers the key best practices for designing clusters and workloads for large scale and performance on GKE. This session will