Take control of GKE scaling with new quota monitoring
Artemi Ollin
GKE Product Manager
Managing the growth of your Kubernetes clusters within Google Kubernetes Engine (GKE) just got easier. We've recently introduced the ability to directly monitor and set alerts for crucial scalability limits, providing you with deeper insight and control over your Kubernetes environment.
Effective scalability management is essential for avoiding outages and optimizing resource usage in Kubernetes. These new monitoring features bring you:
- Peace of mind: Address potential capacity issues proactively, before they cause problems and interrupt operations.
- Clearer understanding: Gain a deeper insight into your clusters’ architectural constraints, allowing for informed decision-making.
- Optimization opportunities: Analyze usage trends and identify ways to fine-tune your cluster configurations for optimal resource utilization.
Here are the specific limits you can now keep track of:
- Etcd database size (GiB): Understand how much space your Kubernetes cluster state is consuming.
- Nodes per cluster: Get proactive alerts on your cluster's overall node capacity.
- Nodes per node pool (all zones): Manage node distribution and limits across specific node pools.
- Pods per cluster (GKE Standard / GKE Autopilot): Ensure you have the pod capacity to support your applications.
- Containers per cluster (GKE Standard / GKE Autopilot): Prevent issues by understanding the maximum number of containers your cluster can support.
Get started
You'll find these new quota monitoring and alerting features directly in the Google Cloud console. To get there, use the link to a pre-filtered list of GKE quotas, or navigate to the Quotas page under the IAM & Admin section and filter by the Kubernetes Engine API service. To search for a specific quota, use the Filter table: you can filter by the exact quota name, location, cluster name, or node pool (where applicable).
You can also create alerts for a specific quota by following the guide. Alerts can notify you when a quota is approaching or has exceeded its limit.
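The "approaching or exceeded" condition for an alert comes down to comparing current usage against the limit. Here is a minimal Python sketch of that check. The function name `quota_status` and the 80% warning ratio are illustrative assumptions, not values defined by GKE or by the console's alerting feature.

```python
def quota_status(usage: float, limit: float, warn_ratio: float = 0.8) -> str:
    """Classify quota usage as 'ok', 'approaching', or 'exceeded'.

    warn_ratio is a hypothetical threshold (80% by default); pick whatever
    fraction gives your team enough time to react.
    """
    if limit <= 0:
        raise ValueError("limit must be positive")
    if usage >= limit:
        # Usage at or above the limit is treated as exceeded in this sketch.
        return "exceeded"
    if usage / limit >= warn_ratio:
        return "approaching"
    return "ok"


if __name__ == "__main__":
    # Illustrative numbers only; real usage and limits come from the Quotas page.
    for usage, limit in [(100, 1000), (850, 1000), (1000, 1000)]:
        print(usage, limit, quota_status(usage, limit))
```

A lower `warn_ratio` gives earlier warnings at the cost of more frequent notifications.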
To monitor GKE quota usage in greater depth, use the Cloud Monitoring API and console. The API gives you programmatic access to quota metrics, so you can build custom dashboards and alerts, while the console provides a graphical interface for the same tasks. Custom dashboards visualize quota usage over time, and alerts notify you when usage reaches a threshold you choose, helping you manage your quotas proactively and avoid unexpected outages. See the guide for more details.
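As a starting point for programmatic access, the sketch below builds a Cloud Monitoring filter string that selects quota metrics for the Kubernetes Engine API. It assumes the GKE limits are reported through the general `serviceruntime.googleapis.com` allocation quota metrics on the `consumer_quota` resource. The helper `build_quota_filter` is a hypothetical name, and any `quota_metric` value you pass is a placeholder you would replace with the exact name shown on the Quotas page.

```python
from typing import Optional

# General-purpose quota metric types in Cloud Monitoring.
USAGE_METRIC = "serviceruntime.googleapis.com/quota/allocation/usage"
LIMIT_METRIC = "serviceruntime.googleapis.com/quota/limit"


def build_quota_filter(
    metric_type: str,
    service: str = "container.googleapis.com",
    quota_metric: Optional[str] = None,
) -> str:
    """Return a Monitoring filter for one quota metric type and service."""
    clauses = [
        f'metric.type = "{metric_type}"',
        'resource.type = "consumer_quota"',
        f'resource.label.service = "{service}"',
    ]
    if quota_metric is not None:
        # Narrow the results to a single quota (placeholder name; assumption).
        clauses.append(f'metric.label.quota_metric = "{quota_metric}"')
    return " AND ".join(clauses)


if __name__ == "__main__":
    print(build_quota_filter(USAGE_METRIC))
    print(build_quota_filter(LIMIT_METRIC))
```

The resulting string can be supplied as the `filter` argument when listing time series through the Cloud Monitoring API, or reused in a dashboard or alerting policy. Querying usage and limit separately lets you compute a utilization ratio yourself.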
Need more information? Explore the official Google Cloud documentation for more in-depth guidance:
- Understanding GKE Quotas and Limits: Quotas and limits | Google Kubernetes Engine (GKE)
- Best practices for planning and designing large clusters: Plan for large GKE clusters | Google Kubernetes Engine (GKE)
- Setting up a quota alert: Monitor and alert with quota metrics
- Using GKE observability metrics: View observability metrics | Google Kubernetes Engine (GKE)