Operations (formerly Stackdriver)

Monitor, troubleshoot, and improve application performance on your Google Cloud environment.

Try Google Cloud free
  • action/check_circle_24px Created with Sketch.

    Collect metrics, logs, and traces across Google Cloud Platform and your applications

  • action/check_circle_24px Created with Sketch.

    Use built-in out-of-the-box dashboards and views to monitor the platform and applications

  • action/check_circle_24px Created with Sketch.

    Query and analyze these signals

  • action/check_circle_24px Created with Sketch.

    Set up appropriate performance and availability indicators

  • action/check_circle_24px Created with Sketch.

    Setup alerts and notification rules with your existing systems

Key features

Real-time log management and analysis

Cloud Logging is a fully managed service that performs at scale and can ingest application and system log data, as well as custom log data from GKE environments and VMs. Cloud Logging allows you to analyze and export selected logs to long-term storage in real time.

Built-in observability at scale

Cloud Monitoring provides visibility into the performance, uptime, and overall health of cloud-powered applications. Collect metrics, events, and metadata from Google Cloud services, hosted uptime probes, application instrumentation, and a variety of common application components.

Monitor and improve your apps performance

Application Performance Management (APM) includes tools to help you reduce latency and cost, so you can run more efficient applications. With Cloud Trace, Cloud Debugger, and Cloud Profiler, you gain insight into how your code and services are functioning and troubleshoot if needed.

View all features

Customers

Dacsee Founders
Dacsee: Disrupting the ride-hailing business with Google Kubernetes Engine.
Read the story

Story highlights

  • Reduced time spent on infrastructure management

  • Managing all operations products in one centralized platform

  • Saved ~80% of time that would be spent troubleshooting

Partner

Documentation

Tutorial
Dashboard API: Build your own Cloud Monitoring dashboard

Tips for shareable and reusable dashboard creation.

Tutorial
Get started with Cloud Logging

Guides and set-up docs to help you get up and running with Cloud Logging.

Tutorial
Cloud Audit Logs

Learn how Cloud Audit Logs maintains three audit logs: admin activity, data access, and system event.

Tutorial
Get started with Cloud Monitoring

Learn about Workspaces, monitoring agent, uptime checks, and other features.

Architecture
Google Cloud metrics

See which metrics Cloud Monitoring supports.

Google Cloud Basics
Monitoring and logging support for GKE

Learn about Google Kubernetes Engine’s native integration with Cloud Monitoring and Cloud Logging.

Architecture
Hybrid and multi-cloud deployments

This document discusses monitoring and logging architectures for hybrid and multi-cloud deployments.

Tutorial
Qwiklabs Quest: Google Cloud’s Operations Suite

In this fundamental-level quest, you’ll learn the ins and outs of Google Cloud's operations suite to generate insights into the health of your applications.

Use cases

Use case
Centralize your logging and operations

Integrated logging provides critical insights into platform events for development, DevOps/SRE, and security teams. Ingest logs from Google Cloud services and external sources for short-term operations and long-term log analysis. Use integrated audit logging to perform detailed forensic analysis. Integrate with your third-party logging systems using real-time log exports.

Cloud Logging collects all logs, including audit logs, platform logs, user logs, and external logs sent to the API, which are sent to the Logs Router where they are delivered to Cloud Logging, BigQuery, or externally via integration with Pub/Sub.
Use case
Build observability into apps and infrastructure

Cloud Logging and Cloud Monitoring services provide your SRE/DevOps teams with the observability needed to monitor Google Cloud, on-premises, and third-party providers. Logging and Monitoring are integrated with Security Command Center to provide the security and operations teams the insights they need.

Cloud Logging and Cloud Monitoring services provide your SRE/DevOps teams with the observability needed to monitor Google Cloud, on-premises, and third-party providers. Logging and Monitoring are integrated with Security Command Center to provide the security and operations teams the insights they need.
Use case
Reduce latency and inefficiency with APM

Reduce latency and cost by using Application Performance Management tools. Make your applications faster and more reliable whether they are hosted on Google Cloud or not. Use Cloud Trace’s distributed trace to see how requests propagate through your application. Use Cloud Profiler to help identify latency and inefficiency in your code. Troubleshoot your application in production without stopping or slowing down your apps by using Cloud Debugger.

Reduce latency and inefficiency with APM

All features

Log management Logs Router allows customers to control where logs are sent. All logs, including audit logs, platform logs, and user logs, are sent to the Cloud Logging API where they pass through the log router. The log router checks each log entry against existing rules to determine which log entries to discard, which to ingest, and which to include in exports.
Log insights Error Reporting analyzes and aggregates the errors in your cloud applications. Notifies you when new errors are detected.
Proactive monitoring Cloud Monitoring allows you to create alerting policies to notify you when metrics, health check results, and uptime check results meet specified criteria. Integrated with a wide variety of notification channels, including Slack and PagerDuty.
Custom visualization Cloud Monitoring Dashboards provides default out-of-the-box dashboards and allows you to define custom dashboards with powerful visualization tools to suit your needs.
Health check monitoring Cloud Monitoring provides endpoint checks to web applications and other internet-accessible services running on your cloud environment. You can configure uptime checks associated with URLs, groups, or resources, such as instances and load balancers.
Service monitoring Service Monitoring provides out-of-the-box telemetry and dashboards that allow troubleshooting in context through topology and context graphs, plus automation of health monitoring through SLOs and error budget management.
Latency management Cloud Trace provides latency sampling and reporting for App Engine, including per-URL statistics and latency distributions.
Debugging Cloud Debugger connects your application’s production data to your source code by inspecting the state of your application at any code location in production without stopping or slowing down your requests.
Performance and cost management Cloud Profiler provides continuous profiling of resource consumption in your production applications, helping you identify and eliminate potential performance issues.
Security management Cloud Audit Logs provides near real-time user activity visibility across Google Cloud.

Pricing

Control your own usage and spending: pay only for what you use. Free usage allotments let you get started with no up-front fees or commitments.

Free usage allotments let you get started with no up-front fees or commitments.

Partners

Get support from a rich and growing ecosystem of technology integrations to expand the IT ops, security, and compliance capabilities available to Google Cloud customers.