Operations

Monitor, troubleshoot, and improve infrastructure and application performance

Google Cloud’s operations suite (formerly Stackdriver) is designed to monitor, troubleshoot, and improve cloud infrastructure, software, and application performance. Efficiently build and run workloads, keeping applications performant and available.

  • Collect signals across Google Cloud internal and external apps, platforms, and services
  • Analyze and monitor your operational telemetry
  • Set up appropriate performance and availability indicators
  • Use built-in observability to troubleshoot and improve your applications
  • Automate ops using both out-of-the-box tools and tools customized through programmatic interfaces

Key features

For Ops, SecOps, SRE, DevOps

Real-time log management and analysis

Cloud Logging is a fully managed service that performs at scale and can ingest application and system log data, as well as custom log data from thousands of VMs. Cloud Logging allows you to analyze and export selected logs to long-term storage in real time.
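As a rough illustration of custom application log data, the sketch below uses only Python's standard library to write one JSON object per line with `severity` and `message` keys. Many Google Cloud runtimes and logging agents can parse lines like these as structured logs, but confirm the behavior in your own environment. The names `JsonLineFormatter` and `build_logger`, and the `fields` convention for extra data, are assumptions made for this example and are not part of any Cloud Logging API.

```python
import json
import logging
import sys


class JsonLineFormatter(logging.Formatter):
    """Render each log record as a single JSON object on one line."""

    def format(self, record: logging.LogRecord) -> str:
        entry = {
            "severity": record.levelname,  # e.g. INFO, WARNING, ERROR
            "message": record.getMessage(),
            "logger": record.name,
        }
        # Hypothetical convention: custom fields arrive via extra={"fields": {...}}.
        entry.update(getattr(record, "fields", {}))
        return json.dumps(entry)


def build_logger(stream, name: str = "checkout") -> logging.Logger:
    """Create a logger that writes JSON lines to the given stream."""
    logger = logging.getLogger(name)
    logger.handlers.clear()
    logger.setLevel(logging.INFO)
    logger.propagate = False  # avoid duplicate output through the root logger
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonLineFormatter())
    logger.addHandler(handler)
    return logger


if __name__ == "__main__":
    log = build_logger(sys.stdout)
    log.warning("payment retry", extra={"fields": {"order_id": "A-17", "attempt": 2}})
```

Writing to standard output keeps the application independent of any particular logging client; whatever collects the output decides how to ingest it.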

For Ops, SRE, DevOps

Built-in observability at scale

Cloud Monitoring provides visibility into the performance, uptime, and overall health of cloud-powered applications. Collect metrics, events, and metadata from Google Cloud services, hosted uptime probes, application instrumentation, and a variety of common application components.

For DevOps

Monitor and improve your app's performance

Application Performance Management (APM) includes tools to help you reduce latency and cost, so you can run more efficient applications. With Cloud Trace, Cloud Debugger, and Cloud Profiler, you gain insight into how your code and services are functioning and can troubleshoot when needed.

Customer stories

Highlights

  • Reduced time spent on infrastructure management using Cloud Monitoring

  • Increased productivity by managing all operations products in one centralized platform

  • Saved as much as 80% of time that would otherwise be spent troubleshooting

Documentation

Tutorial
Get started with Cloud Logging

Guides and setup docs to help you get up and running with Cloud Logging.

Tutorial
Cloud Audit Logs

Learn how Cloud Audit Logs maintains three audit logs: admin activity, data access, and system event.

Tutorial
Get started with Cloud Monitoring

Learn about Workspaces, the monitoring agent, uptime checks, and other features.

Architecture
Google Cloud metrics

See which metrics Cloud Monitoring supports.

Google Cloud basics
Monitoring and logging support for GKE

Learn about Google Kubernetes Engine’s native integration with Cloud Monitoring and Cloud Logging.

Common use cases

Manage cloud operations

Build observability into your platform through the use of integrated logging, monitoring, and application performance management tools.

Centralize your logging and operations

Integrated logging provides critical insights into platform events for development, DevOps/SRE, and security teams. Ingest logs from Google Cloud services and external sources for short-term operations and long-term log analysis. Use integrated audit logging to perform detailed forensic analysis. Integrate with your third-party logging systems using real-time log exports.

[Diagram: audit logs, service logs, app logs, syslog, and platform logs enter through the centralized Cloud Logging API and reach the Log Router, which applies log sink inclusion filters and exclusion filters. Destinations shown: Logging storage (log search and analysis, log-based metrics, log error analysis, dashboards from logs, alerting from logs), Cloud Storage, BigQuery, and Pub/Sub. Cost labels shown: US$0.50/GB* and cost per service.]

Cloud Logging collects all logs, including audit logs, platform logs, user logs, and external logs sent to the API. The logs pass through the Logs Router, which delivers them to Cloud Logging, to BigQuery, or to external systems through integration with Pub/Sub.
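The routing step described above can be pictured with a small in-memory model. This is only a sketch under stated assumptions: the `Sink` class, the `route` function, the sample filters, and the destination names are hypothetical, and the real service expresses filters in its own query language rather than as Python callables.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

LogEntry = Dict[str, str]
Filter = Callable[[LogEntry], bool]


@dataclass
class Sink:
    """Hypothetical sink: a destination plus inclusion and exclusion filters."""
    destination: str
    include: Filter
    exclusions: List[Filter] = field(default_factory=list)

    def accepts(self, entry: LogEntry) -> bool:
        # An entry must match the inclusion filter and no exclusion filter.
        if not self.include(entry):
            return False
        return not any(excluded(entry) for excluded in self.exclusions)


def route(entry: LogEntry, sinks: List[Sink]) -> List[str]:
    """Return every destination that accepts the entry (empty list = discarded)."""
    return [sink.destination for sink in sinks if sink.accepts(entry)]


# Example configuration (illustrative values only).
SINKS = [
    Sink("logging-storage", include=lambda e: True,
         exclusions=[lambda e: e["severity"] == "DEBUG"]),
    Sink("bigquery", include=lambda e: e["log"] == "audit"),
    Sink("pubsub", include=lambda e: e["severity"] == "ERROR"),
]

if __name__ == "__main__":
    print(route({"log": "app", "severity": "ERROR"}, SINKS))
```

Each sink is evaluated independently in this model, so one entry can be stored, exported to several destinations, or dropped entirely.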

Build observability into applications and infrastructure

Integrated observability provides critical insights into platform events for development, DevOps/SRE, and security teams. Cloud Monitoring provides centralized dashboards and alerting to efficiently operate services. Use integrated logging to power vulnerability detection and bring proactive intelligent monitoring to your security and operations teams. Customize your log monitoring using Cloud Functions and the Data Loss Prevention API.

[Diagram: labeled components include Google Cloud projects (Compute Engine, Google Kubernetes Engine, Cloud Functions, Dataproc, AI Platform, BigQuery, Cloud Storage, Pub/Sub), a data center (storage, compute, on-premises logs), a third-party log provider, BindPlane, Cloud Logging with the Logs Router, logging export options (BigQuery, Cloud Storage, Security Command Center), Event Threat Detection, logging metrics, Cloud Monitoring, alerts, dashboards, and operations users.]

Cloud Logging and Cloud Monitoring provide your SRE/DevOps teams with the observability needed to monitor Google Cloud, on-premises, and third-party providers. Logging and Monitoring are integrated with Security Command Center to give security and operations teams the insights they need.

BindPlane is a registered trademark of Blue Medora.

Reduce latency and inefficiency with Application Performance Management

Reduce latency and cost for your applications by using Application Performance Management tools. By understanding in detail how your applications behave in production, you can make them faster and more reliable, whether or not they are hosted on Google Cloud. Use distributed tracing in Cloud Trace to understand how requests propagate through your application. Use Cloud Profiler to help identify latency and inefficiency in your code. Use Cloud Debugger to troubleshoot your application in production without stopping or slowing it down.
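To make the idea of following a request through an application more concrete, here is a toy in-memory tracer. It is a sketch of the general distributed tracing concept (one trace ID per request, one span per unit of work, each span pointing at its parent) and not the Cloud Trace API; `Tracer`, `Span`, and `handle_request` are hypothetical names, and the sleeps stand in for real work.

```python
import time
import uuid
from contextlib import contextmanager
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Span:
    trace_id: str
    span_id: str
    parent_id: Optional[str]
    name: str
    duration_ms: float = 0.0


class Tracer:
    """Toy tracer that keeps spans in memory; a real one would export them."""

    def __init__(self) -> None:
        self.trace_id = uuid.uuid4().hex
        self.spans: List[Span] = []
        self._stack: List[Span] = []

    @contextmanager
    def span(self, name: str):
        # The innermost open span, if any, becomes the parent.
        parent = self._stack[-1].span_id if self._stack else None
        current = Span(self.trace_id, uuid.uuid4().hex[:16], parent, name)
        self._stack.append(current)
        start = time.perf_counter()
        try:
            yield current
        finally:
            current.duration_ms = (time.perf_counter() - start) * 1000
            self._stack.pop()
            self.spans.append(current)  # spans are recorded as they finish


def handle_request(tracer: Tracer) -> None:
    with tracer.span("GET /checkout"):
        with tracer.span("auth"):
            time.sleep(0.001)
        with tracer.span("db.query"):
            time.sleep(0.002)


if __name__ == "__main__":
    tracer = Tracer()
    handle_request(tracer)
    for s in tracer.spans:
        print(f"{s.name:15} parent={s.parent_id} {s.duration_ms:.1f} ms")
```

Because every span carries the same trace ID and a parent ID, the spans can be reassembled into a tree that shows where the request spent its time.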

[Diagram: labeled components include Google Cloud projects (Compute Engine, Google Kubernetes Engine, App Engine), data center and third-party cloud provider compute, Google Cloud APM libraries, Istio and OpenTelemetry traces, Trace & APM APIs carrying traces, debugging, and profiler telemetry, Cloud Trace, Cloud Debugger, Cloud Profiler, and the developer and DevOps/SRE users they serve.]

All features

Log management: Logs Router lets you control where logs are sent. All logs, including audit logs, platform logs, and user logs, are sent to the Cloud Logging API, where they pass through the Logs Router. The Logs Router checks each log entry against existing rules to determine which entries to discard, which to ingest, and which to include in exports.
Log insights: Error Reporting analyzes and aggregates the errors in your cloud applications and notifies you when new errors are detected.
Proactive monitoring: Cloud Monitoring allows you to create alerting policies to notify you when metrics, health check results, and uptime check results meet specified criteria. Alerting integrates with a wide variety of notification channels, including Slack and PagerDuty.
Custom visualization: Cloud Monitoring Dashboards provides default out-of-the-box dashboards and allows you to define custom dashboards with powerful visualization tools to suit your needs.
Health check monitoring: Cloud Monitoring provides endpoint checks for web applications and other internet-accessible services running in your cloud environment. You can configure uptime checks associated with URLs, groups, or resources, such as instances and load balancers.
Service monitoring: Service Monitoring provides out-of-the-box telemetry and dashboards that allow troubleshooting in context through topology and context graphs, plus automation of health monitoring through SLOs and error budget management.
Latency management: Cloud Trace provides latency sampling and reporting for App Engine, including per-URL statistics and latency distributions.
Debugging: Cloud Debugger connects your application’s production data to your source code by inspecting the state of your application at any code location in production without stopping or slowing down your requests.
Performance and cost management: Cloud Profiler provides continuous profiling of resource consumption in your production applications, helping you identify and eliminate potential performance issues.
Security management: Cloud Audit Logs provides near real-time visibility into user activity across Google Cloud.
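The SLO and error budget idea mentioned under service monitoring comes down to simple arithmetic, sketched below for a request-based SLO. The function name and parameters are hypothetical and this is not how Service Monitoring itself is configured; it only shows the calculation, assuming the budget is the share of requests allowed to fail within a compliance window.

```python
def error_budget_remaining(slo_target: float, total: int, failed: int) -> float:
    """Fraction of the error budget still unspent (negative means overspent).

    slo_target: required success ratio, e.g. 0.999
    total:      requests observed in the compliance window
    failed:     requests that did not meet the service level indicator
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total <= 0:
        return 1.0  # no traffic yet, so none of the budget has been spent
    allowed_failures = (1.0 - slo_target) * total
    return 1.0 - failed / allowed_failures


if __name__ == "__main__":
    # A 99.9% target over 1,000,000 requests allows about 1,000 failures.
    print(error_budget_remaining(0.999, 1_000_000, 250))
```

With a 99.9% target and 250 failures out of 1,000,000 requests, roughly three quarters of the budget remains; a negative result would mean the SLO has been violated for that window.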

Pricing

Control your own usage and spending: pay only for what you use. Free usage allotments let you get started with no up-front fees or commitments.

Partners

Get support from a rich and growing ecosystem of technology integrations to expand the IT ops, security, and compliance capabilities available to Google Cloud customers.