This page provides links to reliability-focused documentation that supplements the architectural guidance in the Google Cloud infrastructure reliability guide.
Design scalable and resilient applications.
Patterns and best practices for building cloud applications that are resilient and scalable.
-
Best practices to help you identify, prevent, detect, and respond to ransomware attacks.
Plan for disaster recovery (DR).
A series of documents that focus on designing for DR in the cloud.
Create reliable operational processes and tools.
Operational principles to run workloads in the cloud reliably, including guidance about deploying updates to production and testing for failures.
Learn about deployment archetypes for cloud applications
An article in ACM Computing Surveys that explores six cloud-based deployment archetypes and their tradeoffs between high availability, latency, and cost.
Learn about Google Cloud deployment archetypes.
Describes the following six deployment archetypes: zonal, regional, multi-regional, global, hybrid, and multicloud. It also presents Google Cloud-specific design considerations for each deployment archetype.
Learn how Google Cloud manages changes.
Explains how Google Cloud teams apply reliability best practices and engineering standards to develop and release changes to our platform and services.