The Architecture Center provides content resources across a wide variety of AI and machine learning subjects. This page provides information to help you get started with generative AI, traditional AI, and machine learning. It also provides a list of all the AI and machine learning (ML) content in the Architecture Center.
Get started
The documents listed on this page can help you get started with designing, building, and deploying AI and ML solutions on Google Cloud.
Explore generative AI
Start by learning about the fundamentals of generative AI on Google Cloud, on the Cloud documentation site:
- To learn the stages of developing a generative AI application and explore the products and tools for your use case, see Build a generative AI application on Google Cloud.
- To identify when generative AI, traditional AI (which includes prediction and classification), or a combination of both might suit your business use case, see When to use generative AI or traditional AI.
- To define an AI business use case with a business value-driven decision approach, see Evaluate and define your generative AI business use case.
- To address the challenges of model selection, evaluation, tuning, and development, see Develop a generative AI application.
To explore a generative AI and machine learning blueprint that deploys a pipeline for creating AI models, see Build and deploy generative AI and machine learning models in an enterprise. The guide explains the entire AI development lifecycle, from preliminary data exploration and experimentation through model training, deployment, and monitoring.
Browse the following example architectures that use generative AI:
- Generative AI document summarization
- Generative AI knowledge base
- Generative AI RAG with Cloud SQL
- Infrastructure for a RAG-capable generative AI application using Vertex AI and Vector Search
- Infrastructure for a RAG-capable generative AI application using Vertex AI and AlloyDB for PostgreSQL
- Infrastructure for a RAG-capable generative AI application using GKE
- Model development and data labeling with Google Cloud and Labelbox
For information about Google Cloud generative AI offerings, see Vertex AI and running your foundation model on GKE.
Design and build
To select the best combination of storage options for your AI workload, see Design storage for AI and ML workloads in Google Cloud.
Google Cloud provides a suite of AI and machine learning services to help you summarize documents with generative AI, build image processing pipelines, and innovate with generative AI solutions.
Keep exploring
The documents that are listed later on this page and in the left navigation can help you build an AI or ML solution. The documents are organized in the following categories:
- Generative AI: Follow these architectures to design and build generative AI solutions.
- Model training: Implement machine learning, federated learning, and personalized intelligent experiences.
- MLOps: Implement and automate continuous integration, continuous delivery, and continuous training for machine learning systems.
- AI and ML applications: Build applications on Google Cloud that are customized for your AI and ML workloads.
AI and machine learning resources in the Architecture Center
You can filter the following list of AI and machine learning resources by typing a product name or a phrase that's in the resource title or description.
Architecture for MLOps using TensorFlow Extended, Vertex AI Pipelines, and Cloud Build This document describes the overall architecture of a machine learning (ML) system using TensorFlow Extended (TFX) libraries. It also discusses how to set up a continuous integration (CI), continuous delivery (CD), and continuous training (CT) for... Products used: Cloud Build |
Best practices for implementing machine learning on Google Cloud Introduces best practices for implementing machine learning (ML) on Google Cloud, with a focus on custom-trained models based on your data and code. Products used: Vertex AI, Vertex AI, Vertex AI, Vertex Explainable AI, Vertex Feature Store, Vertex Pipelines, Vertex Tensorboard |
Build an ML vision analytics solution with Dataflow and Cloud Vision API How to deploy a Dataflow pipeline to process large-scale image files with Cloud Vision. Dataflow stores the results in BigQuery so that you can use them to train BigQuery ML pre-built models. Products used: BigQuery, Cloud Build, Cloud Pub/Sub, Cloud Storage, Cloud Vision, Dataflow |
Build and deploy generative AI and machine learning models in an enterprise Describes the generative AI and machine learning (ML) blueprint, which deploys a pipeline for creating AI models. |
Cross-silo and cross-device federated learning on Google Cloud Provides guidance to help you create a federated learning platform that supports either a cross-silo or cross-device architecture. |
Data science with R on Google Cloud: Exploratory data analysis Shows you how to get started with data science at scale with R on Google Cloud. This document is intended for those who have some experience with R and with Jupyter notebooks, and who are comfortable with SQL. Products used: BigQuery, Cloud Storage, Notebooks, Vertex AI |
Deploy and operate generative AI applications Discusses techniques for building and operating generative AI applications using MLOps and DevOps principles. |
Design storage for AI and ML workloads in Google Cloud Map the AI and ML workload stages to Google Cloud storage options, and select the recommended storage options for your AI and ML workloads. Products used: Cloud Storage, Filestore, Persistent Disk |
Geospatial analytics architecture Learn about Google Cloud geospatial capabilities and how you can use these capabilities in your geospatial analytics applications. Products used: BigQuery, Dataflow |
Google Workspace Backup with Afi.ai Describes how to set up an automated Google Workspace backup using Afi.ai. Products used: Cloud Storage |
Guidelines for developing high-quality, predictive ML solutions Collates some guidelines to help you assess, ensure, and control quality in machine learning (ML) solutions. |
Infrastructure for a RAG-capable generative AI application using GKE Shows you how to design the infrastructure for a generative AI application with RAG using GKE. Products used: Cloud SQL, Cloud Storage, Google Kubernetes Engine (GKE) |
Design infrastructure to run a generative AI application with retrieval-augmented generation. Products used: AlloyDB for PostgreSQL, BigQuery, Cloud Logging, Cloud Monitoring, Cloud Pub/Sub, Cloud Run, Cloud Storage, Document AI, Vertex AI |
Infrastructure for a RAG-capable generative AI application using Vertex AI and Vector Search Design infrastructure for a generative AI application with retrieval-augmented generation (RAG). Products used: BigQuery, Cloud Logging, Cloud Monitoring, Cloud Pub/Sub, Cloud Run, Cloud Storage, Vertex AI |
Jump Start Solution: AI/ML image processing on Cloud Functions Analyze images by using pretrained machine learning models and an image-processing app deployed on Cloud Functions. |
Jump Start Solution: Analytics lakehouse Unify data lakes and data warehouses by creating an analytics lakehouse using BigQuery to store, process, analyze, and activate data. |
Jump Start Solution: Data warehouse with BigQuery Build a data warehouse with a dashboard and visualization tool using BigQuery. |
Jump Start Solution: Generative AI document summarization Process and summarize documents on demand by using Vertex AI Generative AI and large language models (LLMs). |
Jump Start Solution: Generative AI Knowledge Base Extract question and answer pairs from documents on demand by using Vertex AI Generative AI and large language models (LLMs)... |
Jump Start Solution: Generative AI RAG with Cloud SQL Deploy a retrieval augmented generation (RAG) application with vector embeddings and Cloud SQL. |
MLOps: Continuous delivery and automation pipelines in machine learning Discusses techniques for implementing and automating continuous integration (CI), continuous delivery (CD), and continuous training (CT) for machine learning (ML) systems. |
Model development and data labeling with Google Cloud and Labelbox Provides guidance for building a standardized pipeline to help accelerate the development of ML models. |
Serve Spark ML models using Vertex AI Shows how to serve (run) online predictions from machine learning (ML) models that are built by using Spark MLlib and managed by using Vertex AI. Products used: Vertex AI |
Use generative AI for utilization management A reference architecture for health insurance companies to automate prior authorization (PA) request processing and improve their utilization review (UR) processes. Products used: BigQuery, Cloud Logging, Cloud Monitoring, Cloud Pub/Sub, Cloud Run, Cloud Storage, Document AI, Vertex AI |
Use Vertex AI Pipelines for propensity modeling on Google Cloud Describes an example of an automated pipeline in Google Cloud that performs propensity modeling. Products used: BigQuery, Cloud Functions, Vertex AI |