Developers & Practitioners

Hands-on with Gemma 3 on Google Cloud

November 17, 2025

https://storage.googleapis.com/gweb-cloudblog-publish/images/gemma_3_hero.max-2500x2500.png

Olivier Bourgeois

Developer Relations Engineer

The landscape of generative AI is shifting. While proprietary APIs are powerful, there is a growing demand for open models—models where the architecture and weights are publicly available. This shift puts control back in the hands of developers, offering transparency, data privacy, and the ability to fine-tune for specific use cases.

To help you navigate this landscape, we are releasing two new hands-on labs featuring Gemma 3, Google’s latest family of lightweight, state-of-the-art open models.

Why Gemma?

Built from the same research and technology as Gemini, Gemma models are designed for responsible AI development. Gemma 3 is particularly exciting because it offers multimodal capabilities (text and image) and fits efficiently on smaller hardware footprints while delivering massive performance.

But running a model on your laptop is very different from running it in production. You need scale, reliability, and hardware acceleration (GPUs). The question is: Where should you deploy?

We have prepared two different paths for you, depending on your infrastructure needs: Cloud Run or Google Kubernetes Engine (GKE).

Path 1: The Serverless Approach (Cloud Run)

Best for: Developers who want an API up and running instantly without managing infrastructure, scaling to zero when not in use.

If your priority is simplicity and cost-efficiency for stateless workloads, Cloud Run is your answer. It abstracts away the server management entirely. With the recent addition of GPU support on Cloud Run, you can now serve modern LLMs without provisioning a cluster.

Path 2: The Platform Approach (GKE)

Best for: Engineering teams building complex AI platforms, requiring high throughput, custom orchestration, or integration with a broader microservices ecosystem.

When your application graduates from a prototype to a high-traffic production system, you need the control of Kubernetes. GKE Autopilot gives you that power while still handling the heavy lifting of node management. This path creates a seamless journey from local testing to cloud production.

Which Path Will You Choose?

Whether you are looking for the serverless simplicity of Cloud Run or the robust orchestration of GKE, Google Cloud provides the tools to take Gemma 3 from a concept to a deployed application.

Dive into the labs today and start building:

Share your progress and connect with others on the journey using the hashtag #ProductionReadyAI. Happy learning!

These labs are part of the Open Models module in our official Production-Ready AI with Google Cloud program. Explore the full curriculum for more content that will help you bridge the gap from a promising prototype to a production-grade AI application.

Posted in

Developers & Practitioners

https://storage.googleapis.com/gweb-cloudblog-publish/images/finetuning-hero-image.max-700x700.jpg

Developers & Practitioners

Mastering Model Adaptation: A Guide to Fine-Tuning on Google Cloud

By Drew Brown • 5-minute read

https://storage.googleapis.com/gweb-cloudblog-publish/images/header-alt-text-2436x1200.max-700x700.png

Developers & Practitioners

7 Technical Takeaways from Using Gemini to Generate Code Samples at Scale

By Nim Jayawardena • 8-minute read

https://storage.googleapis.com/gweb-cloudblog-publish/images/GEAR_Website_Graphics_1920x1080-2.max-700x700.png

AI & Machine Learning

Gemini Enterprise Agent Ready (GEAR) program now available, a new path to building AI agents at scale

By Peder Ulander • 2-minute read

https://storage.googleapis.com/gweb-cloudblog-publish/images/build-anything-gemini-3-antigravity.max-700x700.png

Developers & Practitioners

Agent Factory Recap: Build an AI Workforce with Gemini 3

By Smitha Kolan • 6-minute read

Hands-on with Gemma 3 on Google Cloud

Olivier Bourgeois

Why Gemma?

Path 1: The Serverless Approach (Cloud Run)

Path 2: The Platform Approach (GKE)

Which Path Will You Choose?

Related articles

Mastering Model Adaptation: A Guide to Fine-Tuning on Google Cloud

7 Technical Takeaways from Using Gemini to Generate Code Samples at Scale

Gemini Enterprise Agent Ready (GEAR) program now available, a new path to building AI agents at scale

Agent Factory Recap: Build an AI Workforce with Gemini 3