How Gemini on GDC connected API on GDC connected works

This page describes how Gemini on Google Distributed Cloud connected API works, including information about its infrastructure, hardware and software capabilities.

Overview

Gemini on GDC connected API is a turnkey software solution that lets you deploy Gemini models on your own Google-certified hardware connected to your local network. The models are hosted within Confidential Virtual Machines (CVMs) deployed on Distributed Cloud connected clusters. The required hardware has a specific configuration established and certified by Google.

Google-certified hardware

Gemini on GDC connected API requires Google-certified hardware configured exactly as described in Google-certified hardware. You must order and install this hardware on your premises before ordering Gemini on GDC connected API. You must provide the corresponding service tags and other hardware identification information to Google when ordering Gemini on GDC connected API, as described in Installation requirements.

The Google-certified hardware has the following specifications:

  • Platform: Dell XE9680 server chassis
  • CPU: Two (2) Intel Xeon Platinum 8592+ 64-core processors
  • Memory: 2TB DDR5 5600 MT/s RDIMM RAM
  • GPU: NVIDIA HGX H200 SXM assembly with eight (8) NVIDIA H200 series GPUs
  • Storage: FIPS-certified NVMe storage configured as follows:
    • 2x 960 GB drives in hardware RAID1 for the operating system
    • 2x 15.36 TB drives in software RAID1 reserved for system-reserved functions
    • 6x 15.36 TB drives for storing data for the confidential virtual machines that locally serve your Gemini on GDC connected API endpoints.
  • Networking:
    • Two (2) Mellanox ConnectX-6 dual-port 100 GbE QFSP56 network adapters
    • One (1) integrated Broadcom 5720 1 GbE network adapter
  • Management: Integrated Dell Remote Access Controller 9 (iDRAC9) Datacenter 16G for embedded system management
  • Power: Six (6) 2800W Titanium-certified power supplies

A single deployment of Gemini on GDC connected API consists of one (1) Google-certified machine. Gemini on GDC connected API does not run on non-Google-certified hardware configurations.

Confidential virtual machines

The models served by Gemini on GDC connected API are hosted within Confidential Virtual Machines running on Distributed Cloud connected clusters. Remote attestation ensures the security and integrity of the deployed CVMs and the secure software environment that runs them.

Remote attestation

Gemini on GDC connected API requires a constant internet connection to maintain remote attestation of the deployed CVMs and the secure software environment that hosts them. The attestation mechanism is implemented using Intel® Software Guard Extension (Intel® SGX) Remote Attestation Services. You must have an Intel® SGX Provisioning Certification Service subscription and provide the corresponding API keys to Google when ordering Gemini on GDC connected API.

Remote attestation failure

The remote attestation service establishes a heartbeat between your Gemini on GDC connected API deployment and Google, which requires an uninterrupted internet connection. If this heartbeat is interrupted for more than 30 consecutive minutes, the CVMs are shut down until the heartbeat is reestablished. Contact your Google representative to resolve attestation failures.

Supported models

This release of Gemini on GDC connected API supports the following models:

  • Gemini 2.0 Flash 128k with public ID gemini-2.0-flash-001.

For more information on the workloads suitable for Gemini on GDC connected API, see Gemini overview.

Deployment steps

At a high level, the deployment of a Gemini model on Gemini on GDC connected API looks as follows:

  1. Procure and install the Google-certified hardware, as described in Hardware.
  2. Complete the steps necessary to satisfy all of the installation requirements described in Installation requirements.
  3. Order Gemini on GDC connected API as described in Order Gemini on Google Distributed Cloud connected API on Distributed Cloud connected. When ordering, you must provide the information listed in Installation requirements so that Google can provision your Gemini on GDC connected API deployment for software management and remote attestation of the CVMs that host the models you want to deploy.
  4. Schedule a visit with a Google technician who must visit your location to install and configure the Gemini on GDC connected API software on your Google-certified hardware.
  5. Enable the required APIs, as described in Enable the required APIs.
  6. Grant your Gemini on Google Distributed Cloud connected API administrators and Gemini on GDC connected API model consumer accounts the required roles, as described in Grant the required roles.
  7. Create and deploy your Gemini on GDC connected API endpoints, as described in Deploy a Gemini on GDC connected API endpoint.
  8. Interact with your deployed Gemini on GDC connected API endpoints as described in Interact with a Gemini on GDC connected API endpoint.

What's next