Gemini Online Inference on Vertex AI Service Level Agreement (SLA)

During the Term of the agreement under which Google has agreed to provide Google Cloud Platform to Customer (as applicable, the "Agreement"), the Covered Service will provide a Monthly Uptime Percentage to Customer (the "Service Level Objective" or "SLO") as follows:

Covered Service | Monthly Uptime Percentage
generateContent and streamGenerateContent methods of Gemini Online Inference on Vertex AI | 99.5%

If Google does not meet the SLO, and if Customer meets its obligations under this SLA, Customer will be eligible to receive the Financial Credits described below. Monthly Uptime Percentage and Financial Credit are determined on a calendar month basis per Project. This SLA states Customer's sole and exclusive remedy for any failure by Google to meet the SLO. Capitalized terms used in this SLA, but not defined in this SLA, have the meaning set forth in the Agreement. If the Agreement authorizes the resale or supply of Google Cloud Platform under a Google Cloud partner or reseller program, then all references to Customer in this SLA mean Partner or Reseller (as applicable), and any Financial Credit(s) will only apply for impacted Partner or Reseller order(s) under the Agreement.

Definitions

The following definitions apply to the SLA:

  • "Covered Service" means the generateContent and streamGenerateContent methods of Gemini Online Inference API on Vertex AI.
  • "Downtime" means more than a five percent Error Rate and is measured based on server side Error Rate.
  • "Downtime Period" means a period of five or more consecutive minutes of Downtime. Partial minutes or intermittent Downtime for a period of less than five minutes will not count towards any Downtime Periods.
  • "Error Rate" means the number of Valid Requests that result in an error code response with HTTP Status 5XX divided by the total number of Valid Requests during the Downtime Period.
  • "Financial Credit" means the following:

    Monthly Uptime Percentage | Percentage of monthly bill for the Covered Service that does not meet SLO that will be credited to Customer's future monthly bills
    99.0% - < 99.5% | 10%
    95.0% - < 99.0% | 25%
    < 95.0% | 50%

  • "Monthly Uptime Percentage" means the total number of minutes in a month, minus the number of minutes of Downtime suffered from all Downtime Periods in a month, divided by the total number of minutes in a month.
  • "Valid Requests" are requests that conform to the Documentation, and that would normally result in a non-error response.
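
As an illustration only, the definitions above can be sketched in Python. This is not Google's measurement method: the per-minute evaluation of Error Rate, the treatment of minutes with no Valid Requests as not being Downtime, and every name in the block (MinuteStats, downtime_minutes, and so on) are assumptions made for this example. The SLA itself measures Downtime on the server side.

```python
from dataclasses import dataclass
from typing import Sequence

MIN_PERIOD_MINUTES = 5  # "five or more consecutive minutes of Downtime"


@dataclass
class MinuteStats:
    """Hypothetical per-minute counters for one Project."""
    valid_requests: int  # Valid Requests received in this minute
    errors_5xx: int      # of those, how many returned HTTP 5XX


def is_downtime_minute(m: MinuteStats) -> bool:
    """True when the minute's Error Rate is strictly more than five percent."""
    if m.valid_requests == 0:
        return False  # assumption: no Valid Requests means no Downtime
    # Integer comparison, equivalent to errors / valid > 0.05 without float error.
    return m.errors_5xx * 100 > m.valid_requests * 5


def downtime_minutes(minutes: Sequence[MinuteStats]) -> int:
    """Sum the minutes that fall inside runs of five or more Downtime minutes."""
    total = 0
    run = 0
    for m in minutes:
        if is_downtime_minute(m):
            run += 1
            continue
        if run >= MIN_PERIOD_MINUTES:
            total += run
        run = 0
    if run >= MIN_PERIOD_MINUTES:  # a run that reaches the end of the month
        total += run
    return total


def monthly_uptime_percentage(minutes: Sequence[MinuteStats]) -> float:
    """(total minutes - Downtime Period minutes) / total minutes, as a percentage."""
    if not minutes:
        raise ValueError("expected one entry per minute of the month")
    return 100.0 * (len(minutes) - downtime_minutes(minutes)) / len(minutes)


def credit_percentage(uptime_pct: float) -> int:
    """Map a Monthly Uptime Percentage to the credit tier in the table above."""
    if uptime_pct >= 99.5:
        return 0   # SLO met, no Financial Credit
    if uptime_pct >= 99.0:
        return 10
    if uptime_pct >= 95.0:
        return 25
    return 50
```

For example, in a 30-day month (43,200 minutes) with a single 300-minute Downtime Period, the sketch gives 42,900 / 43,200, or roughly 99.31%, which falls in the 10% tier. A four-minute burst of errors would not count, because it is shorter than a Downtime Period.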

Customer Must Request Financial Credit

In order to receive any of the Financial Credits described above, Customer must notify Google technical support within 30 days from the time Customer becomes eligible to receive a Financial Credit. Customer must also provide Google with log files showing Downtime Periods and the date and time they occurred. If Customer does not comply with these requirements, Customer will forfeit its right to receive a Financial Credit. Further, Customer must be in good standing under the Agreement in order to receive a Financial Credit.

Maximum Financial Credit

The maximum aggregate number of Financial Credits issued by Google to Customer for all Downtime Periods in a single billing month will not exceed 50% of the amount due from Customer for the Covered Service that did not meet SLO for the applicable month. Financial Credits will be in the form of a monetary credit applied to future use of the Covered Service and will be applied within 60 days after the Financial Credit was agreed to by Google.
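
The arithmetic behind the cap can be sketched as follows. This is a hedged illustration, not a statement of how Google computes or rounds credits: the function name financial_credit, the use of Decimal, and rounding to two decimal places are all assumptions, and monthly_bill stands for the amount due for the Covered Service in the Project that missed the SLO.

```python
from decimal import Decimal

MAX_CREDIT_FRACTION = Decimal("0.50")  # credits will not exceed 50% of the amount due


def financial_credit(monthly_bill: Decimal, credit_pct: int) -> Decimal:
    """Return the credit amount for one month, capped at 50% of the bill.

    credit_pct is the tier percentage (10, 25, or 50) from the Financial
    Credit table; rounding to two decimal places is an assumption.
    """
    if monthly_bill < 0:
        raise ValueError("monthly_bill must not be negative")
    fraction = min(Decimal(credit_pct) / 100, MAX_CREDIT_FRACTION)
    return (monthly_bill * fraction).quantize(Decimal("0.01"))
```

With a hypothetical monthly bill of 2,000.00 for the Covered Service and a 25% tier, the sketch yields a credit of 500.00 toward future use of the Covered Service.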

SLA Exclusions

The SLA does not apply to any (a) features or services designated pre-general availability (unless otherwise set forth in the associated Documentation); (b) features or services excluded from the SLA (in the associated Documentation), including requests made using Grounding with Google Search; or (c) errors (i) caused by factors outside of Google's reasonable control; (ii) that resulted from Customer's software or hardware or third party software or hardware, or both; (iii) that resulted from abuses or other behaviors that violate the Agreement; or (iv) that resulted from quotas applied by the system or listed in the Documentation or Admin Console.

Last modified: July 30, 2024