Vertex AI Search pricing

Prices are listed in US Dollars (USD).

Pricing for listed Generally Available (GA) functionality is invoked on Sep 1, 2023.

Vertex AI Search lets developers, even those with limited machine learning skills, tap into the power of Google's foundation large language models, search and recommendations expertise to create enterprise-grade generative AI applications

✩ Google-internal note: For more information on pricing, see go/vertexsc-pricing.

Vertex AI Search provides the ability to quickly build Search Engines for website, unstructured data, structured data to retrieve information and generate grounded answers. You can read more about the features available in Vertex AI Search.

Vertex AI Search offers two pricing models: General and Configurable. You must choose one model for your search application and its associated data stores.

General Pricing

Configurable Pricing

Best for

Getting started fast

Workloads with less than 15 M queries

Unpredictable traffic

Workloads with more than 15M queries per month

Predictable traffic / throughput need

Flexibility to not always use semantic for all queries

Pricing Meter

Pay-per-query and per-GB of data indexed

Monthly subscription for query capacity (QPM) and storage, with pay-per-query for advanced add-on features

Important: A data store created with the Configurable pricing model can only be used by a search app that also uses the Configurable pricing model.

Vertex AI Search General Pricing

This model is based on pay-as-you-go pricing for search queries and data storage.

Free Trial: You'll have access to 10,000 queries per account, per month at no cost to explore Vertex AI Search without an initial investment. Excludes Advanced Generative Answers.

Pricing for search queries

Type

Price (USD)

Search Standard Edition

Includes semantic retrieval and kpi optimization

$1.50 / 1,000 query

Search Enterprise Edition

includes Core Generative Answers (AI Mode)

$4.00 / 1,000 query

Advanced Generative Answers (AI Mode)

(can be added to both Standard Edition and Enterprise Edition)

+$4.00 / 1,000 user input query

Search Standard Edition – Unstructured Search + Structured Search capabilities

Search Enterprise Edition – Unstructured Search + Structured Search + Website Search capabilities

Vertex AI Search Enterprise Edition includes core Generative Answers (AI Mode). Provide answers, summaries and follow ups at no additional cost. Note: Core Generative Answers do not include suggested follow ups, complex query handling, long query handling, and multimodality.

Advanced Generative Answers includes advanced features such as suggested follow ups, complex query handling, and multimodality.


Query: Billing metered per individual Request or Query Input

A request or query is defined as any API call to Enterprise Search, whether direct with API usage or indirect with integration or console usage.

For example, when a user asks a natural language question, and the search engine responds that is one query.


Advanced Generative Answers (AI Mode) is added to a Query to augment the processing to use a Generative AI feature,

Advanced Generative Answers (AI Mode) can be used interchangeably with either Search Standard Edition or Search Enterprise Edition.

Example: executing a Search Enterprise query with a multimodal request = 1 Search Enterprise query + 1 Advanced Generative Answers (AI Mode) query

Pricing for indexing / data storage

See Index Storage pricing section

Pricing Example for Vertex AI Search GA functionality

Example of unstructured data : Document Search

Assumption:

  • 10M Standard Edition queries annually
  • 10M Enterprise Edition queries with 2M Advanced Generative Answers (AI Mode) queries annually
  • 100K documents @ ~1mb each

Item

SKU Volume

Rate

Total List Price

Standard Search

10M annual (with 10k free trial)

* $1.50 / 1k queries =

$14,985

Search Enterprise with core Generative Answers (AI Mode)

10M annual (with 10k free trial)

* $4.00 / 1k queries =

$39,960

Advanced Generative Answers (AI Mode)

Each query added to either Standard or Enterprise query

2M annual

* +$4.00 / 1k user input queries =

+$8,000

Data Indexed

100GB annual 10GB free so 90GB used in price calc

* $5.00 / GB * 12 month =

$5,400

Vertex AI Search Configurable Pricing

This model provides predictable costs through monthly subscriptions for core search capacity, with optional pay-as-you-go add-ons for additional features. It is designed for customers with consistent workloads seeking greater cost control.

Minimum monthly commitment: 1000 queries per minute (QPM) and 50 GB of storage.

Core Subscription Pricing (Billed Monthly)

SKU

Price (USD)

Description

Query Unit

$0.008219178 / 1 hour

A subscription for your search application's query throughput capacity

Storage Unit

$0.001369863 / 1 hour

A subscription for the raw data stored for indexing

Pay-As-You-Go Add-ons. (Billed per 1,000 count)

Add-on

Price (USD)

Description

Semantic

$0.75 / 1k count + $1.50 / GB / month for embeddings

Enables semantic understanding, hybrid search, and is required for AI Overview and AI Mode.

The additional storage charge covers the cost of generating and maintaining embeddings.

KPI & Personalization

$0.20 / 1,000 count

Enables event re-ranking and personalization to optimize for business KPIs (engagement, conversion)

Core Generative Answers

$2.00 / 1,000 count

Generates answers, citations, and follow-ups. Requires the Semantic add-on.

Advanced Generative Answers (AI Mode)

$4.00 / 1k count

Handles complex, multi-turn conversational count and multimodality. Requires the Semantic add-on.

Overages: Usage exceeding your subscribed QPM is considered an overage. By default, overage count are billed at the General Pricing model's Standard Edition rate ($1.50 / 1,000 queries).

Scaling: You can scale up your QPM or Storage subscription at any time, with costs pro-rated for the remainder of the month. Scaling down takes effect at the beginning of the next billing cycle.

Example for Vertex AI Search Configurable Pricing

Example of structured data : Hotel Search

Assumption:

  • 1M documents (catalog items) @ ~100kb each
  • 1500 Query Per Minute monthly subscription
  • 50 Million monthly queries with KPI optimization add on
  • 30 Million monthly queries with Semantic add on
  • 10 Million monthly queries with Core Generative Answers (AI Overviews)

Item

SKU Volume

Rate

Total List Price

Core Subscription - Query Unit

1500 Query Per Minute

* $6.00 / QPM x Month subscription =

$9,000

Core Subscription - Storage Unit

100 GB

* $1/GB x Month subscription =

$100

Add on - Semantic Indexing

100 GB

* $1.50 / GB x Month

$150

Add on - Semantic Queries

30 Million

* $0.75 / 1k queries

$22,500

Add on - KPI & Personalization

50 Million

* $0.20 / 1k queries

$10,000

Add on - Core Generative Answers

10 Million

* $2.00 / 1k queries

$20,000

Index storage pricing

Pricing for Vertex AI Search Index Data Storage

Type

Price (USD)

Index Storage

$0.006849315 / 1 gibibyte hour

* Free quota of 10 GiB per month provided

** Shared across Vertex AI Search

***The index storage cost is applied to the total size of the raw data, sampled regularly and computed as an average for the month. Operations to refresh the data do not result in additional cost.

**** For website data store, storage is calculated as 500 kibibytes (KiB) * "number of pages on website", where 1 KiB is 1,024 bytes. (500 KiB is ~0.000477 GiB; so data indexing pricing for a 1000 page website is $2.38 per month.)

Vertex AI Search for Healthcare pricing

Vertex AI Search for Healthcare provides the ability to quickly build medically tuned Search Engines over healthcare data.

Type

Price (USD)

Healthcare Search

$20.00 / 1,000 count

Vertex AI Search for Healthcare includes some features in Preview such as GenAI answers, streaming updates to the index, and others. These features may be priced differently than the current listed price when they are released to General Availability.

Pricing Example for Healthcare Search

Assumption:

  • 1,000,000 healthcare search requests a month
  • 1,000 GiB of healthcare data indexed

SKU Volume

Rate

Total List Price

1,000,000 searches

$20/1000

$20,000

1,000 GiB

$5/GiB

$5,000

Vertex AI Search for Media pricing

Vertex AI Search for Media enables you to provide highly relevant video results, leveraging Google's query and contextual understanding to improve discovery across your media site.

Type

Price (USD)

Vertex AI Search: Data Index

0 gibibyte month to 10 gibibyte month
$0.00 (Free) / 1 gibibyte hour, per 1 month / account
10 gibibyte month and above
$0.006849315 / 1 gibibyte hour, per 1 month / account

Type

Price (USD)

Vertex AI Search: Media Search API Request Count

$2.00 / 1,000 count

Vertex AI Search for Media Recommendations pricing

The only Media Recommendations operations that incur charges are training, tuning, or requesting predictions by calling the recommend method. There's no charge for importing or managing user events or document information.

Training (per node per hour) costs are charged on a daily basis if your model is actively training or if you have submitted a request to resume training. After you pause or delete a model, you are no longer charged. See the documentation for managing training.

Tuning (per node per hour) costs for active models are charged after the tune completes successfully. You are only charged for an incomplete tune if you pause or delete a model during an ongoing tune. In this case, you are then charged for the node hours that were consumed before the model tuning stopped. See the documentation for managing tuning.

Type

Price (USD)

Predictions requests per month

$0.20 / 1,000 count

Type

Price (USD)

Training and Tuning

$2.50 / 1 hour

Pricing Example for Media Recommendations

Assumption:

  • 1,000,000,000 prediction requests a month
  • Trains a single model per day, which automatically retrains once per day
  • Amounts to about 500 node hours of model training and 100 hours of model tuning per month

SKU Volume

Rate

Total List Price

1b predictions

* $0.20 / 1k predictions =

$200,000

500 node hours (Training)

* $2.50 / hour =

$1,250

100 node hours (Tuning)

* $2.50 / hour =

$250

Total

$201,500

Google Cloud Observability charges

Media Recommendations logs an error to Google Cloud Observability for each API request that results in an error, such as a user event request that contains malformed JSON, or a document import request with a negative price. Media Recommendations also logs an error for every prediction request with a document that is not in the imported datastore.

Google Cloud Observability charges by the GiB of logs stored and for retention beyond the default retention period. For details about the free allotment and data retention, see the Google Cloud Observability pricing page.

The size of the logging data depends on the size of your JSON payload, but a GiB would be approximately 200,000 Media Recommendations errors.

Grounded Generation API pricing

The Grounded Generation API enables you to create generative answers to your prompts using information on Google Search or your own data.

Type

Price (USD)

Input prompt (includes user prompt, system instructions, and inline grounding facts)

Charged at the price of the selected Gemini model.

Output

Charged at the price of the selected Gemini model.

Grounded Generation for grounding on your own retrieved data

$2.50 / 1,000 count

Grounded Generation for grounding on Google Search

See Grounding on Google Search.

The additional charges for data retrieval are determined by the select retrieval system (e.g. Vertex AI Search).

Example #1: Grounding on Vertex AI Search and inline grounding facts

The user uses Vertex AI Search and additional grounding facts to generate grounded answers. Each input prompt is 2,500 characters (including inline grounding facts) and each output prompt 200 characters long. The user has selected Gemini 1.5 Flash.

Volume per request

Price per 1,000 requests

Input prompt

2,500 characters

1,000 requests * $0.000125 per 1,000 characters * 2,500 characters per request = $0.3125 per 1,000 requests

Output

200 characters

1,000 requests * $0.000375 per 1,000 characters * 2 characters per request = $0.075 per 1,000 requests

Grounded Generation for grounding on your own retrieved data

1 request

$2.50 / 1,000 count

Data Retrieval: Vertex AI Search (Enterprise edition)

1 request

$4.00 / 1,000 count

Total: $6.8875 per 1,000 requests

Item

Volume per request

Price per 1,000 requests

Input prompt

500 characters

1,000 requests * $0.000125 per 1,000 characters * 500 characters per request = $0.0625 per 1,000 requests

Output

200 characters

1,000 requests * $0.000375 per 1,000 characters * 200 characters per request = $0.075 per 1,000 requests


Grounded Generation for grounding on Google Search

1 request

0 count to 10,000 count
$0.00 (Free) / 1,000 count, per 1 day / account
10,000 count and above
$35.00 / 1,000 count, per 1 day / account

Total: $35.1375 per 1,000 requests

Check Grounding API pricing

Check grounding provides the ability to determine how grounded a piece of text (the answer candidate) is in a given set of reference texts (the facts).

Type

Price (USD)

Check grounding

$0.00075 / 1,000 count

Document AI feature pricing

For full pricing information of all Document AI features, refer to the Document AI pricing page.

For the Document AI features integrated with and billed through Vertex AI Search, refer to the tables below.

Digitize text

Processor

Price (USD)

Number of pages processed for OCR processor.

0 count to 1,000 count
$0.00 (Free) / 1,000 count, per 1 month / account
1,000 count to 5,000,000 count
$1.50 / 1,000 count, per 1 month / account
5,000,000 count and above
$0.60 / 1,000 count, per 1 month / account

Extract structures and entities from documents

Item

Price (USD)

Layout Parser (Includes initial chunking)

$10.00 / 1,000 count

*The size of a page depends on the file format.:

  • Images (JPEG/JPG, PNG, BMP, HEIF): Each image = 1 page
  • PDF: Each page in the PDF = 1 page
  • TIFF: Each image in the TIFF = 1 page
  • Word (DOCX): Up to 3,000 characters = 1 page
  • Excel (XLSX): Each tab = 1 page
  • Powerpoint (PPTX): Each slide = 1 page
  • HTML: Up to 3,000 characters = 1 page
  • Parsed Documents: Up to 3,000 characters = 1 page

Ranking API pricing

The ranking API takes a list of documents and reranks those documents based on how relevant the documents are to a query.

Compared to embeddings, which look only at the semantic similarity of a document and a query, the ranking API can give you precise scores for how well a document answers a given query.

The ranking API can be used to improve the quality of search results after retrieving an initial set of candidate documents.

Rank documents

Item

Price (USD)

Ranking

$1.00 / 1,000 count

A query is defined as having up to 100 documents, though a user can specify more than 100 documents per query. In the case where more than 100 documents are specified, pricing increases by 1 for every multiple of 100 documents.

  • For example:
  • 132 documents to rank = 2 queries
  • 200 documents to rank = 2 queries
  • 399 documents to rank = 4 queries
  • 401 documents to rank = 5 queries

Request a custom quote

With Google Cloud's pay-as-you-go pricing, you only pay for the services you use. Connect with our sales team to get a custom quote for your organization.
Google Cloud