This page explains the pricing and billing for Vertex AI RAG Engine. This page covers the following topics: While Vertex AI RAG Engine itself has no additional charge, you are billed for the underlying Google Cloud components that you use, such as models, reranking, and vector storage. For more information, see the Vertex AI RAG Engine overview. You are billed for the underlying Google Cloud components that you use with Vertex AI RAG Engine. You can ingest data from various sources, such as local file uploads, Cloud Storage, and Google Drive. Accessing files from these sources is free, but the data sources might charge for data transfer, such as data egress costs. File parsing You can choose from the following file parsers: File chunking You can use fixed-size chunking, which is free. When you generate embeddings, Vertex AI RAG Engine uses the embedding model that you specify. You are billed for the costs associated with that model. For more pricing information, see Cost of building and deploying AI models in Vertex AI. For vector search, you can use one of two types of vector databases: RAG-managed database: A fully managed vector database solution. This database serves two purposes: A RAG-managed database uses a Spanner instance as its backend. For each project, Vertex AI RAG Engine provisions a customer-specific Google Cloud project to manage these resources. This ensures that your data is physically isolated. If you choose the If any RAG corpus in your project chooses to use a RAG-managed database for the vector search, you will be charged for the RAG-managed Spanner instance. Vertex AI RAG Engine surfaces Spanner costs from your corresponding RAG-managed project to your Google Cloud project, so that you can see and pay Spanner instance costs. For more pricing details, see Spanner pricing. Bring-Your-Own (BYO) vector database: You can use an existing vector database, such as Vector Search. You are responsible for provisioning, managing, and billing for your vector database. Vertex AI RAG Engine does not charge for this integration. After initial retrieval, you can use one of the following reranking tools to improve search result relevance:
Pricing and billing
Data ingestion
Data transformation
Embedding generation
Data indexing and retrieval
RagManagedDB
Basic tier or Scaled tier, Vertex AI RAG Engine provisions a Spanner Enterprise edition instance in the corresponding project:
Reranking for Vertex AI RAG Engine
What's next
Vertex AI RAG Engine billing
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-23 UTC.