Agent Search on Gemini Enterprise Agent Platform (formerly Vertex AI Search) helps developers build Google-quality search experiences for websites, structured and unstructured data. It also provides an out-of-the-box grounding system and DIY grounding APIs for building generative AI agents and apps.
Overview
Agent Search is a Google Search quality information retrieval and answer generation system that can be a component of any generative AI application that uses your enterprise data.
There are two key opportunities for enterprises to use Agent Search:
The first is to improve the quality of search experiences across your intranet and customer facing websites. With Agent Search, you can go from frustrating keyword matching to modern conversational search experiences similar to Google’s new generative search experience. This can be as easy as adding a search widget to your webpage.
The second opportunity is to improve the quality of your generative AI applications by grounding them in your enterprise data using Agent Search. Here Gemini Enterprise Agent Platform serves as an out-of-the-box system for retrieval augmented generation or RAG.
Yes, you can. Today, there is a lot of excitement about RAG, an architecture that combines LLMs with a data retrieval system, or in other words, a search engine. By grounding LLM responses in your company's own data, it ensures improved accuracy, reliability, and relevance, something that's critical for real-world business applications. You could build your own retrieval augmented generation-based Search but this can be a highly complex process. Agent Search functions as an out-of-the-box RAG system for information retrieval. Under the hood with Agent Search, we’ve simplified the end-to-end search and discovery process of managing ETL, OCR, chunking, embedding, indexing, storing, input cleaning, schema adjustments, information retrieval, and summarization to just a few clicks. This makes it super easy for you to build RAG-powered apps using Agent Search as your retrieval engine.
Yes, Agent Search has specialized offerings tuned for unique industry requirements like searching product catalogs, media libraries, and clinical data repositories. Commerce search on Gemini Enterprise for Customer Experience offers retailers the ability to improve the search, product recommendations, and browsing experience on their channels. Agent Search for media offers media and entertainment companies the ability to provide more personalized content recommendations powered by generative AI, increasing consumer time spent on their platforms, which can lead to higher engagement, revenue, and retention. Agent Search for healthcare and life sciences is a medically tuned search that improves patient and provider experience.
Developing a well-functioning RAG system for DIY grounding can be complex. To address this, Agent Platform offers a comprehensive set of APIs that help developers create bespoke DIY solutions and maintain them. These APIs expose the underlying components of Agent Search's out-of-the-box RAG system, empowering developers to address custom use cases or serve customers who want granular control. These include the Document AI Layout Parser API, ranking API, grounded generation API, and check grounding API.
Agent Search lets organizations and developers set up search engines out of the box. These search engines offer adequate customization for most enterprise needs and even offer automatic fine-tuning for embeddings. In some cases, you may have custom embeddings, and Agent Search works fine with your own embeddings. However, more advanced developers who need direct control of a highly performant vector database to power niche use cases like recommendations and ad serving can use Vector Search, the vector database used by Agent Search as a component for their use cases. We’ve recently updated Vector Search’s user experience so developers can create and deploy indexes without coding. We’ve also significantly reduced indexing latency from hours to minutes for smaller datasets.
Agent Search makes it significantly easier for you to build high-quality, AI-powered search experiences into your applications. It is built on Google’s deep expertise and decades of experience in semantic search and so provides more relevant search results. This improves the quality of information retrieval for apps that use your enterprise data. Customization options let you tailor the search experience to your specific needs, while robust enterprise-grade features take care of scalability, privacy, and governance. For more specialized use cases, Agent Search offers vertical specific offerings for retail, media, healthcare, and DIY vector search capabilities.
Agent Search is underpinned by a variety of Google Search technologies, including semantic search, which helps deliver more relevant results than traditional keyword-based search techniques by using natural language processing and machine learning techniques to infer relationships within the content and intent from the user’s query input. Agent Search also benefits from Google’s expertise in understanding how users search and factors in content relevance to order displayed results.
Agent Search is now generally available. You can access via the Google Cloud Console. Please don't hesitate to contact your Google Cloud sales team for assistance or access to preview features.
Agent Search is powered by foundation models. This means you can offer your customers multi-turn (the ability to easily ask follow-up questions), multimodal (search using images in addition to text), immersive search experiences that are similar to Google's search generative experience. Your customers or employees can view crisp summaries on top of search results with citations and links to data sources that help in knowledge discovery.
Yes, when you use Agent Search from Google Cloud, your data is secure in your cloud instance. Google does not access or use your data to train models or for any other purpose you have not explicitly authorized. Agent Search also meets specific industry compliance standards like HIPAA, ISO 27000-series, and SOC -1/2/3. We’re expanding support for access transparency to provide customers with awareness of Googler administrative access to their data. Virtual Private Cloud Service Controls prevent customers or employees from infiltrating or exfiltrating data. We are also offering Customer-managed Encryption Keys (CMEK) in Preview, allowing customers to encrypt their core content with their own encryption keys.
All search results from Agent Search are grounded to your enterprise data or applications you have provided access to. Google Cloud offers grounding out of the box for search results in applications built using Agent Search. Further, Agent Search offers citations and links for summaries generated, which means information presented can be verified by users. You have full control in determining what data sources are used and you can even program responses for off-topic questions.
Agent Search can connect to your first-party, Google, and third-party applications through Gemini Enterprise Agent Platform extensions and data connectors. Agent Platform's extensions help in ingesting data and drive transactions on the users' behalf while data connectors ingest data with read-only access to key applications like Jira, Confluence, and Salesforce. Together, Agent Platform extensions and data connectors ensure your data is fresh across your search engines.
How It Works
Your organization may have terabytes of data; and organizing it to be easily found can be one of the most challenging problems to solve. You could also have a public-facing website and need high-quality search for your customers. For both cases, you can use Agent Search to create search engines. Watch this video and discover how to make an internal search app with minimal coding and minimal setup.
Agent Search brings together the power of deep information retrieval, state-of-the-art natural language processing, and the latest in large language processing to understand user intent and return the most relevant results for the user.
This tutorial explains how to create three search apps, one for each type of data: website, structured data, and unstructured data.
Watch the video on the right for an overview and head over to the product documentation link below for detailed instructions.
Agent Search brings together the power of deep information retrieval, state-of-the-art natural language processing, and the latest in large language processing to understand user intent and return the most relevant results for the user.
This tutorial explains how to create three search apps, one for each type of data: website, structured data, and unstructured data.
Watch the video on the right for an overview and head over to the product documentation link below for detailed instructions.
Find similar things in seconds, even with billions of items. Vector Search unlocks powerful semantic matching for recommendations, chatbots, and more. Let's see how to build a recommendation engine with Vector Search:
Find similar things in seconds, even with billions of items. Vector Search unlocks powerful semantic matching for recommendations, chatbots, and more. Let's see how to build a recommendation engine with Vector Search:
Searching data in healthcare can be a difficult task due to the complexities of medical terminology and data standardization.
Agent Search uses its medical tuning to find relevant information from structured and unstructured patient records. It understands medical abbreviations like "abx" and can answer questions with MedLM to provide generative AI answers grounded on patient data. The product integrates with Healthcare Data Engine for a seamless experience.
Searching data in healthcare can be a difficult task due to the complexities of medical terminology and data standardization.
Agent Search uses its medical tuning to find relevant information from structured and unstructured patient records. It understands medical abbreviations like "abx" and can answer questions with MedLM to provide generative AI answers grounded on patient data. The product integrates with Healthcare Data Engine for a seamless experience.