Quickstart guide for GenAI search

Follow this guide to know how to work with the GenAI search feature and provison your project to access advanced features.

Prerequisite: allowlist Google Cloud project

We recommended you use a project that has never provisioned Document AI Warehouse. You can follow this guide to create a new project.

Private preview features such as GenAI search require allowlisting.

For GenAI search, most TTP projects are already allowlisted. If you're in the TTP program and need to provision Document AI Warehouse for GenAI search in a different project:

For other private preview features, such as the retention folders for records management and legal holds, contact your account manager for the allowlisting process and form.

Provision Document AI Warehouse in a brand new project

  1. Open Document AI Warehouse:

      https://console.cloud.google.com/ai/docai-warehouse?project=<ID_PROJECT>
    
  2. On the provisioning page, select US (United States), where it's currently available.

  3. Enable the Document AI Warehouse API.

  4. Select an access control (ACL) mode and enable GenAI. When choosing ACL mode, you can select either:

    • Document-level access control with users in Cloud Identity.
    • Universal access: no document-level access control.

    Ensure Question & Answering is enabled:

  5. Then click Provision to start provisioning your project. It takes three to five minutes to set up the instance.

  6. Skip the step to configure web application.

  7. Create a default document schema that can be used for OCR extracted PDF or TXT files. It contains the raw text field for indexing but doesn't contain properties.

  8. Click Done to complete.

  9. Click Get Started to go to the document view.

Give users access to your project

Allowlist your users for access to GenAI in search UI

Add the Google Cloud IAM based emails for your users to this Google group, so the users are allowlisted to use the GenAI search UI.

Configure Schemas for structured documents (Optional)

If you have structured document types (for example invoices, tax forms, other structured document types) and plan to extract entities from your documents using specialized or custom Document AI Warehouse processors, you can setup complex schemas where the extracted Document AI entities can be mapped to "properties" in these schemas.

This should be done before starting the bulk upload job, which runs Document AI Warehouse processors to extract entities, maps entities to text, date, numeric, and enum properties in the schema you create and indexes these properties. This enables you to search or filter documents based on property values.

Appendix

Enable GenAI in a provisioned project

  • In the final view of the provisioning, or in the settings tab, go to show details.

  • Go to step 2, scroll down, and find the question and answer section. Mark the checkbox and update.

  • It takes up to 5 minutes for the update to finish so do not close the window during the process. Once it is finished, the QA is enabled.

Configure Document AI Warehouse web application (OneStack UI)

The web application is at a different URL (not in Google Cloud console) and is needed only if you would like to use the advanced features including:

  • Folders to organize documents.
  • Edit document properties.
  • Configure policies, rules, or other advanced features.

Follow the configure web application guide to set up the web application.

Next steps