Document AI Warehouse release notes

This page documents production updates to Document AI Warehouse. You can periodically check this page for announcements about new or updated features, bug fixes, known issues, and deprecated functionality.

You can see the latest product updates for all of Google Cloud on the Google Cloud page, browse and filter all release notes in the Google Cloud console, or programmatically access release notes in BigQuery.

To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly.

January 16, 2024

Document AI Warehouse is deprecated and will no longer be available on Google Cloud after January 16, 2025. To safeguard your data, migrate any documents currently saved in Document AI Warehouse to an alternative like Cloud Storage. Ensure that your data migration is completed before the out of service date to prevent any data loss. See Deprecations for details.

November 02, 2023

Grant the documentSchemaViewer, documentCreator, and documentViewer roles the contentwarehouse.googleapis.com/locations.getStatus permission. This change enables the UI to render correctly, and does not change the security posture of these roles.

October 10, 2023

A delete operation cannot succeed when the raw document is missing.

October 02, 2023

Support root folder filtering

September 25, 2023

Add the doc_ai_document_type and doc_ai_document_path fields to the Pipeline API Cloud Function request.

Support "store_document_metadata_only" boolean flag in the Pipeline API Cloud Function response.
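As a rough illustration of how these fields might be used together, the following sketch shows a handler that reads doc_ai_document_type and doc_ai_document_path from the request and sets store_document_metadata_only in the response. Only the three field names come from the notes above. The function name, the dictionary-shaped payloads, and the "archive_scan" document type are assumptions made for this example, not the documented Pipeline API contract.

```python
from typing import Any, Dict


def handle_pipeline_request(request: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical handler: decide whether to store only document metadata."""
    # Field names come from the release notes; their types are assumed to be strings.
    doc_type = request.get("doc_ai_document_type", "")
    doc_path = request.get("doc_ai_document_path", "")

    # Assumed business rule for illustration: archived scans keep metadata only.
    metadata_only = doc_type == "archive_scan"

    # Logging the path is illustrative; a real function might inspect the file instead.
    print(f"type={doc_type!r} path={doc_path!r} metadata_only={metadata_only}")

    return {"store_document_metadata_only": metadata_only}
```

The decision rule is deliberately trivial. In practice it would depend on whatever criteria determine when the full document content does not need to be stored.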

September 11, 2023

Validate the resource locations for all Document Warehouse API requests.

August 11, 2023

Pipelines support linking documents to a given folder during data ingestion.

July 13, 2023

Enable GenAI search (private preview) to support large documents up to 25K words.

June 23, 2023

Fixed the bug where plain_text is overwritten during document creation.

June 16, 2023

Add support to filter for empty integer and float typed properties.

April 19, 2023

Added the skip_ingested_documents flag in the Cloud Storage Ingest Pipelines to skip ingested documents.

Fixed the bug where the Q&A search returned an error when the search result was empty.

April 07, 2023

Datetime properties filtering is supported in the Document AI Warehouse UI.

March 29, 2023

Allow users to upload and view TIFF file types in the UI.

March 28, 2023

BigQuery Connector (preview): Supports batch exports of document metadata into BigQuery, which lets users analyze data and create reports and dashboards, for example, data visualization using BI dashboards.

March 22, 2023

Policy Engine:

  • Modify the RuleSet API logic to auto-populate the RuleId field during a create RuleSet call and to allow Rule updates using an existing RuleId.
  • Publish action messages now include the Schema name, Document name, RuleSet name, Rule Id, Action Id, and trigger type by default.

February 23, 2023

TIFF file UI rendering support: when the GetDocument API is called for a TIFF file, the API returns a converted PNG image inside the cloud_ai_document field.

Users do not need to grant the Cloud Storage roles to the Document AI Warehouse service account during the provisioning process.

January 31, 2023

Add a field to mark the raw document file type as TIFF.

Document table filter and text search state are synced with the URL to allow users to easily save and share filter settings.

January 09, 2023

Allow users to set Project ACLs with conditions based on document schema ID or property name.

Fix the exception thrown when customers type a non-existent project number.

December 20, 2022

Update the GetOperation API to remove empty response strings in pending operations.

December 13, 2022

The text extraction flag (text_extraction_disabled) will be deprecated. Customers can still set it in the client library, but it will be a no-op on the server side. A new flag (text_extraction_enabled) replaces it, and the UI will have a new button that lets customers control this behavior.

CreateDocument response will only return document metadata instead of the whole document.

Mark async_enabled field as internal only.

December 08, 2022

The AutoProvision service returns an operation ID for operations that complete immediately.

Enable operation service for Document AI Warehouse v1 service.

Partially support Google AIP-160 syntax (https://google.aip.dev/160) in search queries. Search queries now support literals, logical operators, negation operators, comparison operators, and functions.
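To give a feel for the AIP-160 constructs listed above, the following sketch composes filter strings from comparisons, logical operators, and negation. The helper names and the property names (vendor, amount) are hypothetical. Because support is only partial, whether the service accepts a particular expression is not confirmed here.

```python
import json
from typing import Iterable, Union

Literal = Union[str, int, float, bool]


def compare(field: str, op: str, value: Literal) -> str:
    """Render a comparison such as `amount >= 100` or `vendor = "Acme"`."""
    if op not in {"=", "!=", "<", "<=", ">", ">="}:
        raise ValueError(f"unsupported operator: {op}")
    # json.dumps double-quotes and escapes strings; numbers are rendered bare.
    return f"{field} {op} {json.dumps(value)}"


def all_of(parts: Iterable[str]) -> str:
    """Join sub-expressions with AND, parenthesizing each one."""
    return " AND ".join(f"({p})" for p in parts)


def any_of(parts: Iterable[str]) -> str:
    """Join sub-expressions with OR, parenthesizing each one."""
    return " OR ".join(f"({p})" for p in parts)


def negate(expr: str) -> str:
    """Negate a sub-expression with the NOT operator."""
    return f"NOT ({expr})"


query = all_of([
    compare("vendor", "=", "Acme"),
    negate(compare("amount", "<", 100)),
])
print(query)  # (vendor = "Acme") AND (NOT (amount < 100))
```

Each sub-expression is parenthesized because AIP-160 gives OR higher precedence than AND, which differs from most programming languages. Explicit grouping avoids relying on that rule.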

Fix operation status fetching for pending AutoProvision long running operations.

Disable text extraction for Google Cloud Storage based ingestion.

VPC Service Controls for Document AI Warehouse are publicly supported.

November 17, 2022

Fixed the bug where users were unable to preview documents created using Doc AI processors.

November 10, 2022

Enable the validation check for Enum property values by default. Enum values that are not defined in the schema can no longer be set on the corresponding document property Enum fields. The validationCheckDisabled flag in EnumTypeOptions disables the ENUM validation.
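The behavior described above can be sketched as a simple membership check. This is a conceptual illustration only, not the service's implementation. The function name and the snake_case validation_check_disabled parameter are assumptions that mirror the validationCheckDisabled flag.

```python
from typing import Iterable, List


def invalid_enum_values(
    values: Iterable[str],
    allowed: Iterable[str],
    validation_check_disabled: bool = False,
) -> List[str]:
    """Return the values that would be rejected under enum validation."""
    # When the flag is set, validation is skipped and nothing is rejected.
    if validation_check_disabled:
        return []
    allowed_set = set(allowed)
    # Preserve input order so callers can report the offending values as given.
    return [v for v in values if v not in allowed_set]


schema_values = ["PAID", "OPEN"]  # hypothetical enum values defined in a schema
print(invalid_enum_values(["PAID", "LOST"], schema_values))        # ['LOST']
print(invalid_enum_values(["PAID", "LOST"], schema_values, True))  # []
```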

Enable text extraction feature.

Fix partial document update which could cause loss of raw document name entry.

Fix plain_text unintended movement in API response messages.

Fix an issue where the service returned an error when a user supplied multiple property filters for the same schema in the search query.

October 17, 2022

Add a validationCheckDisabled flag in EnumTypeOptions to disable ENUM Validation. The default behavior is unchanged (no validation on ENUM values is enforced).

Add "LIKE" support in search conditions against text properties. Support search word stemming, semantic search and shuffled ordering of search query strings that run against a text property.

October 10, 2022

Launched Document AI Warehouse V1 APIs.