Release notes

This page documents production updates to Document AI. We recommend that Document AI developers periodically check this list for any new announcements.

You can see the latest product updates for all of Google Cloud on the Google Cloud page, browse and filter all release notes in the Google Cloud console, or you can programmatically access release notes in BigQuery.

To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly: https://cloud.google.com/feeds/duai-release-notes.xml

July 27, 2022

v1beta3 & v1

New Release Candidate (RC) versions for PDAI Invoice and Expense processors - July 2022

We have launched new RC versions of Invoice parser and Expense parser on Jul 15, 2022. These can be accessed in the following way:

  • Invoice parser: pretrained-next-uptrainable
  • Expense parser: pretrained-next

Here are the details about the contents of the RC version updates:

Processor New Languages New Entities
Invoice: pretrained-next-uptrainable Italian, Portuguese, Romanian, Swedish N/A
Expense: pretrained-next Japanese Support for hotel and car rental folios

Payment information entities: Last 4 digits of credit card, payment type

June 30, 2022

v1

VPC Service Control support

Document AI VPC Service Controls provide additional security for your resources and services. To learn more about VPC Service Controls, see the VPC Service Controls overview.

To learn about the limitations when using Document AI with VPC Service Controls, see the supported products and limitations.

June 13, 2022

v1

Document AI is now generally available (GA) in the following new locations:

  • asia-south1 (Mumbai)
  • australia-southeast1 (Sydney)

You must request access to use the new locations. For more information, see Regional and multi-regional support.

v1beta3

New Identity Processor (Preview)

The France Passport Parser is now available in limited preview.

June 10, 2022

v1beta3

The Contract Parser is now more accurate, can extract more fields and supports higher page limits.

June 01, 2022

v1

Identity DocAI General availability (GA) release

The following Identity DocAI processors are now Generally Available (GA).

For more information, see Document AI for Identity.

April 21, 2022

v1

Document OCR processor

The changes from the Google Default Next version have been applied to the Google default version.

The previous Google default version can still be accessed until July 21, 2022 as pretrained-legacy. After July 21, 2022, that version will be removed.

For more information about using different versions of the processor, see Managing processor versions .

For the original announcement of this change, see the January 14, 2022 release note.

April 08, 2022

v1

New Version of Lending W2 Processor

We have released a new Release Candidate version of the W2 Processor. This version is experimental and has the following features:

  • Quality improvement on SSN and EIN fields.
  • Support for box 12 fields, including both codes and values.
  • Fine grained predictions of EmployeeName, EmployeeAddress, and EmployerNameAndAddress which are no longer part of the output and replaced with additional fields.

March 25, 2022

v1

New & Updated processors available

The following Lending DocAI processors are now available for trusted testers. Access to the trusted testers program is limited and granted on a case by case basis. If you would like to be considered please fill out the DocAI Processor Access Request Form:

New Experimental processors to support new document types:

  • Form VA Loan Discharge Statement Processor
  • Form USDA Conditional Statement Processor
  • Form 1017 Processor
  • Form Biweekly Payment Rider Processor
  • Form VBA26 1805 Processor
  • Form VBA26 6393 Processor
  • Form MERS Rider Processor

Updated Experimental processors:

  • Form 4506-T Processor
  • Form 4506-C Processor
  • Form HUD54114 Processor
  • Form HUD92900WS Processor
  • Form HUD92800 Processor
  • Form 1040-NR Processor
  • Form HUD92900LT Processor
  • Form VBA26 8923 Processor
  • Form HUD92900A Processor
  • FORM_1005_PROCESSOR

March 10, 2022

v1

Document AI is now generally available (GA) in the following new locations:

  • europe-west3
  • asia-southeast1

You must request access to use the new locations. For more information, see Regional and multi-regional support.

February 18, 2022

v1

New Versions of Procurement Processors

We have launched a new Google Pretrained version of the following procurement processors with various quality improvements:

The changes from the old Google default next version have been applied to the new Google Pretrained version. The old Google default version is still available and will not be deprecated for at least 180 days.

January 26, 2022

v1beta3 & v1

Enrichment using the Knowledge Graph is now Generally Available.

For more information, see Enterprise Knowledge Graph field enrichment.

January 14, 2022

v1

Document OCR processor

We have updated the Google default next version with quality improvements. Consequently, you have 90 days from today to test the new model before the changes are applied to the Google default version. After that, the original Google default version will be available for another 90 days as legacy. For more information about the processor and its versions, see the Document OCR processor.

For more information about using different versions of the processor, see Managing processor versions.

For the original announcement of this change, see the November 5, 2021 release note.

December 15, 2021

v1beta3 & v1

New Lending Processors (Preview)

The following new processors are now available in limited preview:

New Versions of Lending Processors

We have launched new versions of the following lending processors.

These new versions use a new lending document splitting and classification model with improved quality and support for more document types. For more information, see the Document types identified by the Lending Splitter & Classifier.

November 10, 2021

v1

We have lowered the price for many processors. For more information, see the Pricing page.

November 05, 2021

v1

The following procurement processors are now publicly accessible:

We have release a new version of the Document OCR Processor called Google default next. This version changes the distribution of confidence scores in the response. You have 90 days from today to test the new model before the changes are applied to the Google default version . After that event, the original version will still be available for another 90 days as legacy. For more information about using different versions of the processor, see Managing processor versions.

New Lending Processor (Preview)

The Mortgage statement parser is now available in limited preview.

October 15, 2021

v1beta3

Contract DocAI (Preview) released

The Contract parser is now available.

October 06, 2021

v1

Document AI is now generally available (GA) in the following new locations:

  • europe-west2
  • northamerica-northeast1

You must request access to use the new locations. For more information, see Regional and multi-regional support.

September 01, 2021

v1

Document AI now supports Data Residency, VPC-SC, Access Transparency, and CMEK.

August 20, 2021

v1

Managing processor versions

You can now switch between different versions of a processor. For more information, see Managing processor versions.

New processor versions

We have added new versions of the following processors:

  • Bank statement parser: improved model quality
  • Pay slip parser: improved model quality and extraction of three additional fields: net_pay, net_pay_ytd, and employee_account_number.

New Lending DocAI processors

The following Lending DocAI (LDAI) processors are now available in limited Preview:

  • 1065 parser
  • 1099-NEC parser
  • 1099-R parser
  • 1120 parser
  • 1120-S parser
  • SSA-1099 parser

Additionally, the LDAI Document Splitter and Classifier has been updated to support the new LDAI processors as well as the following processors:

  • US Driver License Parser
  • US Passport Parser

Human in the Loop (HITL) support for Lending DocAI processors

The following Lending DocAI processors now support Human in the Loop (HITL):

  • 1003 parser
  • 1040 Parser
  • 1040 Schedule C parser
  • 1040 Schedule E parser
  • 1099-DIV parser
  • 1099-G parser
  • 1099-INT parser
  • 1099-MISC parser
  • Bank Statement parser
  • Pay Stub parser
  • W2 parser
  • W9 parser

Knowledge Graph support

The following processors now support Knowledge Graph enrichment:

  • Bank Statement
  • Pay Slip
  • W2 Parser
  • W9 Parser

July 30, 2021

v1

The Invoice Parser now extracts a new field invoice_type that indicates the type of the input document.

July 02, 2021

v1

Change in processor documentation

The location of individual processor information has changed. You can now find individual processor documentation for all solutions (General, Procurement, Lending) in the following locations:

Human in the Loop (HITL) now supports priority queues for each processor, based on the urgency of each document. For more information, see HITL.

June 09, 2021

v1

VPC Service Controls

Integration with Document AI VPC Service Controls is now generally available.

April 09, 2021

v1

Procurement DocAI General availability (GA) release

Procurement DocAI (PDAI) solution is now available in private General Availability (GA).

This includes the following processors:

Human in the Loop (HITL) support for Procurement DocAI processors

Procurement DocAI processors now support Human in the Loop (HITL) AI platform functionality supporting human revisions of predictions.

Invoice parser behavior update

The invoice parser behavior has been updated to include the following features:

  • Offers extended support for the following languages (in addition to English):
    • French
    • Dutch
    • German
    • Spanish
  • Improves supplier parsing accuracy with Knowledge Graph support.
  • Improves prediction quality (accuracy).
  • Extends the header and line item fields extracted by the parser.
  • Increased the number of pages for online processing (10 pages) and offline processing (200 pages).
  • Increased the number of documents per batch in offline processing (50 documents).

Expense parser (Receipt parser) behavior update

The expense parser behavior has been updated to include the following features:

  • Renamed Receipt parser to Expense parser.
  • Improved prediction quality.
  • Improved prediction quality for English, French, and Dutch for more expense types (for example hotel statements).

Human in the Loop (HITL) AI General Availability (GA) released

HITL AI is now available in Private General Availability (GA) for human review of Invoice, Expense, and Utility parser predictions.

Features:

  • HITL configuration enhanced to designate which fields need review and whether a field is mandatory, saving review time.
  • Labeler UI highlights the fields below a confidence score and supports single-click confirmation to improve review efficiency.
  • Labeling Manager shows analytics and metrics by task and by labeler to streamline HITL operations.

April 02, 2021

v1

Lending DocAI General Availability (GA) released

Lending DocAI is now General Availability. See the documentation for more information.

Lending DocAI processors added

The following Lending DocAI processors are now available:

March 31, 2021

v1

Document AI General availability (GA) released

Document AI is now General Availability (GA).

January 14, 2021

v1beta3

New Procurement DocAI processor released in limited Preview

The following Procurement DocAI processor is now available in limited Preview:

  • Procurement document splitter

For more information, see the processor documentation.

January 11, 2021

v1beta3

Lending processors behavior update

The behavior of the following processors has been updated:

  • 1003 parser
  • 1040 parser
  • 1099-MISC parser
  • W2 parser
  • W9 parser

Now, if these processors are given a multi-page input file and contains a page that is the correct document type and one of the supported versions the processor performs entity extraction for that page; subsequent applicable pages will not be processed. If the prcoessor doesn't find any applicable documents in the input file it returns an error message.

October 29, 2020

v1beta3

Document AI Preview released

The following beta and preview features are available in API version v1beta3:

  • Procurement DocAI processors: Invoice parser and receipt parser.

October 16, 2020

v1beta3

Document AI Preview released

The following beta and preview features are available in API version v1beta3:

  • General processors: Document OCR (Optical Character Recognition), form parser, and document splitter.
  • Lending processors: W9, 1040, W2, 1099-MISC, and 1003 parsers, as well as lending document splitter & classifier.

uri field unavailable

  • Sending a request with the uri field is currently not supported for v1beta3. Any updates to the availability of the uri field will be announced here.

Workaround: Send requests with image information in the content field (base64 encoded information).

August 24, 2020

v1beta2

Form Parser model updates

The Form Parser model has been updated. The model update includes the following features:

  • Improved OCR quality for English detection.
  • Improved key-value pair, checkbox, and table parsing detection quality, particularly for rotated images and handwritten text.
  • Decreased latency for complex tables.

August 20, 2020

v1beta2

Invoice Parsing updates

  • Document AI now supports normalized values for certain entities returned from Invoice Parsing requests.
  • We have improved confidence scores for entities returned from Invoice Parsing requests.

July 04, 2020

v1beta2

Invoice Parsing Beta model upgrade

The Invoice Parsing Beta model has been upgraded. This model upgrade results in higher quality results for the entities and entityRelations. There is no API change.

See the product documentation for more information.

April 14, 2020

v1beta2

Document AI Beta released

The following beta features are available in API version v1beta2:

  • Document processing: You can use the API to parse forms or tables from PDF, TIFF, or GIF documents.
  • Regional support: The API now offers multi-regional support (us and eu) for all features. Using a multi-region endpoint enables you to configure the API to store and process your data in the United States or European Union.

Invoice processing Beta

  • Invoice processing is now available as a restricted feature. See Parsing invoices for more information.