Release notes

This page documents production updates to Document AI. We recommend that Document AI developers periodically check this list for any new announcements.

You can see the latest product updates for all of Google Cloud on the Google Cloud page, browse and filter all release notes in the Google Cloud console, or you can programmatically access release notes in BigQuery.

To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly: https://cloud.google.com/feeds/duai-release-notes.xml

November 21, 2022

v1beta3

Expense Parser Releases

As of November 18, 2022, for the Expense Parser, we have promoted our v1.3 Release Candidate version to a Stable version so that more customers can use it confidently. 

New Stable version

Features in the new Stable Expense Parser, pretrained-expense-v1.3-2022-07-15:

  • Support for a new language, Japanese, which has been requested by multiple customers.

  • Better entity performance

  • Addition of 3 new entity types (line_item/quantity, payment_type, credit_card_last_four_digits)

  • Better support for hotel and car-rental related expenses 

New Release Candidate version

Along with this Stable version, we are also launching a new Release Candidate version of the Expense Parser, pretrained-expense-v1.4-2022-11-18, with the following new features, in addition to the features in the Stable version:

  • Improvements to overall performance

  • Support for two (2) new languages, Italian and Portuguese

  • Support for Uptraining to improve or add/remove entities in the schema

  • Support for Uptraining to add support for unsupported languages

  • Addition of 3 new entity types (traveler_name, reservation_id, line_item/transaction_date)

  • Maximum pages (online/synchronous requests) limit has been increased to 15.

Deprecation of the old Stable version

The pretrained-expense-v1.1-2021-04-09 version of the Expense Parser will be deprecated following this release. 

Invoice Parser Updates

The previous Stable Invoice processor version, pretrained-invoice-v1.1-2021-04-09, is deprecated as of November 22, 2022.

The Invoice Parser, for v1.3 and v1.4, now has the following quotas and limits:

  • Maximum pages (online/synchronous requests): 15
  • Maximum pages (batch/offline/asynchronous requests): 200
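
The page limits above can be checked before a document is sent. The following is a minimal sketch, assuming the caller already knows the page count; choose_request_mode and INVOICE_PAGE_LIMITS are hypothetical names for illustration and are not part of the Document AI client library.

```python
# Page limits quoted in this release note for Invoice Parser v1.3 and v1.4.
INVOICE_PAGE_LIMITS = {"online": 15, "batch": 200}


def choose_request_mode(page_count: int, limits: dict = INVOICE_PAGE_LIMITS) -> str:
    """Return "online" or "batch", or raise if the document exceeds both limits.

    Hypothetical helper for illustration only.
    """
    if page_count < 1:
        raise ValueError("page_count must be at least 1")
    if page_count <= limits["online"]:
        return "online"
    if page_count <= limits["batch"]:
        return "batch"
    raise ValueError(
        f"{page_count} pages exceeds the batch limit of {limits['batch']}; "
        "split the document first"
    )


if __name__ == "__main__":
    for pages in (3, 15, 16, 200):
        print(pages, "->", choose_request_mode(pages))
```

Documents that fit the online limit are routed to a synchronous request; larger ones fall back to batch processing, and anything over the batch limit is rejected early.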

November 16, 2022

v1beta3

The Identity Document Proofing Processor is now available in Public Preview.

The Identity Document Proofing Processor is designed to help predict the validity of ID documents with four different signals:

  • is_identity_document detection: predict whether an image contains a recognized identity document.
  • suspicious_words detection: predict whether words are present that aren't typical on IDs.
  • image_manipulation detection: predict whether the image was altered or tampered via an image editing tool.
  • online_duplicate detection: predict whether the image can be found online.

November 11, 2022

v1beta3

New stable W2 processor version with the following enhancements:

  • Breaks down long entities such as addresses into fine-grained sub-entities: StreetAddressOrPostalBox, AdditionalStreetAddressOrPostalBox, City, State, and Zip. This modification improves both accuracy and entity specificity.
  • Handles wider variations of W2 forms including multi-copies (2,3,4-ups) issued by various payroll vendors.
  • Introduces 8 new entities for Box 12 that represent both codes and values.

New stable Payslip processor version with the following enhancements:

  • Bonus, commissions, holiday, overtime, regular pay and vacation are now part of earning_item/earning_this_period and earning_item/earning_ytd. Captures all types of earnings beyond those categories, and maps them to their respective earning rates, hours and pay (both for the period and year-to-date).
  • Returns year-to-date and current period taxes and deductions.
  • Direct deposits are linked to the corresponding bank account numbers.
  • Returns page numbers, state and federal tax exemptions and filing statuses.

October 31, 2022

v1

A new Release Candidate (RC) version of the Document OCR Processor, pretrained-ocr-v1.1-2022-09-12, is available in the US and EU. This RC can detect document defects.

  • If the document is considered to be defective, the API now returns the same 5 document defect types supported by the Intelligent Document Quality Processor:
    • quality/defect_blurry
    • quality/defect_noisy
    • quality/defect_dark
    • quality/defect_faint
    • quality/defect_text_too_small
  • In addition, it now supports 3 more defect types:
    • quality/defect_document_cutoff
    • quality/defect_text_cutoff
    • quality/defect_glare
  • The defect detection results are in the image_quality_scores field on the Page object in the returned JSON. This additional feature adds latency comparable to OCR processing to the process call.
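
As an illustration of reading these results, here is a minimal sketch that lists the defects reported for one page. The JSON layout is an assumption, not something this note confirms: it assumes each page carries an imageQualityScores (or image_quality_scores) object with a detectedDefects list of type and confidence pairs. The sample data is hand-written, not real API output, and defects_above is a hypothetical helper.

```python
from typing import Dict, List


def defects_above(page: Dict, threshold: float = 0.5) -> List[str]:
    """List defect types on one page whose confidence meets the threshold.

    Assumed layout (hypothetical, check the API reference):
    page["imageQualityScores"]["detectedDefects"] is a list of
    {"type": "quality/defect_...", "confidence": float}.
    """
    # Accept either camelCase (JSON) or snake_case field naming.
    scores = page.get("imageQualityScores") or page.get("image_quality_scores") or {}
    defects = scores.get("detectedDefects") or scores.get("detected_defects") or []
    return [d["type"] for d in defects if d.get("confidence", 0.0) >= threshold]


# Hand-written sample page, not real API output.
sample_page = {
    "pageNumber": 1,
    "imageQualityScores": {
        "qualityScore": 0.42,
        "detectedDefects": [
            {"type": "quality/defect_blurry", "confidence": 0.91},
            {"type": "quality/defect_glare", "confidence": 0.12},
        ],
    },
}

if __name__ == "__main__":
    print(defects_above(sample_page))
```

The threshold is a caller-side choice; a page without quality scores simply yields an empty list.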

October 21, 2022

v1beta3

As of October 21, 2022, we have promoted our v1.3 Release Candidate version to a Stable version of the Invoice processor.

Features in the new Stable Invoice processor, version pretrained-invoice-v1.3-2022-07-15:

  • Out-of-the-box support for 7 new languages: Italian, Portuguese, Romanian, Swedish, Estonian, Latvian, Lithuanian

  • Support for uptraining using the newly launched Document AI Workbench. See Uptrain a specialized processor.

  • Improvements to Currency and Date Normalization.

  • Improvements to Line Items extraction.

  • Quotas and limits

    • Maximum pages (online/synchronous requests): 15
    • Maximum pages (batch/offline/asynchronous requests): 200
    • Regional availability
      • US (Multi-region), Europe (Multi-region)

Alongside this stable version, we are launching a new release candidate version, pretrained-invoice-v1.4-2022-10-21, to which new features will be added.

October 10, 2022

v1beta3

Known issue (Document Labeling)

  • If you delete one or more documents, and these documents selected for deletion are all associated with an active labeling job, then all documents in that dataset will also be deleted, even if you did not select them for deletion. This is true regardless of the number of documents selected.

    Workaround: Do not delete documents during an active labeling job. You can track active labeling jobs on the Dataset management page, under the category Labeling tasks, located on the right side of the page. If you absolutely must delete documents during an active labeling job, ensure that you also select at least one document that is NOT part of this active labeling job. Then, only the non-associated documents will be deleted, and the remaining documents in the dataset will be preserved.

September 29, 2022

v1beta3

This launch upgrades the lifecycle stage of the Custom Document Extractor (CDE) component of the DocAI Workbench from Private Preview to Public Preview. CDE covers essential workflows for developing custom document extraction processors with E2E UI support:

  • Data import
  • Schema creation and annotation
  • Processor model training
  • Evaluation and troubleshooting
  • Model deployment and version management
  • Human-in-the-loop (HITL) integration for "last-mile" processor quality assurance

Notable new Public Preview Custom Document Extractor (CDE) features include:

  • Progressive data import
  • Direct import of annotated .json files as training or test datasets
  • Data labeling platform integration
  • Dataset export with metadata preserved
  • Auto-labeling using a trained processor version at import to minimize manual annotation efforts
  • Tabular entity end-to-end support
  • Checkbox boolean annotation, training, and extraction
  • Fuzzy matching for more flexible model evaluation
  • Exportable / downloadable model evaluation metrics

The following features have been upgraded:

  • Data import
  • Schema creation and annotation
  • Processor model training
  • Evaluation and troubleshooting
  • Model deployment and version management
  • Human-in-the-loop (HITL) integration for "last-mile" processor quality assurance

Known issues

  • During labeling, checkboxes default to a state that does not reflect the selected or unselected states in the corresponding documents.

    Workaround: Label all checkboxes within a schema, regardless of whether they are selected or unselected, for optimal checkbox extraction quality and accurate evaluation. When annotating a checkbox by drawing its bounding box, all checkboxes in the CDE annotation user interface will default to an unselected state, and you must manually update the selected or unselected state as necessary.

  • If a processor is deleted when there is an active labeling task ongoing, the task does not stop automatically. Labelers and Labeler Managers will still see the task in their labeling and manager consoles.

    Workaround: Cancel the active labeling task before you delete the processor. Otherwise, navigate to the manager console to pause or delete the corresponding task queue. Any documents labeled from that task after the processor is deleted will be unretrievable.

  • Some discrepancies might exist between pretrained and uptrained versions of a processor.

  • On the Evaluate & Test tab, the evaluation does not show as complete until you click Refresh Table.

  • If you cannot choose a specialist pool because no chooser for the specialists is available, clicking Continue causes an error.

    Workaround: Click Task Details to go back and fill in remaining details.

  • Support for nested entities is limited to data arranged in a tabular format. Other layouts are currently not supported.

    Any mention of "nested entities" in previous versions of the Document AI Workbench documentation has been replaced with "tabular entities" to reflect the table-based nesting capabilities for this launch.

  • If you see this message, "This processor does not currently support uptraining or evaluation capabilities" on a processor in the Processor gallery, this means that only the prediction endpoint is available at this time.

  • In some cases, required_once and optional_once entities appear multiple times in a document, with the same value copied to multiple locations. While one annotation is sufficient when annotating a test set for evaluation, all copies should be annotated to ensure higher recall for trained models.

    Workaround: Annotate all instances of an entity within a doc. This will support both Evaluation and Training without any infrastructure changes.

  • Existing Human in the Loop (HITL) configurations for Label-level filters do not automatically populate to new processor configurations.

    Workaround: If you require changes to the configuration, navigate to the Human-In-The-Loop tab. Under Set filters, select Label-level filters and click Set Label Filters. A Label-level filters page opens. Manually edit this table to configure all of the labels needed for validation and review.

  • Support for handwritten entity detection, such as signatures, dates, and initials, is limited and may require additional configuration for proper evaluation.

    Workaround: When evaluating pretrained processors, the Value should be manually updated to YES if the entity is present, or NO otherwise. For uptrained processor versions, the Value should be kept as detected by OCR. This issue affects W9, HUD92900B, SSA-1099, and VBA26-0551 processors.

  • If you select a specialist pool for a labeling task while fields in the form are still missing, and then click Continue, you might get an error.

    Workaround: Click Task Details, and enter the missing details in the form. You can then successfully create a labeling task.

September 15, 2022

v1beta3

Schema support for checkboxes and nested entities

  • Customers using Document AI Workbench, and processors for Purchase Order (PO), Invoice, or Expense, now have access to a new schema. This schema enables customers to label checkboxes, if they are defined in the schema, and to accurately represent nested entities, such as parent-child relationships, on the HITL annotation and review console. As additional processors adopt the new schema, these release notes will be updated to include them.

Nested entities

  • The Annotation console now supports labeling for nested entities. The left panel is refreshed with a new look for nested rows to represent nested entities. The value of "parent" will now be the concatenation of all its "children". The parent is effectively a container for all of its children.
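
The parent-as-container behavior can be pictured with a small sketch. This is an illustration only, assuming children are joined with a single space; the note does not state the separator, and the Entity class below is hypothetical rather than the Document AI data model.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Entity:
    """Hypothetical stand-in for a labeled entity in the annotation console."""

    type: str
    text: str = ""
    children: List["Entity"] = field(default_factory=list)

    def value(self) -> str:
        # A leaf keeps its own text; a parent is the concatenation of its
        # children's values (the single-space separator is an assumption).
        if not self.children:
            return self.text
        return " ".join(child.value() for child in self.children)


line_item = Entity(
    type="line_item",
    children=[
        Entity(type="line_item/description", text="Widget"),
        Entity(type="line_item/quantity", text="2"),
    ],
)

if __name__ == "__main__":
    print(line_item.value())
```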

September 01, 2022

v1beta3 & v1

We are standardizing our release processes and naming conventions for processor versions. For more information, see Manage processor versions.

July 27, 2022

v1beta3 & v1

New Release Candidate (RC) versions for PDAI Invoice and Expense processors - July 2022

We launched new RC versions of the Invoice parser and the Expense parser on July 15, 2022. You can access them as follows:

  • Invoice parser: pretrained-next-uptrainable
  • Expense parser: pretrained-next

Here are the details about the contents of the RC version updates:

Processor | New Languages | New Entities
Invoice: pretrained-next-uptrainable | Italian, Portuguese, Romanian, Swedish | N/A
Expense: pretrained-next | Japanese | Support for hotel and car rental folios; payment information entities (last 4 digits of credit card, payment type)

The current limit for uptrainable processors is as follows (it is different from the pre-trained version). We are gathering customer feedback to increase the async limit.

Quotas and limits

Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15

June 30, 2022

v1

VPC Service Control support

Document AI VPC Service Controls provide additional security for your resources and services. To learn more about VPC Service Controls, see the VPC Service Controls overview.

To learn about the limitations when using Document AI with VPC Service Controls, see the supported products and limitations.

June 13, 2022

v1

Document AI is now generally available (GA) in the following new locations:

  • asia-south1 (Mumbai)
  • australia-southeast1 (Sydney)

You must request access to use the new locations. For more information, see Regional and multi-regional support.

v1beta3

New Identity Processor (Preview)

The France Passport Parser is now available in limited preview.

June 10, 2022

v1beta3

The Contract Parser is now more accurate, can extract more fields and supports higher page limits.

June 01, 2022

v1

Identity DocAI General availability (GA) release

The following Identity DocAI processors are now Generally Available (GA).

For more information, see Document AI for Identity.

April 21, 2022

v1

Document OCR processor

The changes from the Google Default Next version have been applied to the Google default version.

The previous Google default version can still be accessed until July 21, 2022 as pretrained-legacy. After July 21, 2022, that version will be removed.

For more information about using different versions of the processor, see Managing processor versions.

For the original announcement of this change, see the January 14, 2022 release note.

April 08, 2022

v1

New Version of Lending W2 Processor

We have released a new Release Candidate version of the W2 Processor. This version is experimental and has the following features:

  • Quality improvement on SSN and EIN fields.
  • Support for box 12 fields, including both codes and values.
  • Fine-grained predictions: EmployeeName, EmployeeAddress, and EmployerNameAndAddress are no longer part of the output and are replaced with additional fields.

March 25, 2022

v1

New & Updated processors available

The following Lending DocAI processors are now available for trusted testers. Access to the trusted testers program is limited and granted on a case-by-case basis. If you would like to be considered, please fill out the DocAI Processor Access Request Form.

New Experimental processors to support new document types:

  • Form VA Loan Discharge Statement Processor
  • Form USDA Conditional Statement Processor
  • Form 1017 Processor
  • Form Biweekly Payment Rider Processor
  • Form VBA26 1805 Processor
  • Form VBA26 6393 Processor
  • Form MERS Rider Processor

Updated Experimental processors:

  • Form 4506-T Processor
  • Form 4506-C Processor
  • Form HUD54114 Processor
  • Form HUD92900WS Processor
  • Form HUD92800 Processor
  • Form 1040-NR Processor
  • Form HUD92900LT Processor
  • Form VBA26 8923 Processor
  • Form HUD92900A Processor
  • Form 1005 Processor

March 10, 2022

v1

Document AI is now generally available (GA) in the following new locations:

  • europe-west3
  • asia-southeast1

You must request access to use the new locations. For more information, see Regional and multi-regional support.

February 18, 2022

v1

New Versions of Procurement Processors

We have launched a new Google Pretrained version of the following procurement processors with various quality improvements:

The changes from the old Google default next version have been applied to the new Google Pretrained version. The old Google default version is still available and will not be deprecated for at least 180 days.

January 26, 2022

v1beta3 & v1

Enrichment using the Knowledge Graph is now Generally Available.

For more information, see Enterprise Knowledge Graph field enrichment.

January 14, 2022

v1

Document OCR processor

We have updated the Google default next version with quality improvements. Consequently, you have 90 days from today to test the new model before the changes are applied to the Google default version. After that, the original Google default version will be available for another 90 days as legacy. For more information about the processor and its versions, see the Document OCR processor.

For more information about using different versions of the processor, see Managing processor versions.

For the original announcement of this change, see the November 5, 2021 release note.

December 15, 2021

v1beta3 & v1

New Lending Processors (Preview)

The following new processors are now available in limited preview:

New Versions of Lending Processors

We have launched new versions of the following lending processors.

These new versions use a new lending document splitting and classification model with improved quality and support for more document types. For more information, see the Document types identified by the Lending Splitter & Classifier.

November 10, 2021

v1

We have lowered the price for many processors. For more information, see the Pricing page.

November 05, 2021

v1

The following procurement processors are now publicly accessible:

We have released a new version of the Document OCR Processor called Google default next. This version changes the distribution of confidence scores in the response. You have 90 days from today to test the new model before the changes are applied to the Google default version. After that, the original version will still be available for another 90 days as legacy. For more information about using different versions of the processor, see Managing processor versions.

New Lending Processor (Preview)

The Mortgage statement parser is now available in limited preview.

October 15, 2021

v1beta3

Contract DocAI (Preview) released

The Contract parser is now available.

October 06, 2021

v1

Document AI is now generally available (GA) in the following new locations:

  • europe-west2
  • northamerica-northeast1

You must request access to use the new locations. For more information, see Regional and multi-regional support.

September 01, 2021

v1

Document AI now supports Data Residency, VPC-SC, Access Transparency, and CMEK.

August 20, 2021

v1

Managing processor versions

You can now switch between different versions of a processor. For more information, see Managing processor versions.

New processor versions

We have added new versions of the following processors:

  • Bank statement parser: improved model quality
  • Pay slip parser: improved model quality and extraction of three additional fields: net_pay, net_pay_ytd, and employee_account_number.

New Lending DocAI processors

The following Lending DocAI (LDAI) processors are now available in limited Preview:

  • 1065 parser
  • 1099-NEC parser
  • 1099-R parser
  • 1120 parser
  • 1120-S parser
  • SSA-1099 parser

Additionally, the LDAI Document Splitter and Classifier has been updated to support the new LDAI processors as well as the following processors:

  • US Driver License Parser
  • US Passport Parser

Human in the Loop (HITL) support for Lending DocAI processors

The following Lending DocAI processors now support Human in the Loop (HITL):

  • 1003 parser
  • 1040 Parser
  • 1040 Schedule C parser
  • 1040 Schedule E parser
  • 1099-DIV parser
  • 1099-G parser
  • 1099-INT parser
  • 1099-MISC parser
  • Bank Statement parser
  • Pay Stub parser
  • W2 parser
  • W9 parser

Knowledge Graph support

The following processors now support Knowledge Graph enrichment:

  • Bank Statement
  • Pay Slip
  • W2 Parser
  • W9 Parser

July 30, 2021

v1

The Invoice Parser now extracts a new field invoice_type that indicates the type of the input document.

July 02, 2021

v1

Change in processor documentation

The location of individual processor information has changed. You can now find individual processor documentation for all solutions (General, Procurement, Lending) in the following locations:

Human in the Loop (HITL) now supports priority queues for each processor, based on the urgency of each document. For more information, see HITL.

June 09, 2021

v1

VPC Service Controls

Integration with Document AI VPC Service Controls is now generally available.

April 09, 2021

v1

Procurement DocAI General availability (GA) release

Procurement DocAI (PDAI) solution is now available in private General Availability (GA).

This includes the following processors:

Human in the Loop (HITL) support for Procurement DocAI processors

Procurement DocAI processors now support Human in the Loop (HITL) AI platform functionality supporting human revisions of predictions.

Invoice parser behavior update

The invoice parser behavior has been updated to include the following features:

  • Offers extended support for the following languages (in addition to English):
    • French
    • Dutch
    • German
    • Spanish
  • Improves supplier parsing accuracy with Knowledge Graph support.
  • Improves prediction quality (accuracy).
  • Extends the header and line item fields extracted by the parser.
  • Increases the number of pages for online processing (10 pages) and offline processing (200 pages).
  • Increases the number of documents per batch in offline processing (50 documents).

Expense parser (Receipt parser) behavior update

The expense parser behavior has been updated to include the following features:

  • Renamed Receipt parser to Expense parser.
  • Improved prediction quality.
  • Improved prediction quality for English, French, and Dutch for more expense types (for example hotel statements).

Human in the Loop (HITL) AI General Availability (GA) released

HITL AI is now available in Private General Availability (GA) for human review of Invoice, Expense, and Utility parser predictions.

Features:

  • HITL configuration enhanced to designate which fields need review and whether a field is mandatory, saving review time.
  • Labeler UI highlights the fields below a confidence score and supports single-click confirmation to improve review efficiency.
  • Labeling Manager shows analytics and metrics by task and by labeler to streamline HITL operations.

April 02, 2021

v1

Lending DocAI General Availability (GA) released

Lending DocAI is now Generally Available (GA). See the documentation for more information.

Lending DocAI processors added

The following Lending DocAI processors are now available:

March 31, 2021

v1

Document AI General availability (GA) released

Document AI is now Generally Available (GA).

January 14, 2021

v1beta3

New Procurement DocAI processor released in limited Preview

The following Procurement DocAI processor is now available in limited Preview:

  • Procurement document splitter

For more information, see the processor documentation.

January 11, 2021

v1beta3

Lending processors behavior update

The behavior of the following processors has been updated:

  • 1003 parser
  • 1040 parser
  • 1099-MISC parser
  • W2 parser
  • W9 parser

Now, if one of these processors is given a multi-page input file that contains a page of the correct document type and one of the supported versions, the processor performs entity extraction for that page; subsequent applicable pages are not processed. If the processor doesn't find any applicable documents in the input file, it returns an error message.

October 29, 2020

v1beta3

Document AI Preview released

The following beta and preview features are available in API version v1beta3:

  • Procurement DocAI processors: Invoice parser and receipt parser.

October 16, 2020

v1beta3

Document AI Preview released

The following beta and preview features are available in API version v1beta3:

  • General processors: Document OCR (Optical Character Recognition), form parser, and document splitter.
  • Lending processors: W9, 1040, W2, 1099-MISC, and 1003 parsers, as well as lending document splitter & classifier.

uri field unavailable

  • Sending a request with the uri field is currently not supported for v1beta3. Any updates to the availability of the uri field will be announced here.

Workaround: Send requests with image information in the content field (base64 encoded information).
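
The workaround can be sketched as follows. This is a minimal illustration, assuming a JSON request body with a document object that holds content and mimeType; those field names and the build_process_request helper are assumptions for illustration, so check the v1beta3 API reference for the exact request shape.

```python
import base64
import json


def build_process_request(file_bytes: bytes, mime_type: str) -> str:
    """Build a JSON body that carries the file inline instead of by uri.

    Hypothetical helper; the "document"/"content"/"mimeType" field names
    are assumptions and are not confirmed by this release note.
    """
    # The content field carries base64-encoded bytes as ASCII text.
    encoded = base64.b64encode(file_bytes).decode("ascii")
    body = {"document": {"content": encoded, "mimeType": mime_type}}
    return json.dumps(body)


if __name__ == "__main__":
    # Placeholder bytes stand in for a real file read with open(path, "rb").
    print(build_process_request(b"%PDF-1.4 example", "application/pdf"))
```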

August 24, 2020

v1beta2

Form Parser model updates

The Form Parser model has been updated. The model update includes the following features:

  • Improved OCR quality for English detection.
  • Improved key-value pair, checkbox, and table parsing detection quality, particularly for rotated images and handwritten text.
  • Decreased latency for complex tables.

August 20, 2020

v1beta2

Invoice Parsing updates

  • Document AI now supports normalized values for certain entities returned from Invoice Parsing requests.
  • We have improved confidence scores for entities returned from Invoice Parsing requests.

July 04, 2020

v1beta2

Invoice Parsing Beta model upgrade

The Invoice Parsing Beta model has been upgraded. This model upgrade results in higher quality results for the entities and entityRelations. There is no API change.

See the product documentation for more information.

April 14, 2020

v1beta2

Document AI Beta released

The following beta features are available in API version v1beta2:

  • Document processing: You can use the API to parse forms or tables from PDF, TIFF, or GIF documents.
  • Regional support: The API now offers multi-regional support (us and eu) for all features. Using a multi-region endpoint enables you to configure the API to store and process your data in the United States or European Union.

Invoice processing Beta

  • Invoice processing is now available as a restricted feature. See Parsing invoices for more information.