Stay organized with collections Save and categorize content based on your preferences.

Full processor and detail list

This page contains detailed information on all processors offered by Document AI. You can see a list of all processors by solution type.

All Document AI processors adhere to the Data Processing and Security Terms.

General processors

Document OCR (Optical Character Recognition)

Solution type General
Type in UI General
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Public
Description

Identify and extract text in different types of documents.

This processor allows you to identify and extract text from documents in over 200 languages for printed text and 50 languages for handwritten text.

Notes
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-ocr-v1.0-2020-09-23 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-ocr-v1.1-2022-09-12 Release Candidate

None

None

Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 500
Human-in-the-Loop

Not Supported

Sample Output Open in new window.

Document Splitter

Solution type General
Type in UI General
Release stage

Deprecated

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Programmatically split documents on logical boundaries.

Document Splitter uses machine learning to separate documents on logical boundaries. For example, if you have one PDF document with multiple scanned files, the Document AI API will suggest the page location of a new file.

Notes
  • Maximum image size: 65500 x 65500 pixels
  • The splitter is not designed to split logical documents that are over 30 pages long. Logical documents that are more than 30 pages long (e.g. a 40-page bank statement) may be split into two or more docs.
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-document-split-v1.0-2020-09-20 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 2000
Human-in-the-Loop

Not Supported

More information Document splitters behavior

Form Parser

Solution type General
Type in UI General
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Public
Description

Extract form elements such as text and checkboxes.

Notes
  • The current Form Parser model does not support checkboxes in tables, because the checkboxes are treated as key-value pairs. However the checkboxes, if recognized, may be stored in the table as Unicode characters (both checked and unchecked).
Supported languages
Full list of languages
  • af: Afrikaans
  • sq: Albanian
  • ca: Catalan
  • hr: Croatian
  • cs: Czech
  • da: Danish
  • nl: Dutch
  • en: English
  • et: Estonian
  • tl: Filipino
  • fi: Finnish
  • fr: French
  • de: German
  • hu: Hungarian
  • is: Icelandic
  • id: Indonesian
  • it: Italian
  • lv: Latvian
  • lt: Lithuanian
  • ms: Malay
  • no: Norwegian
  • pl: Polish
  • pt: Portuguese (Brazilian & Continental)
  • ro: Romanian
  • sr: Serbian
  • sk: Slovak
  • sl: Slovenian
  • es: Spanish
  • sv: Swedish
  • tr: Turkish
  • vi: Vietnamese
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-form-parser-v1.0-2020-09-23 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 5
Maximum pages (batch/offline/asynchronous requests): 100
Human-in-the-Loop

Supported[2]

Sample Output Open in new window.

Intelligent Document Quality Processor

Solution type General
Type in UI General
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Perform quality assessment of a document based on its readability and get a quality score.

Intelligent Document Quality processor uses machine learning to perform quality assessment of a document based on the readability of its content. This quality assessment is returned as a quality score [0, 1], where 1 means perfect quality. If the quality score detected is lower than 0.5, a list of negative quality reasons (sorted by the likelihood) is also returned.

Notes
  • Quality score is returned in the confidence field of the entity with type="quality_score".
  • The quality/defect_* properties are sorted in descending order by confidence value.
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-document-quality-v1.0-2021-01-20 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 5
Maximum pages (batch/offline/asynchronous requests): 100
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • quality_score
  • quality/defect_blurry
  • quality/defect_dark
  • quality/defect_faint
  • quality/defect_noisy
  • quality/defect_text_too_small
Human-in-the-Loop

Not Supported

Sample Output Open in new window.

Contract processors

Contract parser

Solution type Contract
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Contract AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract text and values from legal contracts such as agreement date, effective date, and parties.

Notes
  • Duration entities such as renewal_term, notice_to_terminate_renewal, and initial_term are normalized in 'years-months-days' format. E.g. 'The initial term is five (5) months' would have the following normalized value: '0-5-0'.
  • If expiration_date is not explicit in the document, it is inferred from the effective_date and initial_term.
Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-contract-v1.2-2021-10-05 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 200
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • agreement_date
  • arbitration_venue
  • confidentiality_clause
  • document_name
  • effective_date
  • expiration_date
  • governing_law
  • indemnity_clause
  • initial_term
  • litigation_venue
  • notice_to_terminate_renewal
  • non_compete_clause
  • parties
  • renewal_term
Uptraining

Supported

Human-in-the-Loop

Supported[2]

Sample Output Open in new window.

Identity processors

France Driver License Parser

Solution type Identity
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract fields such as names, document ID, date of birth, etc.

Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-fr-driver-license-v1.0-2021-06-14 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 2
Maximum pages (batch/offline/asynchronous requests): 2
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • Family Name
  • Given Names
  • Document Id
  • Expiration Date
  • Date Of Birth
  • Issue Date
  • Portrait
Human-in-the-Loop

Supported[2]


France National ID Parser

Solution type Identity
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract fields such as names, document ID, date of birth, etc.

Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-fr-national-id-v1.0-2021-06-14 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 2
Maximum pages (batch/offline/asynchronous requests): 2
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • Family Name
  • Given Names
  • Document Id
  • Expiration Date
  • Date Of Birth
  • Issue Date
  • Address
  • Portrait
Human-in-the-Loop

Supported[2]


France Passport Parser

Solution type Identity
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract fields such as names, document ID, date of birth, etc.

Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-fr-passport-v1.0-2022-04-29 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 2
Maximum pages (batch/offline/asynchronous requests): 2
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • family_name
  • given_name
  • document_id
  • expiration_date
  • date_of_birth
  • issue_date
  • address
  • place_of_birth
  • portrait
Human-in-the-Loop

Supported[2]


US Driver License Parser

Solution type Identity
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Public
Description

Extract fields such as names, document ID, date of birth, etc.

Supported languages
  • en: English
Supported form/versions
  • Supports all 50 States and D.C.
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-us-driver-license-v1.0-2021-06-14 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 2
Maximum pages (batch/offline/asynchronous requests): 2
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • Family Name
  • Given Names
  • Document Id
  • Expiration Date
  • Date Of Birth
  • Issue Date
  • Address
  • Portrait
Human-in-the-Loop

Supported[2]

Sample Output Open in new window.

US Passport Parser

Solution type Identity
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Public
Description

Extract fields such as names, document ID, date of birth, etc.

Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-us-passport-v1.0-2021-06-14 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 2
Maximum pages (batch/offline/asynchronous requests): 2
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • Family Name
  • Given Names
  • Document Id
  • Expiration Date
  • Date Of Birth
  • Issue Date
  • MRZ Code
  • Portrait
Human-in-the-Loop

Supported[2]

Sample Output Open in new window.

Lending processors

1003 Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract over 50 fields from Fannie Mae Form 1003 (URLA).

The 1003 Form is Fannie Mae's form number for the Uniform Residential Loan Application (URLA), a borrower’s application for a mortgage. Freddie Mac's form number is Form 65; both refer to the same form.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • Legacy Form 1003 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1003-v1.0-2020-10-01 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1003-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]


1040 Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1040, including name, filing status, amounts, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2021 (pretrained-1040-v2.0-2022-08-24 version only)
  • 2020 (pretrained-1040-v2.0-2022-08-24 version only)
  • 2019
  • 2018
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1040-v1.0-2020-10-01 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1040-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.
pretrained-1040-v2.0-2022-08-24 Release Candidate
Show fields
  • pensions_annuities_taxable
  • social_security_benefits_taxable
  • ira_distributions
  • ira_distributions_taxable_amount
  • qualified_dividends
  • ordinary_dividends
  • tax_exempt_interest
  • taxable_interest

None

Quality improvements.

Added support for Year 2020 and 2021.

Breaking change: the names of all extracted fields have been renamed from CamelCase to snake_case (for example, first_name instead of FirstName).

This change was made to standardize the format of field names across Document AI.

Previously named 'RC'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.
Sample Output Open in new window.

1040 Schedule C Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1040 Schedule C, including name, wages, etc.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1040sch-c-v1.0-2021-05-27 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1040sch-c-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.

1040 Schedule D Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1040 Schedule D, including name, gains, losses, etc.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1040sch-d-v1.0-2021-11-17 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Not Supported


1040 Schedule E Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1040 Schedule E, including name, expenses, etc.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1040sch-e-v1.0-2021-04-14 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1040sch-e-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]


1099-DIV Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1099-DIV, including account number, qualified dividends, federal income tax withheld, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
  • 2018 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1099div-v1.0-2021-05-27 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1099div-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]


1099-G Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1099-G, including payer, recipient, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1099g-v1.0-2021-05-27 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1099g-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]


1099-INT Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1099-INT, including payer, recipient, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
  • 2018 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1099int-v1.0-2021-05-27 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1099int-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]


1099-MISC Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1099-MISC, including payer, recipient, amounts, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
  • 2018 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1099misc-v1.0-2021-05-27 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1099misc-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]


1099-NEC Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1099-NEC, including payer, recipient, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2021 (standard and customized versions)
  • 2020 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1099nec-v1.0-2021-08-11 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Not Supported


1099-R Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1099-R, including payer, recipient, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2021 (standard and customized versions)
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1099r-v1.0-2021-08-11 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1099r-v2.0-2022-07-25 Release Candidate
Show fields
  • FormYear
  • PayerFirstName
  • PayerLastName
  • PayerMiddleInitial
  • PayerOrganizationName
  • PayerStreetAddress_Line1
  • PayerStreetAddress_Line2
  • PayerCity
  • PayerState
  • PayerZipcode
  • RecipientFirstName
  • RecipientLastName
  • RecipientMiddleInitial
  • RecipientOrganizationName
  • RecipientCity
  • RecipientState
  • RecipientZipcode
  • RecipientStreetAddress1
  • RecipientStreetAddress2

None

Quality improvements.

Uptraining supported.

Page limit increased from 10 to 15.

Breaking change: PayersName, PayersAddress, RecipientName, ReceptientCityStateCountry, RecipientAddress, RecipientStreetAddress, and EmployerNameAndAddress are no longer part of the output, and they are replaced with additional fields.(for example, PayerStreetAddress_Line1, PayerStreetAddress_Line2, PayerCity, PayerState and PayerZipcode instead of PayersAddress).

LocalTaxWithheld_Line2 is not supported in this version. Please use uptraining function to get the prediction if you are interested.

Previously named 'Google Release Candidate 2022-07-25' and versioned '2022-07-25'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Uptraining

Supported

Human-in-the-Loop

Not Supported

Labeling Instructions Open in new window.

1065 Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1065, partnership name, address, assets, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1065-v1.0-2021-08-11 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1065-v2.0-2022-02-03 Stable

None

None

Quality improvements.

Breaking change: the names of all extracted fields have been renamed from CamelCase to snake_case (for example, end_of_tax_year_cash instead of EndOfTaxYear_Cash).

This change was made to standardize the format of field names across Document AI. Previously named 'RC' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Not Supported


1120 Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1120, partnership name, address, assets, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2021 (pretrained-1120-v3.0-2022-04-26 version only)
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1120-v1.0-2021-08-11 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1120-v2.0-2022-02-03 Stable
Show fields
  • amount_owed
  • bal_begin_tax_yr
  • bal_end_tax_yr
  • begin_of_tax_yr_cash
  • begin_of_tax_yr_total_assets
  • begin_of_tax_yr_total_liab_sh_eq
  • city_state_country
  • credited_to_2021_estimated_tax
  • date_incorporated
  • employer_identification_number
  • end_of_tax_yr_cash
  • end_of_tax_yr_total_assets
  • end_of_tax_yr_total_liab_sh_eq
  • income
  • name
  • net_income_or_loss_per_books
  • over_payment
  • refunded
  • street_address
  • total_assets
  • total_deductions
  • total_income

None

Quality improvements.

Breaking change: the names of all extracted fields have been renamed from CamelCase to snake_case (for example, end_of_tax_year_cash instead of EndOfTaxYear_Cash).

This change was made to standardize the format of field names across Document AI. Previously named 'RC' and versioned 'pretrained-next'.

pretrained-1120-v3.0-2022-04-26 Release Candidate
Show fields
  • begin_of_tax_yr_mortgages_notes_bonds_payable_below_1_year
  • beginning_date
  • capital_gain_net_income
  • city_state_country_zipcode
  • cost_of_goods_sold
  • depletion
  • depreciation
  • end_of_tax_yr_mortgages_notes_bonds_payable_below_1_year
  • ending_date
  • final_return_checkbox
  • foreign_ownership_no_checkbox
  • foreign_ownership_yes_checkbox
  • gross_income
  • net_gain_or_loss
  • net_operating_loss_deduction
  • other_income
  • other_ownership_no_checkbox_options
  • other_ownership_yes_checkbox_options
  • tax_year
  • taxable_income
  • total_tax_page1
  • travel_and_entertainment

None

Quality improvements.

Added support for Year 2021.

Entity list is in snake_case format similar to pretrained-1120-v2.0-2022-02-03.

Entity city_state_country is renamed into city_state_country_zipcode.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Not Supported

Labeling Instructions Open in new window.

1120S Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form 1120S, name, address, assets, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2021 (pretrained-1120s-v2.1-2022-07-22 version only)
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-1120s-v1.0-2021-08-11 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-1120s-v2.0-2022-02-03 Stable
Show fields
  • accum_adj_acc_bal_begin_tax_yr
  • accum_adj_acc_bal_end_tax_yr
  • accum_earn_prft_bal_begin_tax_yr
  • accum_earn_prft_bal_end_tax_yr
  • amount_owed
  • begin_of_tax_yr_cash
  • begin_of_tax_yr_total_assets
  • begin_of_tax_yr_total_liab_sh_eq
  • city_state_country
  • credited_to_2021_estimated_tax
  • date_incorporated
  • employer_identification_number
  • end_of_tax_yr_cash
  • end_of_tax_yr_total_assets
  • end_of_tax_yr_total_liab_sh_eq
  • form_year
  • income_or_loss
  • income_or_loss_reconciliation
  • name
  • net_income_or_loss_per_books
  • number_of_shareholders
  • ordinary_biz_income_or_loss
  • ordinary_biz_income_loss_sch_k
  • other_adj_acc_bal_begin_tax_yr
  • other_adj_acc_bal_end_tax_yr
  • over_payment
  • refunded
  • street_address
  • taxable_income_bal_begin_tax_yr
  • taxable_income_bal_end_tax_yr
  • total_assets
  • total_deductions
  • total_income_or_loss

None

Quality improvements.

Breaking change: the names of all extracted fields have been renamed from CamelCase to snake_case (for example, end_of_tax_year_cash instead of EndOfTaxYear_Cash).

This change was made to standardize the format of field names across Document AI. Previously named 'RC' and versioned 'pretrained-next'.

pretrained-1120s-v2.1-2022-07-22 Release Candidate
Show fields
  • begin_of_tax_yr_accounts_payable
  • begin_of_tax_yr_mortgages_notes_bonds_less_than_a_yr
  • begin_of_tax_yr_other_assets
  • begin_of_tax_yr_other_current_assets
  • begin_of_tax_yr_other_current_liabilities
  • begin_of_tax_yr_tax_exempt_securities
  • begin_of_tax_yr_trade_notes_and_accounts
  • begin_of_tax_yr_us_govt_obligations
  • cost_of_goods_sold
  • depletion
  • depreciation
  • end_of_tax_yr_accounts_payable
  • end_of_tax_yr_mortgages_notes_bonds_less_than_a_yr
  • end_of_tax_yr_other_assets
  • end_of_tax_yr_other_current_assets
  • end_of_tax_yr_other_current_liabilities
  • end_of_tax_yr_tax_exempt_securities
  • end_of_tax_yr_trade_notes_and_accounts
  • end_of_tax_yr_us_govt_obligations
  • tax_year_begin_date
  • tax_year_end_date
  • travel_and_entertainment
  • other_income_or_loss

None

Quality improvements and supporting new fields.

Added support for year 2021.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.

Bank Statement Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from bank statements including name, account, transactions, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-bankstatement-v1.0-2021-08-08 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-bankstatement-v1.1-2021-08-13 Stable

None

None

Quality improvement; Launched in Aug 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.
pretrained-bankstatement-v2.0-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next V2' and versioned 'pretrained-next-v2'.
pretrained-bankstatement-v3.0-2022-05-16 Release Candidate

None

None

This version assumes that the input file contains a single bank statement. Unlike the default version, this version does not check the input file for bank statements and will not return an error if no bank statements are found. If your input document contains multiple bank statements, use the Lending Document Splitter & Classifier for splitting before sending it to this processor. Launched in May 2022. Previously versioned 'pretrained-2022-05-16'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 30
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Enriched fields

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.
Sample Output Open in new window.

HOA Statement Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Homeowner Association(HOA) statements including name, address, due amount, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-hoa-statement-v1.0-2021-12-08 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 50
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Enriched fields

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]


HUD-92900B Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form HUD-92900B dates and signature existence.

Supported languages
  • en: English
Supported form/versions
  • 2019 (standard version only)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-hud92900b-2021-09-16 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Not Supported


Lending Document Splitter & Classifier

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Identify documents in a large file and classify known lending document types.

Mortgage application packages and other lending documents often contain multiple documents (such as 1040 tax forms, W2, bank statements, etc.) in a single file. Lending document splitter allows you to programmatically split these combined lending documents on logical boundaries. The split files are then classified based on the document type, so that the appropriate extraction model can be applied to each file.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 1250
Human-in-the-Loop

Not Supported

Document types identified
Show types

This splitter can identify and classify the following types of documents and form:

  • 1003 - Legacy Form (standard and customized versions)
    • Return type(s): 1003[1], 1003_2009
  • 1040 - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1040[1], 1040_2018, 1040_2019, 1040_2020[1]
  • 1040 Schedule C - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1040sc[1], 1040sc_2018[1], 1040sc_2019, 1040sc_2020
  • 1040 Schedule E - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1040se[1], 1040se_2018[1], 1040se_2019, 1040se_2020
  • 1065 - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1065[1], 1065_2018[1], 1065_2019, 1065_2020
  • 1099-DIV - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1099div[1], 1099div_2018, 1099div_2019, 1099div_2020
  • 1099-G - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1099g[1], 1099g_2018[1], 1099g_2019, 1099g_2020
  • 1099-INT - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1099int[1], 1099int_2018, 1099int_2019, 1099int_2020
  • 1099-MISC - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1099misc[1], 1099misc_2018, 1099misc_2019, 1099misc_2020
  • 1099-NEC - 2020 (standard and customized versions)
    • Return type(s): 1099nec[1], 1099nec_2020
  • 1099-R - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1099r[1], 1099r_2018, 1099r_2019, 1099r_2020
  • 1120 - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1120[1], 1120_2018[1], 1120_2019, 1120_2020
  • 1120S - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1120s[1], 1120s_2018[1], 1120s_2019, 1120s_2020
  • Bank Statement
    • Return type(s): account_statement_bank
  • Pay Slip
    • Return type(s): payslip
  • SSA-1099 - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): 1099ssa[1], 1099ssa_2018[1], 1099ssa_2019, 1099ssa_2020
  • US Driver License
    • Return type(s): US_Driver_License
  • US Pasport
    • Return type(s): US_Passport
  • W2 - 2018, 2019, 2020 (standard and customized versions)
    • Return type(s): w2[1], w2_2018, w2_2019, w2_2020
  • W9 - Rev. 10-2018, Rev. 11-2017
    • Return type(s): w9[1], w9_2017, w9_2018
  • If the splitter cannot identify the type of the document, it returns other.
Processor versions
Version ID Release Channel Additional document types detected Description
pretrained-lending-document-split-v1.0-2021-12-08 Stable

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-lending-document-split-v1.1-2021-12-09 Stable
Show types
  • 1005_1996
  • 1040_2021[1]
  • 1040nr[1]
  • 1040nr_2018
  • 1040nr_2019
  • 1040nr_2020
  • 1040nr_2021[1]
  • 1040sb[1]
  • 1040sb_2018
  • 1040sb_2019
  • 1040sb_2020
  • 1040sb_2021[1]
  • 1040sc_2021[1]
  • 1040sd[1]
  • 1040sd_2018
  • 1040sd_2019
  • 1040sd_2020
  • 1040sd_2021[1]
  • 1040se_2021[1]
  • 1040sr[1]
  • 1040sr_2018
  • 1040sr_2019
  • 1040sr_2020
  • 1040sr_2021[1]
  • 1065_2021
  • 1076_2016
  • 1099div_2021[1]
  • 1099g_2021[1]
  • 1099int_2021[1]
  • 1099misc_2021[1]
  • 1099nec_2018[1]
  • 1099nec_2019[1]
  • 1099nec_2021
  • 1099r_2021[1]
  • 1099ssa_2021
  • 1120_2021[1]
  • 1120s_2021[1]
  • 1_4_Family_Rider_3170
  • 3108_Adjustable_Rate_Rider
  • 3140_Condominium_Rider
  • 3190_Balloon_Rider
  • 3890_Second_Home_Rider
  • 4506_T[1]
  • 4506_T_2018[1]
  • 4506_T_2019[1]
  • 4506_T_2020[1]
  • 4506_T_2021
  • 4506_T_EZ[1]
  • 4506_T_EZ_2018[1]
  • 4506_T_EZ_2019
  • 4506_T_EZ_2020[1]
  • 4506_T_EZ_2021
  • account_statement_investment_and_retirement
  • appraisal_ucdp_ssr
  • dhs_flood_certification
  • f11_12956_2017[1]
  • hud_54114
  • hud_92051
  • hud_92541
  • hud_92544
  • hud_92800
  • hud_92900a
  • hud_92900b
  • hud_92900lt
  • hud_92900ws
  • mortgage_statements
  • property_insurance
  • pud_rider
  • revocable_trust_rider
  • ssa_89[1]
  • ssa_89_2018[1]
  • ssa_89_2019[1]
  • ssa_89_2020
  • ssa_89_2021
  • ucc_financing_statement
  • usda_ad_3030
  • vba_26_0551_2004
  • vba_26_8923_2021
  • w2_2021[1]
  • w9_2019[1]
  • w9_2020[1]
  • w9_2021[1]
New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Sample Output Open in new window.
More information Document splitters behavior

Mortgage Statement Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from mortgage statements including name, address, due amount, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-mortgage-statement-v1.0-2021-10-17 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 50
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Enriched fields

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.

Pay Slip Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from pay slips, including name, business, amounts, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-paystub-v1.0-2021-03-19 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-paystub-v1.1-2021-08-13 Stable
Show fields
  • net_pay
  • net_pay_ytd
  • employee_account_number

None

Quality improvement and new fields support; Launched in Aug 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.
pretrained-paystub-v1.2-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next V2' and versioned 'pretrained-next-v2'.
pretrained-paystub-v2.0-2022-05-17 Release Candidate
Show fields
  • deduction_item
  • deduction_item/deduction_type
  • deduction_item/deduction_this_period
  • deduction_item/deduction_ytd
  • direct_deposit_item
  • direct_deposit_item/direct_deposit
  • direct_deposit_item/employee_account_number
  • earning_item
  • earning_item/earning_type
  • earning_item/earning_rate
  • earning_item/earning_hours
  • earning_item/earning_this_period
  • earning_item/earning_ytd
  • page_number
  • tax_item
  • tax_item/tax_type
  • tax_item/tax_this_period
  • tax_item/tax_ytd
  • federal_additional_tax
  • federal_allowance
  • federal_marital_status
  • state_additional_tax
  • state_allowance
  • state_marital_status

None

This version assumes that the input file contains a single pay slip. Unlike the default version, this version does not check the input file for pay slips and will not return an error if no pay slips are found. If your input document contains multiple pay slips, use the Lending Document Splitter & Classifier for splitting before sending it to this processor.

Quality improvement, new fields support and new schema. Bonus, Commissions, Holiday, Overtime, Regular Pay and Vacation are now part of earning_item/earning_this_period, and their year-to-date versions are in earning_item/earning_ytd. Direct Deposit and Employee Account Number are now nested under direct_deposit_item.

Async page limit is 10.

Launched in March 2022.

pretrained-paystub-v2.0-2022-07-22 Release Candidate

None

None

Quality improvement and uptraining enhancements.

Launched in July 2022.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 50
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Enriched fields

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Uptraining

Supported

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.

Retirement/Investment Statement Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Retirement/Investment statements including name, address, due amount, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-retirement-investment-statement-v1.0-2021-12-03 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 30
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Enriched fields

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.

SSA-89 Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form SSA-89, including name, address, SSN, etc.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-ssa89-v1.0-2021-09-16 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]


SSA-1099 Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form SSA-1099 including name, address, SSN, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-ssa1099-v1.0-2021-08-09 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Not Supported


VBA26-0551 Parser

Solution type Lending
Type in UI Specialized
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form VBA26-0551, coborrower signature, veteran signature, etc.

Supported languages
  • en: English
Supported form/versions
  • 2004 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-vba26-0551-v1.0-2021-09-16 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Not Supported


W2 Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form W2, including employee, employer, wages, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • 2020 (standard and customized versions)
  • 2019 (standard and customized versions)
  • 2018 (standard and customized versions)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-w2-v1.0-2020-10-01 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-w2-v1.1-2022-01-27 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'pretrained-next'.
pretrained-w2-v1.2-2022-01-28 Stable
Show fields
  • AllocatedTips
  • DependentCareBenefits
  • EmployerNameAndAddress
  • EmployerStateIdNumber_Line1
  • FormYear
  • LocalIncomeTax_Line1
  • LocalityName_Line1
  • LocalWagesTipsEtc_Line1
  • NonqualifiedPlans
  • SocialSecurityTips
  • State_Line1
  • StateIncomeTax_Line1
  • StateWagesTipsEtc_Line1

None

Quality improvements and supporting new fields; does not include splitter.

Async page limit is 15.

Previously named 'Google Default Next V2' and versioned 'pretrained-next-v2'.

pretrained-w2-v2.0-2022-03-30 Release Candidate
Show fields
  • a_Code
  • a_Value
  • b_Code
  • b_Value
  • c_Code
  • c_Value
  • d_Code
  • d_Value
  • EmployeeAddress_City
  • EmployeeAddress_StreetAddressOrPostalBox
  • EmployeeAddress_AdditionalStreetAddressOrPostalBox
  • EmployeeAddress_State
  • EmployeeAddress_Zip
  • EmployeeName_FirstName
  • EmployeeName_LastName
  • EmployeeName_MiddleNameOrInitial
  • EmployerAddress_City
  • EmployerAddress_StreetAddressOrPostalBox
  • EmployerAddress_AdditionalStreetAddressOrPostalBox
  • EmployerAddress_State
  • EmployerAddress_Zip
  • EmployerName

None

Quality improvements and support for box 12 fields and fine-grained predictions of EmployeeName, EmployeeAddress, and EmployerNameAndAddress, all of which are no longer part of the output and are replaced with additional fields.

Async page limit is 15.

Previously versioned 'release-candidate'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Enriched fields

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Uptraining

Supported

Human-in-the-Loop

Supported[2]


W9 Parser

Solution type Lending
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract from Form W9 including name, address, TIN, etc.

Notes

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Supported languages
  • en: English
Supported form/versions
  • Form (Rev. 10-2018, Rev. 11-2017)
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-w9-v1.0-2020-09-25 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-w9-v1.1-2021-12-10 Stable

None

None

New splitter and classifier model with improved quality and more document type support; Launched in Dec 2021; Previously named 'Google Default Next' and versioned 'Google Default Next'.
pretrained-w9-v1.2-2022-01-27 Stable

None

None

Handles documents with variations other than the standard template and does not include splitter and classifier model (that has its own service and can be called seperately); Launched in Feb 2022. Previously named 'Google Default Next V2' and versioned 'pretrained-next-v2'.
pretrained-w9-v2.0-2022-06-23 Release Candidate

None

None

Quality improvements.

Sync page limit is 10.

Breaking change: the names of all extracted fields have been renamed from CamelCase to snake_case (for example, business_name instead of BusinessName).

This change was made to standardize the format of field names across Document AI.

This processor assumes the input file contains the supported document from the beginning and will not classify or split the input file. If your input file does not meet this assumption, please run the Lending Document Splitter & Classifier first and preprocess the input file.

Previously named 'Google Release Candidate 2022-06-23' and versioned '2022-06-23'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 15
Fields detected in the earliest version

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Enriched fields

This content is available to approved customers. You can view this content after you have been approved and added to the appropriate allowlist.

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.

Procurement processors

Expense Parser

Solution type Procurement
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Public
Description

Extract text and values from expense documents such as expense date, supplier name, total amount, and currency.

Supported languages
Full list of languages
  • de: German
  • en: English
  • es: Spanish
  • fr: French
  • ja: Japanese
  • nl: Dutch
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-expense-v1.1-2021-04-09 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-expense-v1.2-2022-02-18 Stable

None

None

Previously named 'Google Pretrained'.
pretrained-expense-v1.3-2022-07-15 Release Candidate
Show fields
  • credit_card_last_four_digits
  • payment_type
  • ja: Japanese
Launched in July 2022. Support for hotel and car rental folios. Previously named 'Google Default Next' and versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 10
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • credit_card_last_four_digits
  • currency
  • end_date
  • net_amount
  • payment_type
  • purchase_time
  • receipt_date
  • start_date
  • supplier_address
  • supplier_city
  • supplier_name
  • tip_amount
  • total_amount
  • total_tax_amount
  • line_item
    • line_item/amount
    • line_item/description
    • line_item/product_code
Enriched fields

You can find more information in the Enterprise Knowledge Graph page.

Full list of enriched fields
  • supplier_address
  • supplier_name
  • supplier_phone
Uptraining

Supported

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.
Sample Output Open in new window.

Invoice Parser

Solution type Procurement
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Public
Description

Extract text and values from invoices such as invoice number, supplier name, invoice amount, tax amount, invoice date, due date.

The invoice Parser extracts both header and line item fields, such as invoice number, supplier name, invoice amount, tax amount, invoice date, due date, and line item amounts.

Supported languages
Full list of languages
  • de: German
  • en: English
  • es: Spanish
  • et: Estonian
  • fr: French
  • it: Italian
  • lv: Latvian
  • lt: Lithuanian
  • nl: Dutch
  • pt: Portuguese (Brazilian & Continental)
  • ro: Romanian
  • sv: Swedish
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-invoice-v1.1-2021-04-09 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-invoice-v1.2-2022-02-18 Stable

None

None

Launched in Feb 2022. Previously named 'Google Pretrained' and versioned 'pretrained-next'.
pretrained-invoice-v1.3-2022-07-15 Release Candidate

None

  • it: Italian
  • pt: Portuguese (Brazilian & Continental)
  • ro: Romanian
  • sv: Swedish
Uptrainable processor version. Launched in July 2022. Previously named 'Google Pretrained Next with Uptraining' and 'pretrained-next-uptrainable'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 200
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • amount_paid_since_last_invoice
  • carrier
  • currency
  • currency_exchange_rate
  • delivery_date
  • due_date
  • freight_amount
  • invoice_date
  • invoice_id
  • line_item
    • line_item/amount
    • line_item/description
    • line_item/product_code
    • line_item/purchase_order
    • line_item/quantity
    • line_item/unit
    • line_item/unit_price
  • net_amount
  • original_invoice_id
  • payment_terms
  • purchase_order
  • receiver_address
  • receiver_email
  • receiver_name
  • receiver_phone
  • receiver_tax_id
  • receiver_website
  • remit_to_address
  • remit_to_name
  • ship_from_address
  • ship_from_name
  • ship_to_address
  • ship_to_name
  • supplier_address
  • supplier_email
  • supplier_iban
  • supplier_name
  • supplier_payment_ref
  • supplier_phone
  • supplier_registration
  • supplier_tax_id
  • supplier_website
  • total_amount
  • total_tax_amount
  • vat
    • vat/amount
    • vat/category_code
    • vat/tax_amount
    • vat/tax_rate
Enriched fields

You can find more information in the Enterprise Knowledge Graph page.

Full list of enriched fields
  • supplier_address
  • supplier_name
  • supplier_phone
Uptraining

Supported

Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.
Sample Output Open in new window.

Procurement Document Splitter & Classifier

Solution type Procurement
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Allows you to programmatically split these combined procurement documents on logical boundaries.

Procurement document splitter allows you to take different procurement documents grouped in a single file and programmatically split the documents on logical boundaries. The split files are then classified based on the document type, so that the appropriate extraction model can be applied to each file.

Notes
  • The splitter is not designed to split logical documents that are over 30 pages long. Logical documents that are more than 30 pages long (e.g. a 40-page bank statement) may be split into two or more docs and classified separately.
Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-procurement-splitter-v1.1-2021-04-09 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.
pretrained-procurement-splitter-v1.2-2022-08-19 Stable

None

None

Previously versioned 'pretrained-next'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 1250
Human-in-the-Loop

Not Supported

Document types identified
Show types

This splitter can identify and classify the following types of documents and form:

  • Utility statement: A bill or receipt issued by an utility company (telecommunications, gas, electric, cable service) that shows the amount owed by the customer for the services provided. This may also show the previous payments made by the customer for current or prior services.
    • Return type(s): utility_statement
  • Debit note: A document issued by a business stating a monetary amount a client owes to the business.
    • Return type(s): debit_note
  • Credit note: A document issued by a business that needs to provide a client with a discount or a refund, or to correct a previous invoicing error.
    • Return type(s): credit_note
  • Credit Card Slip: A document that shows a payment made by credit card. It typically includes the total charge amount, a tip amount (mostly US documents), and a total payment. Tip amount and total payment are usually handwritten. This doc is relevant for expense processing. It is not a suitable proof of expense in expense processing.
    • Return type(s): credit_card_slip
  • Restaurant statement: A document issued by a restaurant to a customer itemizing the specific items consumed, the taxes, total amount, tips, and amount paid.
    • Return type(s): restaurant_statement
  • Air travel statement: A document issued by an airline to a customer itemizing the specific flight and non-flight charges, and the amount paid (if available).
    • Return type(s): air_travel_statement
  • Hotel statement: A document issued by a hotel to a customer itemizing the specific charges related to a hotel stay, and the amount paid (if available).
    • Return type(s): hotel_statement
  • Car rental statement: A document issued by a car rental company to a customer itemizing the specific charges related to a car rental, and the amount paid (if available).
    • Return type(s): car_rental_statement
  • Ground transportation statement: A document issued by a ground transportation company (ride sharing, train/subway) to a customer itemizing the specific charges related to a trip, and the amount paid (if available).
    • Return type(s): ground_transportation_statement
  • Invoice statement: A document sent by the seller to the customer that requests payments for products or services and (for the purpose of our taxonomy) is not covered by any other document type definition.
    • Return type(s): invoice_statement
  • Receipt statement: A document that shows proof of payment which confirms that a customer has received the goods and services they paid a business for. Conversely, this can be a document showing the business was compensated for the goods or services they sold to a customer and (for the purpose of our taxonomy) is not covered by any other document type definition.
    • Return type(s): receipt_statement
  • If the splitter cannot identify the type of the document, it returns other.
Sample Output Open in new window.
More information Document splitters behavior

Utility Parser

Solution type Procurement
Type in UI Specialized
Release stage

General availability

For more information, see see the launch stage descriptions.

Access status Limited

To request API access, please fill out and submit the Document AI limited access customer request form. The form requests information about you, your company, and your use case. Please note that a Google Cloud Project ID is required for access. To create a new Google Cloud project, or identify your existing project's Project ID see the following instructions.

After you submit the form, the Document AI team will review your request to ensure you meet the criteria for access. If approved, you will receive an email with instructions on how to access and use this feature.

Description

Extract text and values from utility bills such as supplier name and previous paid amount.

Supported languages
  • en: English
Processor versions
Version ID Release Channel Additional fields detected Additional languages supported Description
pretrained-utility-v1.1-2021-04-09 Stable

None

None

Previously named 'Google default' and versioned 'pretrained'.

For more information, see Managing processor versions.

Quotas and limits
Maximum pages (online/synchronous requests): 10
Maximum pages (batch/offline/asynchronous requests): 200
Fields detected in the earliest version

You can also find this information in the Field detected page.

Full list of fields
  • adjusted_amount
  • amount_due
  • amount_paid_since_last_invoice
  • balance_transfer_amount
  • carrier
  • currency
  • currency_exchange_rate
  • delivery_date
  • deposit_credited_amount
  • due_date
  • freight_amount
  • invoice_date
  • invoice_id
  • late_fee_amount
  • line_item
    • line_item/amount
    • line_item/description
    • line_item/frequency
    • line_item/product_code
    • line_item/purchase_order
    • line_item/quantity
    • line_item/service_address
    • line_item/service_end_date
    • line_item/service_id_1
    • line_item/service_id_2
    • line_item/service_start_date
    • line_item/supplier_account_number
    • line_item/tax_amount
    • line_item/unit_number
    • line_item/unit_of_measure
    • line_item/unit_price
    • line_item/usage
  • net_amount
  • payment_terms
  • prior_amount_due
  • prior_paid_amount
  • purchase_order
  • receiver_address
  • receiver_email
  • receiver_name
  • receiver_phone
  • receiver_tax_id
  • receiver_website
  • reclaimed_water
  • remit_to_address
  • remit_to_name
  • service
    • service/service_end_date
    • service/service_id
    • service/service_start_date
    • service/unit_of_measure
    • service/usage
  • service_address
  • service_end_date
  • service_id
  • service_start_date
  • ship_from_address
  • ship_from_name
  • ship_to_address
  • ship_to_name
  • supplier_account_number
  • supplier_address
  • supplier_email
  • supplier_iban
  • supplier_name
  • supplier_payment_ref
  • supplier_phone
  • supplier_registration
  • supplier_tax_id
  • supplier_website
  • tampering
  • total_amount
  • total_tax_amount
  • usage
  • vat
    • vat/amount
    • vat/category_code
    • vat/tax_amount
    • vat/tax_rate
Human-in-the-Loop

Supported[2]

Labeling Instructions Open in new window.
Sample Output Open in new window.

Custom processors

Custom Document Extractor

Solution type Custom
Type in UI Custom
Release stage

Preview

For more information, see see the launch stage descriptions.

Access status Public
Description

Build and train your own custom entity extractor for new document types for which no pre-trained processors are available

Notes
Supported languages
Full list of languages
  • af: Afrikaans
  • sq: Albanian
  • ca: Catalan
  • hr: Croatian
  • cs: Czech
  • da: Danish
  • nl: Dutch
  • en: English
  • et: Estonian
  • tl: Filipino
  • fi: Finnish
  • fr: French
  • de: German
  • hu: Hungarian
  • is: Icelandic
  • id: Indonesian
  • it: Italian
  • lv: Latvian
  • lt: Lithuanian
  • ms: Malay
  • no: Norwegian
  • pl: Polish
  • pt: Portuguese (Brazilian & Continental)
  • ro: Romanian
  • sk: Slovak
  • sl: Slovenian
  • es: Spanish
  • sv: Swedish
  • tr: Turkish
  • vi: Vietnamese
Quotas and limits
Maximum pages (online/synchronous requests): 15
Maximum pages (batch/offline/asynchronous requests): 50
Uptraining

Supported

Human-in-the-Loop

Not Supported


[1] The corresponding parser for this form does not support this document type. This means that the splitter can identify and classify documents of this type, but Document AI does not provide a parser to extract information.

[2] If Human in the Loop (HITL) is enabled, the HITL limit of 10 pages per document applies, in addition to any other processor page limits.