Sample processor output

This page contains detailed information on output produced by processors offered by Document AI.

The files on this page are sample documents in a variety of structures and the raw outputs from the Document AI API in the Document format.

The fields returned in the response can be limited by using a FieldMask when making a processing request.

Digitize text

Processors Output samples

Enterprise Document OCR (Optical Character Recognition)

Category Digitize
Solution type General
Functions OCR, Quality Analysis
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-ocr-v1.0-2020-09-23
pretrained-ocr-v1.1-2022-09-12
pretrained-ocr-v1.2-2022-11-10
pretrained-ocr-v2.0-2023-06-02
pretrained-ocr-v2.1-2024-08-07
pretrained-ocr-v2.1.1-2025-01-31

Extract documents

Processors Output samples

Custom Extractor

Category Extract
Solution type Custom
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-foundation-model-v1.0-2023-08-22
pretrained-foundation-model-v1.1-2024-03-12
pretrained-foundation-model-v1.2-2024-05-10
pretrained-foundation-model-v1.3-2024-08-31
pretrained-foundation-model-v1.4-2025-02-05

Form Parser

Category Extract
Solution type General
Functions OCR, Form Parsing, Entity Extraction
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-form-parser-v1.0-2020-09-23
pretrained-form-parser-v2.0-2022-11-10
pretrained-form-parser-v2.1-2023-06-26

Layout Parser

Category Extract
Solution type General
Functions Layout Parsing, Document Chunking
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-layout-parser-v1.0-2024-06-03

Classify documents

Processors Output samples

Custom Classifier

Category Classify
Solution type Custom
Functions OCR, Classification
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file

Custom Splitter

Category Classify
Solution type Custom
Functions OCR, Classification, Splitting
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file

Explore pretrained processors

Processors Output samples

Bank Statement Parser

Category Pretrained
Solution type Lending
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-bankstatement-v1.0-2021-08-08
pretrained-bankstatement-v1.1-2021-08-13
pretrained-bankstatement-v2.0-2021-12-10
pretrained-bankstatement-v3.0-2022-05-16
pretrained-bankstatement-v4.0-2023-07-31
pretrained-bankstatement-v5.0-2023-12-06

Expense Parser

Category Pretrained
Solution type Procurement
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-expense-v1.1-2021-04-09
pretrained-expense-v1.2-2022-02-18
pretrained-expense-v1.3-2022-07-15
pretrained-expense-v1.3.2-2024-09-11
pretrained-expense-v1.4-2022-11-18
pretrained-expense-v1.4.2-2024-09-12

Identity Document Proofing Parser

Category Pretrained
Solution type Identity
Functions OCR, Quality Analysis
Release stage Preview
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-id-proofing-v1.0-2022-10-03
pretrained-id-proofing-v1.1-2023-05-18
pretrained-id-proofing-v1.2-2023-10-04

Invoice Parser

Category Pretrained
Solution type Procurement
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-invoice-v1.1-2021-04-09
pretrained-invoice-v1.2-2022-02-18
pretrained-invoice-v1.3-2022-07-15
pretrained-invoice-v1.4-2022-10-21
pretrained-invoice-v1.5-2023-09-15
pretrained-invoice-v2.0-2023-12-06

US Driver License Parser

Category Pretrained
Solution type Identity
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-us-driver-license-v1.0-2021-06-14

US Passport Parser

Category Pretrained
Solution type Identity
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-us-passport-v1.0-2021-06-14

Utility Parser

Category Pretrained
Solution type Procurement
Functions OCR, Entity Extraction
Release stage General availability
Access status Limited
Full processor details Detailed entry
Sample input file
pretrained-utility-v1.1-2021-04-09
pretrained-utility-v1.2-2022-12-15

W2 Parser

Category Pretrained
Solution type Lending
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Full processor details Detailed entry
Sample input file
pretrained-w2-v1.0-2020-10-01
pretrained-w2-v1.1-2022-01-27
pretrained-w2-v1.2-2022-01-28
pretrained-w2-v2.0-2022-03-30
pretrained-w2-v2.1-2022-06-08