Send feedback
Processor list
bookmark_border bookmark
Stay organized with collections
Save and categorize content based on your preferences.
This page contains detailed information on all processors offered by
Document AI. You can see a list of all processors by solution type.
All Document AI processors adhere to the
Data Processing and Security Terms .
Refer to the Managing processor versions
documentation for more details. Also, specific processor limits apply in addition
to overall product quotas and limits .
Digitize text
Enterprise Document OCR (Optical Character Recognition)
Description
Identify and extract text in different types of documents.
This processor allows you to identify and extract text, including handwritten text, from documents in more than 200 languages. The processor also uses machine learning to perform a quality assessment of a document based on the readability of its content.
Category
Digitize
Functions
OCR, Quality Analysis
Release stage
General availability
Access status
Public
lock_open
Type in API
OCR_PROCESSOR
Supported languages
Full list of languages
Language Name
BCP 47 Tag
Script
Handwriting supported
Afrikaans
af
Latn
Albanian
sq
Latn
Arabic
ar
Arab
Armenian
hy
Armn
Belarusian
be
Cyrl
Bangla
bn
Beng
Bengali
bn
Beng
Bulgarian
bg
Cyrl
Catalan
ca
Latn
Chinese
zh
Hani
Croatian
hr
Latn
Czech
cs
Latn
Danish
da
Latn
Dutch
nl
Latn
English
en
Latn
Estonian
et
Latn
Filipino
fil
Latn
Finnish
fi
Latn
French
fr
Latn
German
de
Latn
Greek
el
Grek
Gujarati
gu
Gujr
Hebrew
iw
Hebr
Hindi
hi
Deva
Hungarian
hu
Latn
Icelandic
is
Latn
Indonesian
id
Latn
Italian
it
Latn
Japanese
ja
Jpan
Kannada
kn
Knda
Khmer
km
Khmr
Korean
ko
Kore
Lao
lo
Laoo
Latvian
lv
Latn
Lithuanian
lt
Latn
Macedonian
mk
Cyrl
Malay
ms
Latn
Malayalam
ml
Mlym
Marathi
mr
Deva
Nepali
ne
Deva
Norwegian
no
Latn
Persian
fa
Arab
Polish
pl
Latn
Portuguese (Portugal & Brazil)
pt
Latn
Punjabi
pa
Guru
Romanian
ro
Latn
Russian
ru
Cyrl
Serbian
sr
Cyrl
Slovak
sk
Latn
Slovenian
sl
Latn
Spanish
es
Latn
Swedish
sv
Latn
Tagalog
tl
Latn
Tamil
ta
Taml
Telugu
te
Telu
Thai
th
Thai
Turkish
tr
Latn
Ukrainian
uk
Cyrl
Vietnamese
vi
Latn
Yiddish
yi
Hebr
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-ocr-v1.0-2020-09-23
Stable
None
None
Not recommended for use and will be discontinued in the United States (US) and European Union (EU) starting April 30, 2025.
pretrained-ocr-v1.1-2022-09-12
Stable
None
None
Not recommended for use and will be discontinued in the United States (US) and European Union (EU) starting April 30, 2025.
pretrained-ocr-v1.2-2022-11-10
Stable
None
None
Frozen model version of v1.0: Model files, configurations, and binaries of a version snapshot frozen in a container image for up to 18 months.
pretrained-ocr-v2.0-2023-06-02
Stable
None
None
Production-ready model specialized for document use cases. Includes access to all OCR add-ons.
pretrained-ocr-v2.1-2024-08-07
Stable
None
None
The main areas of improvement for v2.1 are: better printed text recognition, more precise checkbox detection and more accurate reading order.
pretrained-ocr-v2.1.1-2025-01-31
Release candidate
None
None
v2.1.1 is similar to V2.1, and is available in all regions except: US
, EU
, and asia-southeast1
.
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
500
Note: Specific processor limits are in addition to
overall product quotas and limits .
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
asia-south1
asia-southeast1
australia-southeast1
eu
europe-west2
europe-west3
northamerica-northeast1
us
Note: Consult Supported regions
for a full list of processor version availability by region.
More information
Enterprise Document OCR
Refer to Sample datasets
for sample labeled and unlabeled datasets to use for training.
Custom Extractor
Description
Extract fields from documents using generative AI or custom models; fine tune models to accurately extract data from your documents.
Category
Extract
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
CUSTOM_EXTRACTION_PROCESSOR
Notes
Supported languages
Full list of languages
Language Name
BCP 47 Tag
Script
Handwriting supported
Afrikaans
af
Latn
Arabic
ar
Arab
Azerbaijani
az
Latn
Azerbaijani (Cyrillic)
az-Cyrl
Cyrl
Belarusian
be
Cyrl
Bulgarian
bg
Cyrl
Bosnian
bs
Latn
Catalan
ca
Latn
Cebuano
ceb
Latn
Czech
cs
Latn
Welsh
cy
Latn
Danish
da
Latn
German
de
Latn
Greek
el
Grek
English
en
Latn
Esperanto
eo
Latn
Spanish
es
Latn
Estonian
et
Latn
Basque
eu
Latn
Persian
fa
Arab
Finnish
fi
Latn
Filipino
fil
Latn
French
fr
Latn
Irish
ga
Latn
Galician
gl
Latn
Hindi
hi
Deva
Croatian
hr
Latn
Haitian Creole
ht
Latn
Hungarian
hu
Latn
Indonesian
id
Latn
Icelandic
is
Latn
Italian
it
Latn
Hebrew
iw
Hebr
Japanese
ja
Jpan
Javanese
jv
Latn
Kazakh
kk
Cyrl
Korean
ko
Kore
Kyrgyz
ky
Cyrl
Latin
la
Latn
Lithuanian
lt
Latn
Latvian
lv
Latn
Macedonian
mk
Cyrl
Mongolian
mn
Cyrl
Marathi
mr
Deva
Malay
ms
Latn
Maltese
mt
Latn
Nepali
ne
Deva
Dutch
nl
Latn
Norwegian
no
Latn
Polish
pl
Latn
Pashto
ps
Arab
Portuguese (Portugal & Brazil)
pt
Latn
Romanian
ro
Latn
Russian
ru
Cyrl
Russian (Petrine Orthography)
ru-PETR1708
Cyrl
Sanskrit
sa
Deva
Slovak
sk
Latn
Slovenian
sl
Latn
Albanian
sq
Latn
Serbian
sr
Cyrl
Swedish
sv
Latn
Swahili
sw
Latn
Tagalog
tl
Latn
Turkish
tr
Latn
Ukrainian
uk
Cyrl
Urdu
ur
Arab
Uzbek
uz
Latn
Uzbek (Cyrillic)
uz-Cyrl
Cyrl
Vietnamese
vi
Latn
Yiddish
yi
Hebr
Chinese simplified
zh-Hans
Hani
Chinese traditional
zh-Hant
Hani
Zulu
zu
Latn
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-foundation-model-v1.0-2023-08-22
Stable
None
None
Production-ready candidate specialized for document use cases powered by specialized vision models and foundation models. Note: Effective April 9, 2025, this version will no longer be accessible.
pretrained-foundation-model-v1.1-2024-03-12
Release Candidate
None
None
Release candidate powered by Gemini 1.0 Pro LLM and newly developed technologies, including specialized language and vision models. Also includes advanced OCR features such as checkbox detection. Note: Effective April 9, 2025, this version will no longer be accessible.
pretrained-foundation-model-v1.2-2024-05-10
Stable
None
None
Production-ready candidate powered by Gemini 1.5 Pro LLM and newly developed technologies, including specialized language and vision models. Also includes advanced OCR features such as checkbox detection. Recommended for users who want to use the largest supported token limits or experiment with newer models.
pretrained-foundation-model-v1.3-2024-08-31
Stable
None
None
Production-ready candidate powered by the Gemini 1.5 Flash LLM. Also includes advanced OCR features such as checkbox detection. Recommended for those who want the lowest latency.
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
200
Note: Specific processor limits are in addition to
overall product quotas and limits .
Normalized data types
You can find more information in the
Enrichment & normalization , and
Create dataset pages.
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
asia-south1
asia-southeast1
australia-southeast1
eu
europe-west2
europe-west3
northamerica-northeast1
us
Note: Consult Supported regions
for a full list of processor version availability by region.
More information
Custom Extractor
Description
Extract general key-value pairs (entity and checkbox), tables, and generic entities from documents in addition to OCR text.
This processor applies advanced machine learning technologies to extract key-value pairs, checkboxes, and tables from documents more than 200 languages. This processor also leverages deep learning models to extract 11 generic entities that are common in various document types.
Category
Extract
Functions
OCR, Form Parsing, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
FORM_PARSER_PROCESSOR
Supported languages
Full list of languages
Language Name
BCP 47 Tag
Script
Handwriting supported
Afrikaans
af
Latn
Albanian
sq
Latn
Arabic
ar
Arab
Belarusian
be
Cyrl
Catalan
ca
Latn
Chinese
zh
Hani
Croatian
hr
Latn
Czech
cs
Latn
Danish
da
Latn
Dutch
nl
Latn
English
en
Latn
Estonian
et
Latn
Filipino
fil
Latn
Finnish
fi
Latn
French
fr
Latn
German
de
Latn
Hebrew
iw
Hebr
Hindi
hi
Deva
Hungarian
hu
Latn
Icelandic
is
Latn
Indonesian
id
Latn
Italian
it
Latn
Japanese
ja
Jpan
Korean
ko
Kore
Latvian
lv
Latn
Lithuanian
lt
Latn
Macedonian
mk
Cyrl
Malay
ms
Latn
Marathi
mr
Deva
Nepali
ne
Deva
Norwegian
no
Latn
Persian
fa
Arab
Polish
pl
Latn
Portuguese (Portugal & Brazil)
pt
Latn
Romanian
ro
Latn
Russian
ru
Cyrl
Serbian
sr
Cyrl
Slovak
sk
Latn
Slovenian
sl
Latn
Spanish
es
Latn
Swedish
sv
Latn
Tagalog
tl
Latn
Turkish
tr
Latn
Ukrainian
uk
Cyrl
Vietnamese
vi
Latn
Yiddish
yi
Hebr
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-form-parser-v1.0-2020-09-23
Stable
None
None
Legacy version. For best quality and full feature set, use the Form Parser v2.0.
pretrained-form-parser-v2.0-2022-11-10
Stable
Show fields
email
phone
url
date_time
address
person
organization
quantity
price
id
page_number
None
Recommended version. Supports generic entities and includes upgraded table, KVP, and checkbox model, as well as more than 200 languages.
pretrained-form-parser-v2.1-2023-06-26
Release Candidate
None
None
Public Preview version. Same model as v2.0 with native text extraction from digital PDF files enabled.
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
100
Note: Specific processor limits are in addition to
overall product quotas and limits .
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
asia-south1
asia-southeast1
australia-southeast1
eu
europe-west2
europe-west3
northamerica-northeast1
us
Note: Consult Supported regions
for a full list of processor version availability by region.
More information
Form Parser
Layout Parser
Description
Extracts document content elements (text, tables, and lists) and creates context-aware chunks.
Layout Parser extracts document content elements like text, tables, and lists, and creates context-aware chunks that facilitate information retrieval in generative AI and discovery applications.
Category
Extract
Functions
Layout Parsing, Document Chunking
Release stage
General availability
Access status
Public
lock_open
Type in API
LAYOUT_PARSER_PROCESSOR
Notes
This parser supports PDF, HTML and DOCX files.
Supported languages
Full list of languages
Language Name
BCP 47 Tag
Script
Handwriting supported
Afrikaans
af
Latn
Albanian
sq
Latn
Arabic
ar
Arab
Armenian
hy
Armn
Belarusian
be
Cyrl
Bangla
bn
Beng
Bengali
bn
Beng
Bulgarian
bg
Cyrl
Catalan
ca
Latn
Chinese
zh
Hani
Croatian
hr
Latn
Czech
cs
Latn
Danish
da
Latn
Dutch
nl
Latn
English
en
Latn
Estonian
et
Latn
Filipino
fil
Latn
Finnish
fi
Latn
French
fr
Latn
German
de
Latn
Greek
el
Grek
Gujarati
gu
Gujr
Hebrew
iw
Hebr
Hindi
hi
Deva
Hungarian
hu
Latn
Icelandic
is
Latn
Indonesian
id
Latn
Italian
it
Latn
Japanese
ja
Jpan
Kannada
kn
Knda
Khmer
km
Khmr
Korean
ko
Kore
Lao
lo
Laoo
Latvian
lv
Latn
Lithuanian
lt
Latn
Macedonian
mk
Cyrl
Malay
ms
Latn
Malayalam
ml
Mlym
Marathi
mr
Deva
Nepali
ne
Deva
Norwegian
no
Latn
Persian
fa
Arab
Polish
pl
Latn
Portuguese (Portugal & Brazil)
pt
Latn
Punjabi
pa
Guru
Romanian
ro
Latn
Russian
ru
Cyrl
Serbian
sr
Cyrl
Slovak
sk
Latn
Slovenian
sl
Latn
Spanish
es
Latn
Swedish
sv
Latn
Tagalog
tl
Latn
Tamil
ta
Taml
Telugu
te
Telu
Thai
th
Thai
Turkish
tr
Latn
Ukrainian
uk
Cyrl
Vietnamese
vi
Latn
Yiddish
yi
Hebr
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-layout-parser-v1.0-2024-06-03
Stable
None
None
Recommended version.
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
500
Note: Specific processor limits are in addition to
overall product quotas and limits .
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
More information
Layout Parser
Explore pretrained processors
Bank Statement Parser
Description
Extract from bank statements including name, account, transactions, etc.
Category
Pretrained
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
BANK_STATEMENT_PROCESSOR
Notes
If a page of a multi-page input file is the correct document type and one of the supported versions, the processor performs entity extraction on the first supported document. If the processor doesn't find any applicable documents in the input file, the processor returns an error message.
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-bankstatement-v1.0-2021-08-08
Stable
None
None
pretrained-bankstatement-v1.1-2021-08-13
Stable
None
None
pretrained-bankstatement-v2.0-2021-12-10
Stable
None
None
pretrained-bankstatement-v3.0-2022-05-16
Stable
None
None
This version assumes that the input file contains a single bank statement. Unlike the default version, this version does not check the input file for bank statements and will not return an error if no bank statements are found.
pretrained-bankstatement-v4.0-2023-07-31
Release Candidate
None
None
pretrained-bankstatement-v5.0-2023-12-06
Release Candidate
None
None
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
30
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
account_number
account_type
bank_address
bank_name
client_address
client_name
ending_balance
starting_balance
statement_date
statement_end_date
statement_start_date
table_item
table_item/transaction_deposit
table_item/transaction_deposit_date
table_item/transaction_deposit_description
table_item/transaction_withdrawal
table_item/transaction_withdrawal_date
table_item/transaction_withdrawal_description
Enriched fields
You can find more information in the
Enrichment & normalization page.
Normalized fields
You can find more information in the
Enrichment & normalization page.
Full list of normalized fields
ending_balance
starting_balance
statement_date
statement_end_date
statement_start_date
table_item/transaction_deposit
table_item/transaction_deposit_date
table_item/transaction_withdrawal
table_item/transaction_withdrawal_date
Uptraining
Human-in-the-Loop
Labeling Instructions
Open in new window.
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
W2 Parser
Description
Extract from Form W2, including employee, employer, wages, etc.
Category
Pretrained
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
FORM_W2_PROCESSOR
Notes
If a page of a multi-page input file is the correct document type and one of the supported versions, the processor performs entity extraction on the first supported document. If the processor doesn't find any applicable documents in the input file, the processor returns an error message.
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Supported form/versions
2020 (standard and customized versions)
2019 (standard and customized versions)
2018 (standard and customized versions)
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-w2-v1.0-2020-10-01
Stable
None
None
pretrained-w2-v1.1-2022-01-27
Stable
None
None
pretrained-w2-v1.2-2022-01-28
Stable
Show fields
AllocatedTips
ControlNumber
DependentCareBenefits
EIN
EmployeeAddress
EmployeeName
EmployerNameAndAddress
EmployerStateIdNumber_Line1
FederalIncomeTaxWithheld
FormYear
LocalIncomeTax_Line1
LocalityName_Line1
LocalWagesTipsEtc_Line1
MedicareTaxWithheld
MedicareWagesAndTips
NonqualifiedPlans
SocialSecurityTaxWithheld
SocialSecurityTips
SocialSecurityWages
SSN
State_Line1
StateIncomeTax_Line1
StateWagesTipsEtc_Line1
WagesTipsOtherCompensation
None
Quality improvements and supporting new fields; does not include splitter.
pretrained-w2-v2.0-2022-03-30
Release Candidate
Show fields
AllocatedTips
ControlNumber
DependentCareBenefits
EIN
EmployeeAddress_AdditionalStreetAddressOrPostalBox
EmployeeAddress_City
EmployeeAddress_State
EmployeeAddress_StreetAddressOrPostalBox
EmployeeAddress_Zip
EmployeeName_FirstName
EmployeeName_LastName
EmployeeName_MiddleNameOrInitial
EmployerAddress_AdditionalStreetAddressOrPostalBox
EmployerAddress_City
EmployerAddress_State
EmployerAddress_StreetAddressOrPostalBox
EmployerAddress_Zip
EmployerName
EmployerStateIdNumber_Line1
FederalIncomeTaxWithheld
FormYear
LocalIncomeTax_Line1
LocalWagesTipsEtc_Line1
LocalityName_Line1
MedicareTaxWithheld
MedicareWagesAndTips
NonqualifiedPlans
SSN
SocialSecurityTaxWithheld
SocialSecurityTips
SocialSecurityWages
StateIncomeTax_Line1
StateWagesTipsEtc_Line1
State_Line1
WagesTipsOtherCompensation
a_Code
a_Value
b_Code
b_Value
c_Code
c_Value
d_Code
d_Value
None
Quality improvements and support for box 12 fields and fine-grained predictions of EmployeeName
, EmployeeAddress
, and EmployerNameAndAddress
, all of which are no longer part of the output and are replaced with additional fields.
pretrained-w2-v2.1-2022-06-08
Stable
Show fields
AllocatedTips
ControlNumber
DependentCareBenefits
EIN
EmployeeAddress_AdditionalStreetAddressOrPostalBox
EmployeeAddress_City
EmployeeAddress_State
EmployeeAddress_StreetAddressOrPostalBox
EmployeeAddress_Zip
EmployeeName_FirstName
EmployeeName_LastName
EmployeeName_MiddleNameOrInitial
EmployeeName_Suffix
EmployerAddress_AdditionalStreetAddressOrPostalBox
EmployerAddress_City
EmployerAddress_State
EmployerAddress_StreetAddressOrPostalBox
EmployerAddress_Zip
EmployerName
EmployerStateIdNumber_Line1
FederalIncomeTaxWithheld
FormYear
LocalIncomeTax_Line1
LocalWagesTipsEtc_Line1
LocalityName_Line1
MedicareTaxWithheld
MedicareWagesAndTips
NonqualifiedPlans
SSN
SocialSecurityTaxWithheld
SocialSecurityTips
SocialSecurityWages
StateIncomeTax_Line1
StateWagesTipsEtc_Line1
State_Line1
WagesTipsOtherCompensation
a_Code
a_Value
b_Code
b_Value
c_Code
c_Value
d_Code
d_Value
None
Similar to version pretrained-w2-v2.0-2022-03-30
with further quality enhancements and introducing one more entity EmployeeName_Suffix
.
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
15
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
ControlNumber
EIN
EmployeeAddress
EmployeeName
EmployerNameAndAddress
FederalIncomeTaxWithheld
MedicareTaxWithheld
MedicareWagesAndTips
SSN
SocialSecurityTaxWithheld
SocialSecurityWages
WagesTipsOtherCompensation
Enriched fields
You can find more information in the
Enrichment & normalization page.
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
US Passport Parser
Description
Extract fields such as names, document ID, date of birth, etc.
Category
Pretrained
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
US_PASSPORT_PROCESSOR
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-us-passport-v1.0-2021-06-14
Stable
None
None
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
2
Maximum pages (batch/offline/asynchronous requests):
2
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
Family Name
Given Names
Document Id
Expiration Date
Date Of Birth
Issue Date
MRZ Code
Portrait
Normalized fields
You can find more information in the
Enrichment & normalization page.
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
Utility Parser
Description
Extract text and values from utility bills such as supplier name and previous paid amount.
Category
Pretrained
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Limited
security
[*]
Type in API
UTILITY_PROCESSOR
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-utility-v1.1-2021-04-09
Stable
None
None
pretrained-utility-v1.2-2022-12-15
Release Candidate
None
None
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
10
Maximum pages (batch/offline/asynchronous requests):
200
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
adjusted_amount
amount_due
balance_transfer_amount
carrier
currency
currency_exchange_rate
delivery_date
deposit_credited_amount
due_date
freight_amount
invoice_date
invoice_id
late_fee_amount
line_item
line_item/amount
line_item/description
line_item/frequency
line_item/product_code
line_item/purchase_order
line_item/quantity
line_item/service_address
line_item/service_end_date
line_item/service_id_1
line_item/service_id_2
line_item/service_start_date
line_item/supplier_account_number
line_item/tax_amount
line_item/unit_number
line_item/unit_of_measure
line_item/unit_price
line_item/usage
net_amount
payment_terms
prior_amount_due
prior_paid_amount
purchase_order
receiver_address
receiver_email
receiver_name
receiver_phone
receiver_tax_id
receiver_website
reclaimed_water
remit_to_address
remit_to_name
service
service/service_end_date
service/service_id
service/service_start_date
service/unit_of_measure
service/usage
service_address
service_end_date
service_id
service_start_date
ship_from_address
ship_from_name
ship_to_address
ship_to_name
supplier_account_number
supplier_address
supplier_email
supplier_iban
supplier_name
supplier_payment_ref
supplier_phone
supplier_registration
supplier_tax_id
supplier_website
tampering
total_amount
total_tax_amount
usage
vat
vat/amount
vat/category_code
vat/tax_amount
vat/tax_rate
Normalized fields
You can find more information in the
Enrichment & normalization page.
Full list of normalized fields
adjusted_amount
amount_due
balance_transfer_amount
currency
currency_exchange_rate
delivery_date
due_date
invoice_date
late_fee_amount
line_item/amount
line_item/quantity
line_item/tax_amount
line_item/unit_price
net_amount
prior_amount_due
prior_paid_amount
total_amount
total_tax_amount
Uptraining
Human-in-the-Loop
Labeling Instructions
Open in new window.
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
Identity Document Proofing Parser
Description
Predict the validity of ID documents using multiple signals.
Identity Document Proofing Processor is designed to help predict the validity of ID documents with four different signals.
The processor currently returns information from the following signals:
fraud_signals_is_identity_document
detection: Predicts whether an image contains a recognized identity document. fraud_signals_suspicious_words
detection: Predicts whether words are present that aren't typical on IDs. fraud_signals_image_manipulation
detection: Predicts whether the image was altered or tampered with an image editing tool. fraud_signals_online_duplicate
detection: Predicts whether the image can be found online (US only).
Category
Pretrained
Functions
OCR, Quality Analysis
Release stage
Preview
Access status
Public
lock_open
Type in API
ID_PROOFING_PROCESSOR
Notes
The Online Duplicate Detection feature is currently processed in US data centers. Regional and multi-regional support is unavailable for this feature outside of the US.
This processor is supported by algorithms that are updated more frequently than new processor versions are released. For this reason, the processor might return different outputs over time even when using the same processor version. For example, the Online Duplicate Detection system monitors images present on the web. The system's behavior can then change more quickly than can be tracked in processor versions.
Refer to notes on Responsible AI[†] and Human review.[‡]
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Supported form/versions
Support for US passports, passcards and driver's licenses.
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-id-proofing-v1.0-2022-10-03
Stable
None
None
pretrained-id-proofing-v1.1-2023-05-18
Release Candidate
Show fields
fraud_signals_photocopy_detection
None
Additional photocopy detection signal
pretrained-id-proofing-v1.2-2023-10-04
Release Candidate
Show fields
fraud_signals_photocopy_detection
None
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
2
Maximum pages (batch/offline/asynchronous requests):
2
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
fraud_signals_is_identity_document
fraud_signals_suspicious_words
evidence_suspicious_word
evidence_inconclusive_suspicious_word
fraud_signals_image_manipulation
fraud_signals_online_duplicate (US only)
fraud_signals_photocopy_detection
evidence_hostname (US only)
evidence_thumbnail_url (US only)
Normalized fields
You can find more information in the
Enrichment & normalization page.
Full list of normalized fields
fraud_signals_image_manipulation
fraud_signals_online_duplicate (US only)
fraud_signals_is_identity_document
fraud_signals_suspicious_words
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
Pay Slip Parser
Description
Extract from pay slips, including name, business, amounts, etc.
Category
Pretrained
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
PAYSTUB_PROCESSOR
Notes
If the multi-page input document contains more than one valid pay slips, the processor extracts entities from only the first valid pay slip. If no pay slips are found in the input file, the processor returns an error message.
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-paystub-v1.0-2021-03-19
Stable
None
None
pretrained-paystub-v1.1-2021-08-13
Stable
Show fields
net_pay
net_pay_ytd
employee_account_number
None
Quality improvement and new fields support;
pretrained-paystub-v1.2-2021-12-10
Stable
None
None
pretrained-paystub-v2.0-2022-05-17
Release Candidate
Show fields
deduction_item
deduction_item/deduction_type
deduction_item/deduction_this_period
deduction_item/deduction_ytd
direct_deposit_item
direct_deposit_item/direct_deposit
direct_deposit_item/employee_account_number
earning_item
earning_item/earning_type
earning_item/earning_rate
earning_item/earning_hours
earning_item/earning_this_period
earning_item/earning_ytd
page_number
tax_item
tax_item/tax_type
tax_item/tax_this_period
tax_item/tax_ytd
federal_additional_tax
federal_allowance
federal_marital_status
state_additional_tax
state_allowance
state_marital_status
None
This version assumes that the input file contains a single pay slip. Unlike the default version, this version does not check the input file for pay slips and will not return an error if no pay slips are found.
Quality improvement, new fields support and new schema. Bonus, Commissions, Holiday, Overtime, Regular Pay and Vacation are now part of earning_item/earning_this_period, and their year-to-date versions are in earning_item/earning_ytd. Direct Deposit and Employee Account Number are now nested under direct_deposit_item.
Async page limit is 10.
pretrained-paystub-v2.0-2022-07-22
Stable
None
None
Quality improvement and uptraining enhancements.
pretrained-paystub-v3.0-2023-12-06
Release Candidate
None
None
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
50
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
bonus
bonus_ytd
commissions
commissions_ytd
direct_deposit
employee_account_number (Added in "pretrained-paystub-v1.1-2021-08-13")
employee_address
employee_name
employer_address
employer_name
end_date
gross_earnings
gross_earnings_ytd
holiday
holiday_ytd
net_pay (Added in "pretrained-paystub-v1.1-2021-08-13")
net_pay_ytd (Added in "pretrained-paystub-v1.1-2021-08-13")
overtime
overtime_ytd
pay_date
regular_pay
regular_pay_ytd
ssn
start_date
vacation
vacation_ytd
Enriched fields
You can find more information in the
Enrichment & normalization page.
Normalized fields
You can find more information in the
Enrichment & normalization page.
Full list of normalized fields
bonus
bonus_ytd
commissions
commissions_ytd
direct_deposit
end_date
gross_earnings
gross_earnings_ytd
holiday
holiday_ytd
net_pay
net_pay_ytd
overtime
overtime_ytd
pay_date
regular_pay
regular_pay_ytd
start_date
vacation
vacation_ytd
Uptraining
Human-in-the-Loop
Labeling Instructions
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
US Driver License Parser
Description
Extract fields such as names, document ID, date of birth, etc.
Category
Pretrained
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
US_DRIVER_LICENSE_PROCESSOR
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Supported form/versions
Supports all 50 States and D.C.
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-us-driver-license-v1.0-2021-06-14
Stable
None
None
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
2
Maximum pages (batch/offline/asynchronous requests):
2
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
Family Name
Given Names
Document Id
Expiration Date
Date Of Birth
Issue Date
Address
Portrait
Normalized fields
You can find more information in the
Enrichment & normalization page.
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
Expense Parser
Description
Extract text and values from expense documents such as expense date, supplier name, total amount, and currency.
Category
Pretrained
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
EXPENSE_PROCESSOR
Supported languages
Full list of languages
Language Name
BCP 47 Tag
Script
Handwriting supported
German
de
Latn
English
en
Latn
Spanish
es
Latn
French
fr
Latn
Japanese
ja
Jpan
Dutch
nl
Latn
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-expense-v1.1-2021-04-09
Stable
None
None
Launched in April 2021. Deprecation is planned soon.
pretrained-expense-v1.2-2022-02-18
Stable
None
None
Note: Will be discontinued in the United States (US) and European Union (EU) starting Effective April 30, 2025.
pretrained-expense-v1.3-2022-07-15
Stable
Show fields
credit_card_last_four_digits
line_item/quantity
payment_type
Support for hotel and car rental folios. Note: Will be discontinued in the United States (US) and European Union (EU) starting Effective April 30, 2025.
pretrained-expense-v1.3.2-2024-09-11
Release Candidate
Show fields
credit_card_last_four_digits
line_item/quantity
payment_type
An upgrade to v1.3 with an enhanced underlying vision model.
pretrained-expense-v1.4-2022-11-18
Release Candidate
Show fields
traveler_name
reservation_id
line_item/transaction_date
ja
: Japanese
it
: Italian
pt
: Portuguese (Portugal & Brazil)
Performance improvements and support for uptraining. Maximum pages (online/synchronous requests) limit has been increased to 15. Note: Will be discontinued in the United States (US) and European Union (EU) starting Effective April 30, 2025.
pretrained-expense-v1.4.2-2024-09-12
Release Candidate
Show fields
traveler_name
reservation_id
line_item/transaction_date
ja
: Japanese
it
: Italian
pt
: Portuguese (Portugal & Brazil)
An upgrade to v1.4 with an enhanced underlying vision model.
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
10
Maximum pages (batch/offline/asynchronous requests):
10
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
credit_card_last_four_digits
currency
end_date
net_amount
payment_type
purchase_time
receipt_date
start_date
supplier_address
supplier_city
supplier_name
tip_amount
total_amount
total_tax_amount
line_item
line_item/amount
line_item/description
line_item/product_code
Enriched fields
You can find more information in the
Enrichment & normalization page.
Normalized fields
You can find more information in the
Enrichment & normalization page.
Full list of normalized fields
currency
total_amount
total_tax_amount
net_amount
receipt_date
purchase_time
start_date
end_date
line_item/amount
line_item/payment_date
line_item/payment_amount
Uptraining
Human-in-the-Loop
Labeling Instructions
Open in new window.
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
asia-southeast1
australia-southeast1
eu
northamerica-northeast1
us
Note: Consult Supported regions
for a full list of processor version availability by region.
Invoice Parser
Description
Extract text and values from invoices such as invoice number, supplier name, invoice amount, tax amount, invoice date, due date.
The invoice Parser extracts both header and line item fields, such as invoice number, supplier name, invoice amount, tax amount, invoice date, due date, and line item amounts.
Category
Pretrained
Functions
OCR, Entity Extraction
Release stage
General availability
Access status
Public
lock_open
Type in API
INVOICE_PROCESSOR
Supported languages
Full list of languages
Language Name
BCP 47 Tag
Script
Handwriting supported
German
de
Latn
English
en
Latn
Spanish
es
Latn
Estonian
et
Latn
French
fr
Latn
Italian
it
Latn
Latvian
lv
Latn
Lithuanian
lt
Latn
Dutch
nl
Latn
Portuguese (Portugal & Brazil)
pt
Latn
Romanian
ro
Latn
Swedish
sv
Latn
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-invoice-v1.1-2021-04-09
Stable
None
None
pretrained-invoice-v1.2-2022-02-18
Stable
None
None
Deprecation is planned soon.
pretrained-invoice-v1.3-2022-07-15
Stable
None
it
: Italian
pt
: Portuguese (Portugal & Brazil)
ro
: Romanian
sv
: Swedish
et
: Estonian
lv
: Latvian
lt
: Lithuanian
Uptrainable processor version. Maximum pages (online/synchronous requests) has been increased to 15.
pretrained-invoice-v1.4-2022-10-21
Release Candidate
None
None
Uptrainable processor version. Maximum pages (online/synchronous requests) has been increased to 15.
pretrained-invoice-v1.5-2023-09-15
Release Candidate
None
None
pretrained-invoice-v2.0-2023-12-06
Release Candidate
None
None
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
200
Note: Specific processor limits are in addition to
overall product quotas and limits .
Fields detected in the earliest version
You can also find this information in the
Field detected page.
Full list of fields
amount_paid_since_last_invoice
carrier
currency
currency_exchange_rate
delivery_date
due_date
freight_amount
invoice_date
invoice_id
line_item
line_item/amount
line_item/description
line_item/product_code
line_item/purchase_order
line_item/quantity
line_item/unit
line_item/unit_price
net_amount
payment_terms
purchase_order
receiver_address
receiver_email
receiver_name
receiver_phone
receiver_tax_id
receiver_website
remit_to_address
remit_to_name
ship_from_address
ship_from_name
ship_to_address
ship_to_name
supplier_address
supplier_email
supplier_iban
supplier_name
supplier_payment_ref
supplier_phone
supplier_registration
supplier_tax_id
supplier_website
total_amount
total_tax_amount
vat
vat/amount
vat/category_code
vat/tax_amount
vat/tax_rate
Enriched fields
You can find more information in the
Enrichment & normalization page.
Normalized fields
You can find more information in the
Enrichment & normalization page.
Full list of normalized fields
amount_paid_since_last_invoice
currency
currency_exchange_rate
delivery_date
due_date
freight_amount
invoice_date
net_amount
total_amount
total_tax_amount
line_item/amount
line_item/quantity
line_item/unit_price
vat/amount
vat/tax_amount
vat/tax_rate
Uptraining
Human-in-the-Loop
Labeling Instructions
Open in new window.
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
asia-south1
asia-southeast1
australia-southeast1
eu
northamerica-northeast1
us
Note: Consult Supported regions
for a full list of processor version availability by region.
Classify documents
Custom Classifier
Description
Train a model to classify a document type from a set of classes.
Category
Classify
Functions
OCR, Classification
Release stage
General availability
Access status
Public
lock_open
Type in API
CUSTOM_CLASSIFICATION_PROCESSOR
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
200
Note: Specific processor limits are in addition to
overall product quotas and limits .
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
asia-south1
asia-southeast1
australia-southeast1
eu
europe-west2
europe-west3
northamerica-northeast1
us
Note: Consult Supported regions
for a full list of processor version availability by region.
More information
Create a custom classification processor
Custom Splitter
Description
Train a model to split a file containing multiple documents into individual, classified documents.
Category
Classify
Functions
OCR, Classification, Splitting
Release stage
General availability
Access status
Public
lock_open
Type in API
CUSTOM_SPLITTING_PROCESSOR
Notes
i18n can be supported through custom training options only.
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
1000
Note: Specific processor limits are in addition to
overall product quotas and limits .
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
asia-south1
asia-southeast1
australia-southeast1
eu
europe-west2
europe-west3
northamerica-northeast1
us
Note: Consult Supported regions
for a full list of processor version availability by region.
More information
Create a custom splitter processor
Summarize documents
Summarizer
Description
Get abstract and bullet point summaries for short and long documents.
Category
pretrained
Functions
Summarize
Release stage
Preview
Access status
Public
lock_open
Type in API
SUMMARY_PROCESSOR
Supported languages
Language Name
BCP 47 Tag
Script
Handwriting supported
English
en
Latn
Processor versions
Version ID
Release Channel
Additional fields detected
Additional languages supported
Description
pretrained-foundation-model-v1.0-2023-08-22
Stable
None
None
Google foundation model
For more information, see Managing processor versions.
Quotas and limits
Maximum pages (online/synchronous requests):
15
Maximum pages (batch/offline/asynchronous requests):
250
Note: Specific processor limits are in addition to
overall product quotas and limits .
Uptraining
Human-in-the-Loop
Sample Input File
Open in new window.
Sample Output
Open in new window.
Supported regions
Note: Consult Supported regions
for a full list of processor version availability by region.
More information
Custom summarizer
Send feedback
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License , and code samples are licensed under the Apache 2.0 License . For details, see the Google Developers Site Policies . Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-02-11 UTC.
Need to tell us more?
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-02-11 UTC."],[],[]]