Language support
The Document AI API's text recognition (OCR) feature can detect text in a wide variety of languages, including multiple languages within a single document.
Detected languages are returned in the Document object, in the detectedLanguages field, as BCP-47 identifiers.
For more information about OCR Language support, refer to the Cloud Vision OCR Language Support documentation.
See the tables below or the full Processor List for details.
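As an illustration of reading the detectedLanguages field, the following minimal sketch works on a Document that has already been parsed from JSON into a Python dictionary. It assumes that each entry in `pages` carries a `detectedLanguages` list whose items have `languageCode` and `confidence` keys; the `collect_detected_languages` helper and the `sample_document` fragment are hypothetical and written by hand for this example, not taken from a real response.

```python
from collections import defaultdict


def collect_detected_languages(document: dict) -> dict:
    """Map each BCP-47 language code to its highest page-level confidence.

    Assumes the JSON shape described in the lead-in: document["pages"][i]
    may contain a "detectedLanguages" list of {"languageCode", "confidence"}.
    """
    best = defaultdict(float)
    for page in document.get("pages", []):
        for lang in page.get("detectedLanguages", []):
            code = lang.get("languageCode")
            if code:
                # Keep the strongest signal seen for this language on any page.
                best[code] = max(best[code], lang.get("confidence", 0.0))
    return dict(best)


# Hypothetical, hand-written response fragment for illustration only.
sample_document = {
    "pages": [
        {
            "detectedLanguages": [
                {"languageCode": "en", "confidence": 0.9},
                {"languageCode": "fr", "confidence": 0.1},
            ]
        },
        {"detectedLanguages": [{"languageCode": "fr", "confidence": 0.75}]},
    ]
}

if __name__ == "__main__":
    for code, confidence in collect_detected_languages(sample_document).items():
        print(f"{code}: {confidence:.2f}")
```

Keeping the maximum confidence per language is only one possible aggregation; a per-page breakdown may be more useful when a document mixes languages across pages.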
General processors
Processor | Supported Languages |
---|---|
Document OCR (Optical Character Recognition) | |
Form Parser | |
Specialized processors
Processor | Supported Languages |
---|---|
Contract Parser | |
Identity Document Proofing Parser | |
US Driver License Parser | |
US Passport Parser | |
1003 Parser | |
1040 Parser | |
1040 Schedule C Parser | |
1040 Schedule D Parser | |
1040 Schedule E Parser | |
1099-DIV Parser | |
1099-G Parser | |
1099-INT Parser | |
1099-NEC Parser | |
1099-R Parser | |
1065 Parser | |
1120 Parser | |
1120S Parser | |
Bank Statement Parser | |
HOA Statement Parser | |
HUD-92900B Parser | |
Lending Document Splitter & Classifier | |
Mortgage Statement Parser | |
Pay Slip Parser | |
Retirement/Investment Statement Parser | |
SSA-89 Parser | |
SSA-1099 Parser | |
VBA26-0551 Parser | |
W2 Parser | |
W9 Parser | |
Expense Parser | |
Invoice Parser | |
Procurement Document Splitter & Classifier | |
Purchase Order Parser | |
Utility Parser | |
Custom processors
Processor | Supported Languages |
---|---|
Custom Document Extractor | |
Custom Document Classifier | |
Custom Document Splitter | |
Handwriting recognition
The following languages are supported for handwriting recognition.
af: Afrikaans
sq: Albanian
be: Belarusian
bn: Bengali
bg: Bulgarian
ca: Catalan
zh: Chinese
hr: Croatian
cs: Czech
da: Danish
nl: Dutch
et: Estonian
tl: Filipino
fi: Finnish
de: German
el: Greek
hi: Hindi
hu: Hungarian
is: Icelandic
id: Indonesian
it: Italian
ja: Japanese
ko: Korean
lv: Latvian
lt: Lithuanian
mk: Macedonian
ms: Malay
mr: Marathi
ne: Nepali
pl: Polish
pt: Portuguese (Brazilian & Continental)
ro: Romanian
ru: Russian
sr: Serbian
sk: Slovak
sl: Slovenian
es: Spanish
sv: Swedish
tr: Turkish
uk: Ukrainian
vi: Vietnamese
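The list above can be turned into a quick lookup, for example to decide whether a detected language is worth routing to handwriting recognition. This is a minimal sketch, not part of the Document AI API: the `HANDWRITING_LANGUAGES` set and the `supports_handwriting` helper are hypothetical names, and matching a BCP-47 tag on its primary language subtag only (so that `pt-BR` matches `pt`) is an assumption made for this example.

```python
# Language codes copied from the handwriting recognition list above.
HANDWRITING_LANGUAGES = {
    "af", "sq", "be", "bn", "bg", "ca", "zh", "hr", "cs", "da",
    "nl", "et", "tl", "fi", "de", "el", "hi", "hu", "is", "id",
    "it", "ja", "ko", "lv", "lt", "mk", "ms", "mr", "ne", "pl",
    "pt", "ro", "ru", "sr", "sk", "sl", "es", "sv", "tr", "uk",
    "vi",
}


def supports_handwriting(language_tag: str) -> bool:
    """Return True if the tag's primary subtag is in the list above.

    Assumption: region and script subtags (for example "BR" in "pt-BR")
    are ignored, and underscores are accepted as separators.
    """
    primary = language_tag.replace("_", "-").split("-")[0].lower()
    return primary in HANDWRITING_LANGUAGES


if __name__ == "__main__":
    for tag in ("pt-BR", "zh-Hant", "ar"):
        print(tag, supports_handwriting(tag))
```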