Language support

The Document AI API's text recognition (OCR) feature detects a wide variety of languages, including multiple languages within a single document.

Each language detected by the Document AI API is returned in the detectedLanguages field of the Document object as a BCP-47 identifier.
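As a rough illustration of reading that field, the sketch below walks a Document that has already been parsed from JSON into a Python dict and collects the detected language codes. It assumes the page-level layout pages[].detectedLanguages[] with languageCode and confidence keys; the helper name summarize_detected_languages and the sample data are hypothetical and not part of the API.

```python
def summarize_detected_languages(document: dict) -> dict[str, float]:
    """Return the highest page-level confidence seen for each language code.

    Assumes `document` is a Document parsed from JSON, with entries under
    pages[].detectedLanguages[] carrying `languageCode` and `confidence`.
    """
    best: dict[str, float] = {}
    for page in document.get("pages", []):
        for lang in page.get("detectedLanguages", []):
            code = lang.get("languageCode")
            if not code:
                continue  # skip entries without a BCP-47 code
            confidence = float(lang.get("confidence", 0.0))
            best[code] = max(confidence, best.get(code, 0.0))
    return best


# Hypothetical sample data, shaped like the assumed JSON layout.
sample_document = {
    "pages": [
        {
            "pageNumber": 1,
            "detectedLanguages": [{"languageCode": "en", "confidence": 0.98}],
        },
        {
            "pageNumber": 2,
            "detectedLanguages": [
                {"languageCode": "fr", "confidence": 0.91},
                {"languageCode": "en", "confidence": 0.05},
            ],
        },
    ]
}

print(summarize_detected_languages(sample_document))
# → {'en': 0.98, 'fr': 0.91}
```

Keeping the maximum confidence per code is only one possible way to summarize the result; a per-page breakdown may suit documents that switch languages between pages.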

For the list of languages and scripts supported by Document OCR (Optical Character Recognition), refer to the Cloud Vision OCR Language Support documentation.

Other processors may support a limited set of languages. See the tables below or the full Processor List for details.

General processors

Processor Supported Languages
Form Parser
  • af: Afrikaans
  • sq: Albanian
  • ca: Catalan
  • hr: Croatian
  • cs: Czech
  • da: Danish
  • nl: Dutch
  • en: English
  • et: Estonian
  • tl: Filipino
  • fi: Finnish
  • fr: French
  • de: German
  • hu: Hungarian
  • is: Icelandic
  • id: Indonesian
  • it: Italian
  • lv: Latvian
  • lt: Lithuanian
  • ms: Malay
  • no: Norwegian
  • pl: Polish
  • pt: Portuguese (Brazilian & Continental)
  • ro: Romanian
  • sr: Serbian
  • sk: Slovak
  • sl: Slovenian
  • es: Spanish
  • sv: Swedish
  • tr: Turkish
  • vi: Vietnamese

Contract processors

Processor Supported Languages
Contract Parser
  • en: English

Identity processors

Processor Supported Languages
US Driver License Parser
  • en: English
US Passport Parser
  • en: English

Lending processors

Processor Supported Languages
1003 Parser
  • en: English
1040 Parser
  • en: English
1040 Schedule C Parser
  • en: English
1040 Schedule D Parser
  • en: English
1040 Schedule E Parser
  • en: English
1099-DIV Parser
  • en: English
1099-G Parser
  • en: English
1099-INT Parser
  • en: English
1099-NEC Parser
  • en: English
1099-R Parser
  • en: English
1065 Parser
  • en: English
1120 Parser
  • en: English
1120S Parser
  • en: English
Bank Statement Parser
  • en: English
HOA Statement Parser
  • en: English
HUD-92900B Parser
  • en: English
Lending Document Splitter & Classifier
  • en: English
Mortgage Statement Parser
  • en: English
Pay Slip Parser
  • en: English
Retirement/Investment Statement Parser
  • en: English
SSA-89 Parser
  • en: English
SSA-1099 Parser
  • en: English
VBA26-0551 Parser
  • en: English
W2 Parser
  • en: English
W9 Parser
  • en: English

Procurement processors

Processor Supported Languages
Expense Parser
  • nl: Dutch
  • en: English
  • fr: French
  • de: German
  • es: Spanish
Invoice Parser
  • nl: Dutch
  • en: English
  • et: Estonian
  • fr: French
  • de: German
  • lv: Latvian
  • lt: Lithuanian
  • es: Spanish
Procurement Document Splitter & Classifier
  • en: English
Utility Parser
  • en: English
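
The tables above can be turned into a simple lookup for checking a detected language before routing a document to a processor. The sketch below is a minimal example under stated assumptions: the SUPPORTED mapping copies only three of the processors listed above, the function names are hypothetical, and matching on the primary BCP-47 subtag is a simplification (for example, it would not equate "fil" with "tl" for Filipino).

```python
# Subset of the tables above, keyed by processor name (copied by hand).
SUPPORTED = {
    "Expense Parser": {"nl", "en", "fr", "de", "es"},
    "Invoice Parser": {"nl", "en", "et", "fr", "de", "lv", "lt", "es"},
    "Utility Parser": {"en"},
}


def primary_subtag(tag: str) -> str:
    """Return the lowercase primary language subtag of a BCP-47 tag."""
    return tag.replace("_", "-").split("-")[0].lower()


def is_supported(processor: str, language_tag: str) -> bool:
    """Check a language tag against the hand-copied table for a processor.

    Unknown processors are treated as supporting no languages.
    """
    return primary_subtag(language_tag) in SUPPORTED.get(processor, set())


print(is_supported("Invoice Parser", "et"))     # → True
print(is_supported("Expense Parser", "et"))     # → False
print(is_supported("Utility Parser", "en-US"))  # → True
```

Because the mapping is copied by hand, it needs to be kept in sync with the published tables; the full Processor List remains the reference.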