Custom Document Classifier is now generally available and ready for production use cases.

Document AI

Extract structured data from documents, then analyze, search, and store that data. The Document AI solutions suite includes pre-trained models for data extraction, Document AI Workbench for creating new custom models or uptraining existing ones, and Document AI Warehouse for searching and storing documents.

  • Manage the entire unstructured document lifecycle in one unified solution

  • Reduce manual document processing, minimize setup costs, and accelerate deployment

  • Ensure a high level of accuracy with Google's AI and Human-in-the-Loop (HITL) reviews

  • Use your document data to gain new insights about your products and meet customer expectations


Cost-effective and flexible

Improve operational efficiency by extracting structured data from unstructured documents and making that structured data available to your business apps and users.

Ensure your data is accurate and compliant

Automate and validate all your documents to streamline compliance workflows, reduce guesswork, and keep data accurate and compliant.

Use your data to meet customer expectations

Leverage insights to meet customer expectations and improve CSAT, advocacy, lifetime value, and spend.



Key features

A unified platform to meet all your document processing needs

Process documents from a unified console

The Document AI platform is a unified console for document processing that gives you quick access to all models and tools. Use Document AI's pre-trained models for document processing, including basic extractors like OCR and Form Parser, and specialized models for industry use cases like lending, contracts, procurement, and identity documents. Or use Document AI Workbench to uptrain these models on your business documents, or to create your own models, and get better results for documents from your organization. With Document AI Warehouse, you can search, store, and manage documents, and even trigger workflows. Document AI lets you automate and validate documents to streamline workflows, reduce guesswork, and keep data accurate and compliant.

Leverage Google's state-of-the-art AI

Document AI is built on decades of AI innovation at Google, bringing powerful and useful solutions to document processing challenges. Under the hood are Google's industry-leading technologies: computer vision (including OCR) and natural language processing (NLP), which underpin the pre-trained models for high-value, high-volume documents; the latest ML research and toolkits, which power Document AI Workbench; and semantic search, which sets Document AI Warehouse apart from traditional document repositories.

Enrich data to make it more useful

Validate and enrich parsed information with Google Knowledge Graph technology to make the data even more useful by checking company names, addresses, phone numbers, and other details against entities on the internet.

Integrate human review into ML predictions

Human-in-the-Loop AI is a new Document AI feature that helps companies achieve higher document processing accuracy with the assurance of human review. Purpose-built review tools let reviewers check and correct predictions, which can increase accuracy and help businesses interpret the results.
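
The idea of sending uncertain predictions to a reviewer can be sketched in a few lines of Python. This is an illustration only, not the Human-in-the-Loop feature itself: it assumes entities shaped like the `type`, `mentionText`, and `confidence` fields of a Document AI JSON response, and the `REVIEW_THRESHOLD` value and `split_for_review` helper are hypothetical names introduced here.

```python
# Hypothetical cutoff chosen for illustration; tune it per document type.
REVIEW_THRESHOLD = 0.8


def split_for_review(entities, threshold=REVIEW_THRESHOLD):
    """Split entities into (auto_accepted, needs_review) by confidence."""
    auto_accepted, needs_review = [], []
    for entity in entities:
        # Treat a missing confidence as 0.0 so the entity is always reviewed.
        confidence = float(entity.get("confidence", 0.0))
        if confidence >= threshold:
            auto_accepted.append(entity)
        else:
            needs_review.append(entity)
    return auto_accepted, needs_review


# Made-up sample entities, assumed to mirror the response's entity fields.
sample_review_entities = [
    {"type": "invoice_id", "mentionText": "INV-1001", "confidence": 0.97},
    {"type": "total_amount", "mentionText": "1,250.00", "confidence": 0.62},
    {"type": "supplier_name", "mentionText": "Acme Corp"},
]

auto_accepted, needs_review = split_for_review(sample_review_entities)
print("accepted:", [e["type"] for e in auto_accepted])
print("review:  ", [e["type"] for e in needs_review])
```

A single global threshold is the simplest possible policy; in practice a per-field threshold may fit better, since some fields (totals, account numbers) are costlier to get wrong than others.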



Google Cloud Basics
Document AI overview

Get an overview of the basics of Document AI, including extracting text from documents, classifying documents, and entity extraction.

Document AI introduction videos & labs

Get started learning about Document AI with our video series "The Future of Documents" and step-by-step codelabs.

Setting up the Document AI API

This guide provides all required setup steps to start using Document AI.

Use cases

Use case
Perform Optical Character Recognition

In this codelab, learn how to perform Optical Character Recognition using the Document AI API with Python.
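
For orientation, here is a minimal, standard-library-only sketch of what a `process` request to the Document AI REST API looks like. It does not reproduce the codelab, which uses the Python client library. The helper name `build_process_request` and the project, location, and processor values are placeholders introduced for this example, and the request is only constructed, not sent; sending it would additionally require an OAuth 2.0 bearer token.

```python
import base64
import json


def build_process_request(project_id, location, processor_id, content,
                          mime_type="application/pdf"):
    """Return (url, json_body) for a `process` call without sending it."""
    url = (
        f"https://{location}-documentai.googleapis.com/v1/"
        f"projects/{project_id}/locations/{location}/"
        f"processors/{processor_id}:process"
    )
    body = {
        "rawDocument": {
            # The REST API expects the file bytes as a base64 string.
            "content": base64.b64encode(content).decode("ascii"),
            "mimeType": mime_type,
        }
    }
    return url, json.dumps(body)


# Placeholder identifiers and fake file bytes, for illustration only.
request_url, request_body = build_process_request(
    "my-project", "us", "my-processor-id", b"%PDF-1.4 sample"
)
print(request_url)
```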

Use case
Digitize text from documents

Use Document OCR to extract text at the level of blocks, paragraphs, lines, words, and symbols, and to correct page rotation. Use Form Parser to extract layout from forms.
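
In a processed document, layout elements such as lines and paragraphs do not carry their own text; they point into the document's full `text` field through text anchors. The sketch below resolves those anchors for the lines on each page. It assumes the JSON form of the response (camelCase field names, offsets encoded as strings, a zero `startIndex` omitted); the `anchor_text` and `page_lines` helpers and the sample document are made up for illustration.

```python
def anchor_text(document, layout):
    """Return the text a layout element points at via its text anchor."""
    segments = layout.get("textAnchor", {}).get("textSegments", [])
    parts = []
    for segment in segments:
        # JSON encodes int64 offsets as strings and omits a zero startIndex.
        start = int(segment.get("startIndex", 0))
        end = int(segment.get("endIndex", 0))
        parts.append(document["text"][start:end])
    return "".join(parts)


def page_lines(document):
    """Return the stripped text of every line on every page, in order."""
    return [
        anchor_text(document, line["layout"]).strip()
        for page in document.get("pages", [])
        for line in page.get("lines", [])
    ]


# Hand-written sample in the assumed response shape, not real API output.
sample_document = {
    "text": "Invoice 1001\nTotal due: 42.00\n",
    "pages": [{
        "lines": [
            {"layout": {"textAnchor": {"textSegments": [
                {"endIndex": "13"}]}}},
            {"layout": {"textAnchor": {"textSegments": [
                {"startIndex": "13", "endIndex": "30"}]}}},
        ]
    }],
}

for text_line in page_lines(sample_document):
    print(text_line)
```

The same `anchor_text` helper would apply to paragraphs, blocks, or tokens, since they are assumed to share the same layout structure.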

Use case
Process industry specific documents

Document AI offers pre-trained models for specific industry needs, such as lending forms for the mortgage industry, procurement documents, contract documents, and identity cards. These models power the most common yet highly complex document processing use cases.

Use case
Create a custom model specific to your business

Achieve higher document processing accuracy by creating a custom model or uptraining an existing model to meet your business needs with Document AI Workbench.

Use case
Manage documents and their AI extracted data

Search, store, govern, and manage documents and their AI-extracted and tagged data in a single platform with Document AI Warehouse.

Use case
Create a Custom Document Extractor

Learn how to use Document AI Workbench to create and train a Custom Document Extractor, using W-2 (US tax form) documents as an example.

Use case
Create Custom Document Classifiers

Create Custom Document Classifiers that identify documents from a user-defined set of classes. 
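
Once a classifier returns a confidence score for each class, picking the predicted class is a simple maximum. The sketch below assumes predictions arrive as entities whose `type` is the class label and whose `confidence` is the score; that shape, the `top_class` helper, and the sample labels are assumptions made for this illustration.

```python
def top_class(entities):
    """Return (label, confidence) for the highest-confidence class, or None."""
    if not entities:
        return None
    # A missing confidence is treated as 0.0 so it never wins by accident.
    best = max(entities, key=lambda e: float(e.get("confidence", 0.0)))
    return best["type"], float(best.get("confidence", 0.0))


# Made-up predictions over a user-defined set of classes.
sample_class_entities = [
    {"type": "invoice", "confidence": 0.08},
    {"type": "w2", "confidence": 0.9},
    {"type": "receipt", "confidence": 0.02},
]

print(top_class(sample_class_entities))
```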


Document AI pricing

Document AI offers transparent, cost-effective pricing for all your document processing, model training, and storage needs. Visit our pricing page for more details.

If you pay in a currency other than USD, the prices listed in your currency on Google Cloud SKUs apply.


Document AI partners

Get help implementing Document AI from these trusted partners. View full partner directory.