Demo: See Document OCR in action in real time with your own documents or a sample from us. No login or signup required.

OCR (Optical Character Recognition) with world-class Google Cloud AI

Extract text and data from images and documents, turn unstructured content into business-ready structured data, and unlock valuable insights.

Integrate OCR functionalities into your applications through APIs.

OCR solutions at Google Cloud are pre-built ML models, immediately available as APIs that you can easily integrate into your applications.

Overview

What is OCR?

Optical Character Recognition (OCR) is the technology that converts typed, handwritten, or printed text in images into machine-encoded text.

What types of OCR does Google Cloud offer?

Google Cloud offers two types of OCR: OCR for documents and OCR for images and videos.

While they share a foundational technology, Document AI is a document understanding platform optimized for document processing, whereas Cloud Vision is commonly used to detect text, handwriting, and a wide range of objects in images and videos.

You can also use other Google Cloud products to perform OCR when you need more advanced or specific functionality than Document AI and Cloud Vision offer.

How does OCR work at Google Cloud?

Google Cloud powers OCR with best-in-class AI. It goes beyond traditional text recognition by understanding, organizing and enriching data, ultimately generating business-ready insights. 

You can either use the OCR tools as a unified suite for streamlined efficiency (e.g. Document AI), or call the relevant APIs, available directly in the Google Cloud console, to integrate OCR functionality into your applications.

How do Google Cloud AI and OCR work together?

All the OCR solutions mentioned above give you access to pre-trained ML models that you can deploy right away through an API, or uptrain to improve accuracy for your specific needs. 

You can also train your own custom models with AutoML - no machine learning expertise needed. 

See the AutoML documentation for details on building custom ML models.

Which OCR solution is right for me?

If you want to analyze a document or build an automated document processing pipeline, use Document AI. It handles the entire workflow in one place, from understanding documents to searching, storing, governing, and managing them alongside the extracted data.

If you want to analyze and process images, use Cloud Vision alongside other Google Cloud products for best results - check the Common Uses section for details and quickstart guides.

Both APIs are free to try with a Google Cloud account.
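
For the document path, a request to a Document AI processor can be sketched as follows. This is a minimal sketch, not a definitive integration: it only builds the REST URL and JSON body for the `process` method and reads the text out of a response. The project, location, and processor IDs are placeholder assumptions, the helper names are hypothetical, and authentication and the HTTP call itself are left out.

```python
import base64


def process_url(project_id: str, location: str, processor_id: str) -> str:
    """Build the REST URL for a processor's `process` method.

    All three identifiers are placeholders supplied by the caller.
    """
    return (
        f"https://{location}-documentai.googleapis.com/v1/"
        f"projects/{project_id}/locations/{location}/"
        f"processors/{processor_id}:process"
    )


def build_process_request(document_bytes: bytes,
                          mime_type: str = "application/pdf") -> dict:
    """Build the JSON body: the document is sent inline, base64-encoded."""
    return {
        "rawDocument": {
            "content": base64.b64encode(document_bytes).decode("ascii"),
            "mimeType": mime_type,
        }
    }


def extract_document_text(response: dict) -> str:
    """Return the full text of the processed document, or "" if absent."""
    return response.get("document", {}).get("text", "")
```

The body would be sent as an authenticated POST to the URL from `process_url`. Keeping request construction separate from transport makes the payload easy to inspect before any billable call is made.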

How It Works

To understand and process documents, use Document AI.

For images, we recommend using Cloud Vision.

Both give you access to pre-trained ML models that you can deploy as-is through APIs or uptrain. You can also train your own custom models from scratch with AutoML - no ML expertise needed. 

The first 1,000 units each month are free when you use Cloud Vision or Document OCR. Try it with a simple API call.
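
As an illustration of what such an API call might look like for Cloud Vision, the sketch below builds the JSON body for the `images:annotate` REST method and pulls the recognized text out of a response. It assumes the image is sent inline as base64. The helper names are hypothetical, and credentials and the HTTP request itself are omitted.

```python
import base64

# REST endpoint for synchronous image annotation.
VISION_ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_ocr_request(image_bytes: bytes, dense_text: bool = False) -> dict:
    """Build the JSON body for a single-image OCR request.

    TEXT_DETECTION suits sparse text in photos; DOCUMENT_TEXT_DETECTION
    is meant for dense text such as scanned pages.
    """
    feature = "DOCUMENT_TEXT_DETECTION" if dense_text else "TEXT_DETECTION"
    return {
        "requests": [
            {
                "image": {
                    "content": base64.b64encode(image_bytes).decode("ascii")
                },
                "features": [{"type": feature}],
            }
        ]
    }


def extract_text(response: dict) -> str:
    """Return the full recognized text from a response, or "" if none."""
    first = (response.get("responses") or [{}])[0]
    return first.get("fullTextAnnotation", {}).get("text", "")
```

The body would be POSTed to `VISION_ENDPOINT` with an API key or OAuth token. Each image with one feature counts as one unit toward the monthly free allowance mentioned above.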

Try Cloud Vision API

How Cloud Vision recognizes and classifies images

Common Uses

Extract text and insights from documents

Extract text from images

Pricing

How much does my use case cost?

Estimate the monthly cost of a use case from the products it needs and a set of key usage assumptions.

Use case: Image tagging, processing and search
Products used: Cloud Vision, Cloud Storage, Pub/Sub, Cloud Run
Usage assumptions:
1. 15,000 Cloud Vision label detection API calls monthly
2. 100 GiB monthly storage
3. One 1.25 GiB CPU
4. Four GiB published daily through Pub/Sub
Estimated monthly cost (USD): $27.36
See calculation details in calculator

Use case: Extract text and insights from documents
Products used: Document AI, Cloud Storage, BigQuery, Cloud Functions
Usage assumptions:
1. 1,000 Document AI form parser API calls monthly
2. 100 GiB monthly storage
3. 1 TiB monthly queries
4. RAM: 512 MB, CPU: 800 MHz
Estimated monthly cost (USD): $71.87
See calculation details in calculator

Use case: Extract text from images
Products used: Cloud Vision, Cloud Storage, Pub/Sub, Cloud Run
Usage assumptions:
1. 15,000 Cloud Vision OCR API calls monthly
2. 100 GiB monthly storage
3. One 1.25 GiB CPU
4. Four GiB published daily through Pub/Sub
Estimated monthly cost (USD): $27.36
See calculation details in calculator
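
To see how the monthly free allowance interacts with a usage assumption such as 15,000 OCR calls, the arithmetic can be sketched as below. The 1,000 free units come from the text above. The per-1,000-unit price is a caller-supplied placeholder, not a quoted rate, and the sketch assumes the free allowance applies to these calls. It covers only the API-call portion of a use case, not storage, Pub/Sub, or compute.

```python
FREE_UNITS_PER_MONTH = 1000  # free monthly allowance stated in the text


def billable_units(units_used: int,
                   free_units: int = FREE_UNITS_PER_MONTH) -> int:
    """Units charged after subtracting the free monthly allowance."""
    return max(0, units_used - free_units)


def estimated_api_cost(units_used: int, price_per_1000_units: float) -> float:
    """API-call cost only; `price_per_1000_units` is a hypothetical input."""
    return billable_units(units_used) / 1000 * price_per_1000_units
```

With 15,000 calls in a month, `billable_units(15000)` gives 14,000 billable units. Usage at or below 1,000 units gives zero.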

Pricing Calculator

Estimate the cost of your project by pulling in all the tools you need in a single place.

Custom Quote

Connect with our sales team to get a custom quote for your organization's unique needs.

Take the next step with OCR & Google Cloud

New customers get $300 in free credits

Try Cloud Vision for free

1,000 pages per month are free with Document AI

Try Document AI API now

Quickstart: detect labels with Cloud Vision API

Automate a doc processing pipeline with Google AI

See code samples for OCR solutions and use cases
