Process documents with Form Parser

Form Parser extracts key-value pairs (KVPs), tables, selection marks (like checkboxes), generic fields, and text to augment and automate document processing.

Form Parser can be considered over the other parsers when the use case involves:

  • Dealing with structured forms: It excels at extracting KVPs from well-defined forms that look like conventional forms with labeled blanks to fill in. For example name: __, when the blank area has text, it's an example of a kvp. Its pre-trained model offers high accuracy for common fields like names, dates, and addresses.
  • Flexible table extraction is needed: Form Parser extracts from simple (no cells that span rows or columns) tables that look like tables. No training is needed (nor possible). For trained table extraction, the custom extractor can be used with a parent field containing column (cell) child fields.
  • Need efficiency: Avoid building and maintaining extraction parsers, especially for high-volume and varied forms of extraction tasks.

Data-extraction features

Form Parser features encompass:

  • KVP: These are sets of two items within a document—a label or key and its corresponding data (a value). You can directly use KVPs (if the keys are consistent) or build custom logic to resolve varied keys into consistent structured information.

  • Generic entities: Parse 11 different fields from documents out of the box. These include:

    • email
    • phone
    • url
    • date_time
    • address
    • person
    • organization
    • quantity
    • price
    • id
    • page_number
  • Text and layout: Use our latest OCR engine to extract text and layout information. This includes embedded text from digital PDFs (v2.1 only) or text from images.

  • Tables: Detect and extract tables from images and PDFs.

  • Checkboxes: A high-quality selection mark detector, which extracts checkboxes from images and PDF output as KVP, using the text nearest the checkbox, with a valueType indicating whether it is filled or unfilled.

Languages and regions

  • Form Parser 2.0 supports over 200 languages. Learn more.
  • We provide feature support in eight regions. Learn more.

Model versions

The following processor versions are compatible with this feature. For more information, see Managing processor versions.

Version ID Release channel Description
pretrained-form-parser-v1.0-2020-09-23 Stable Legacy version. For best quality and full feature set, use the Form Parser v2.0.
pretrained-form-parser-v2.0-2022-11-10 Stable Recommended version. Supports generic entities and includes upgraded table, KVP, and checkbox model, as well as more than 200 languages.
pretrained-form-parser-v2.1-2023-06-26 Release Candidate Public Preview version. Same model as v2.0 with native text extraction from digital PDF files enabled.

Limitations

  • Prior JPEG compressions for TIFF are unsupported. Type of JPEG encapsulation defined by the TIFF version 6.0 specification.

  • The checkbox model doesn't support parsing radio buttons. Some detected checkboxes might not have corresponding keys.

  • The model doesn't reliably parse a KVP with an unfilled value, such as a blank form.

  • The KVP parsing on documents in certain languages may have lower quality than Latin languages.

Process documents with Form Parser

This quickstart introduces you to the Form Parser feature in Document AI. In this quickstart, you use the Google Cloud console to set up your Google Cloud project and authorization, create a Form Parser, and then make a request for Document AI to process a PDF form.

Learn how to:

  1. Enable Document AI in a Google Cloud project.

  2. Create a Form Parser processor, which can identify and extract text, key-value pairs, tables, and generic entities from many types of documents.

  3. Use the processor to annotate a sample document.


To follow step-by-step guidance for this task directly in the Google Cloud console, click Guide me:

Guide me


  1. Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.
  2. In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

    Go to project selector

  3. Make sure that billing is enabled for your Google Cloud project.

  4. Enable the Document AI API.

    Enable the API

  5. In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

    Go to project selector

  6. Make sure that billing is enabled for your Google Cloud project.

  7. Enable the Document AI API.

    Enable the API

Create a Form Parser processor

Use the Google Cloud console to create a Form Parser processor. See creating and managing processors for more information.

  1. In the Google Cloud console navigation menu, click Document AI and select Processor Gallery.

    Processor Gallery

  2. In the Processor Gallery, search for Form Parser and select Create.

    Form Parser option in UI

  3. In the side window, enter a Processor name, such as quickstart-form-processor.

  4. Select the region closest to you.

  5. Click the Create button.

You're taken to the Processor Details page of your new form parser processor.

Test processor

After creating your processor, you can send annotation requests to it.

  1. Download the sample document.

    It's a PDF file containing a sample handwritten medical intake form. This document is stored in a publicly accessible Cloud Storage bucket.

  2. Click the Upload Test Document button and select the document you just downloaded.

  3. You should now be on the Form Parser analysis page. You can view the OCR detected text, key-value pairs, tables, and generic entities extracted from the document.

    sample form key-value pairs in UI sample form generic entities in UI

Clean up

To avoid unnecessary Google Cloud charges, use the Google Cloud console to delete your processor and project if you don't need them.