Class Page (0.12.2a0)

Page(
    documentai_object: google.cloud.documentai_v1.types.document.Document.Page,
    _document_text: str,
)

Represents a wrapped documentai.Document.Page.

Attributes

Name    Description
documentai_object: google.cloud.documentai.Document.Page
Required. The original object.
text: str
Required. UTF-8 encoded text of the page.
page_number: int
Required. The page number of the Page.
hocr_bounding_box: str
Required. hOCR bounding box of the page element.
symbols: List[Symbol]
Optional. A list of visually detected text symbols (characters/letters) on the page.
tokens: List[Token]
Required. A list of visually detected text tokens (words) on the page.
lines: List[Line]
Required. A list of visually detected text lines on the page. A collection of tokens that a human would perceive as a line.
paragraphs: List[Paragraph]
Required. A list of visually detected text paragraphs on the page. A collection of lines that a human would perceive as a paragraph.
blocks: List[Block]
Required. A list of visually detected text blocks on the page. A collection of lines that a human would perceive as a block.
form_fields: List[FormField]
Optional. A list of visually detected form fields on the page.
tables: List[Table]
Optional. A list of visually detected tables on the page.
math_formulas: List[MathFormula]
Optional. A list of visually detected math formulas on the page.
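
The constructor takes the full document text alongside the raw page object, which suggests that page and element text is resolved from character offsets into that text. The following is a minimal sketch of that idea using a stand-in dataclass rather than the real library types. TextSegment and resolve_text are hypothetical names introduced only for illustration; the wrapper's actual internals may differ.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TextSegment:
    """Hypothetical stand-in for one text segment: the half-open range
    [start_index, end_index) into the full document text."""
    start_index: int
    end_index: int


def resolve_text(segments: List[TextSegment], document_text: str) -> str:
    """Concatenate the slices of document_text that the segments point at."""
    return "".join(document_text[s.start_index:s.end_index] for s in segments)


# Illustrative data only; not taken from a real processor response.
document_text = "Invoice 42\nTotal: 10 USD\n"
first_line = [TextSegment(0, 11)]
second_line = [TextSegment(11, 25)]

print(repr(resolve_text(first_line, document_text)))
print(repr(resolve_text(second_line, document_text)))
```

Under this assumption, an element never stores its own copy of the text; it only needs its offsets and access to the document text.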

Methods

_get_elements

_get_elements(element_type: typing.Type, attribute_name: str) -> typing.List

Helper method to create elements based on specified type.
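
One plausible reading of this helper, given its signature, is that it looks up a repeated field on the raw page by name and wraps each entry in the requested wrapper type. The sketch below illustrates that pattern with stand-in classes. SketchPage, the simplified Line wrapper, and its _page argument are assumptions made for this example and are not confirmed by the reference above.

```python
from dataclasses import dataclass
from types import SimpleNamespace
from typing import List, Type


@dataclass
class Line:
    """Hypothetical stand-in for a wrapped line element."""
    documentai_object: object
    _page: object


@dataclass
class SketchPage:
    """Hypothetical stand-in for Page; holds only the raw page object."""
    documentai_object: object

    def _get_elements(self, element_type: Type, attribute_name: str) -> List:
        # Look up the raw repeated field (for example "lines") by name.
        raw_elements = getattr(self.documentai_object, attribute_name)
        # Wrap each raw element in the requested wrapper type.
        return [
            element_type(documentai_object=element, _page=self)
            for element in raw_elements
        ]


# A SimpleNamespace plays the role of the raw page object here.
raw_page = SimpleNamespace(lines=["raw line 1", "raw line 2"])
page = SketchPage(documentai_object=raw_page)
lines = page._get_elements(Line, "lines")
print(len(lines), type(lines[0]).__name__)
```

Passing the wrapper type and the attribute name as arguments lets one helper serve tokens, lines, paragraphs, and blocks without repeating the wrapping loop for each.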