TextAnnotation(mapping=None, *, ignore_unknown_fields=False, **kwargs)
TextAnnotation contains a structured representation of OCR extracted text. The hierarchy of an OCR extracted text structure is like this: TextAnnotation -> Page -> Block -> Paragraph -> Word -> Symbol Each structural component, starting from Page, may further have their own properties. Properties describe detected languages, breaks etc.. Please refer to the TextAnnotation.TextProperty message definition below for more detail.
Attributes
Name | Description |
pages |
Sequence[google.cloud.vision_v1.types.Page]
List of pages detected by OCR. |
text |
str
UTF-8 text detected on the pages. |
Inheritance
builtins.object > proto.message.Message > TextAnnotationClasses
DetectedBreak
DetectedBreak(mapping=None, *, ignore_unknown_fields=False, **kwargs)
Detected start or end of a structural component.
DetectedLanguage
DetectedLanguage(mapping=None, *, ignore_unknown_fields=False, **kwargs)
Detected language for a structural component.
TextProperty
TextProperty(mapping=None, *, ignore_unknown_fields=False, **kwargs)
Additional information detected on the structural component.