Toolbox - Convert Document to hOCR
bookmark_borderbookmark
Stay organized with collections
Save and categorize content based on your preferences.
Convert Document
output from Document AI to an hOCR XML string.
Explore further
For detailed documentation that includes this code sample, see the following:
Code sample
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],[],[[["This code demonstrates how to convert a `Document` object from Document AI into an hOCR XML string."],["The process involves utilizing the `document` module from the `google.cloud.documentai_toolbox` library."],["Authentication to Document AI requires setting up Application Default Credentials, as detailed in the provided documentation link."],["The provided code sample can be found with further details in the Document AI Toolbox client libraries documentation, which can be explored for further information."],["The `Document` object, `wrapped_document`, can be exported into hOCR string format through the `export_hocr_str` function."]]],[]]