Toolbox - Convert Document to hOCR
透過集合功能整理內容
你可以依據偏好儲存及分類內容。
將 Document AI 的 Document
輸出內容轉換為 hOCR XML 字串。
深入探索
如需包含這個程式碼範例的詳細說明文件,請參閱下列內容:
程式碼範例
除非另有註明,否則本頁面中的內容是採用創用 CC 姓名標示 4.0 授權,程式碼範例則為阿帕契 2.0 授權。詳情請參閱《Google Developers 網站政策》。Java 是 Oracle 和/或其關聯企業的註冊商標。
[[["容易理解","easyToUnderstand","thumb-up"],["確實解決了我的問題","solvedMyProblem","thumb-up"],["其他","otherUp","thumb-up"]],[["難以理解","hardToUnderstand","thumb-down"],["資訊或程式碼範例有誤","incorrectInformationOrSampleCode","thumb-down"],["缺少我需要的資訊/範例","missingTheInformationSamplesINeed","thumb-down"],["翻譯問題","translationIssue","thumb-down"],["其他","otherDown","thumb-down"]],[],[[["\u003cp\u003eThis code demonstrates how to convert a \u003ccode\u003eDocument\u003c/code\u003e object from Document AI into an hOCR XML string.\u003c/p\u003e\n"],["\u003cp\u003eThe process involves utilizing the \u003ccode\u003edocument\u003c/code\u003e module from the \u003ccode\u003egoogle.cloud.documentai_toolbox\u003c/code\u003e library.\u003c/p\u003e\n"],["\u003cp\u003eAuthentication to Document AI requires setting up Application Default Credentials, as detailed in the provided documentation link.\u003c/p\u003e\n"],["\u003cp\u003eThe provided code sample can be found with further details in the Document AI Toolbox client libraries documentation, which can be explored for further information.\u003c/p\u003e\n"],["\u003cp\u003eThe \u003ccode\u003eDocument\u003c/code\u003e object, \u003ccode\u003ewrapped_document\u003c/code\u003e, can be exported into hOCR string format through the \u003ccode\u003eexport_hocr_str\u003c/code\u003e function.\u003c/p\u003e\n"]]],[],null,["# Toolbox - Convert Document to hOCR\n\nConvert `Document` output from Document AI to an hOCR XML string.\n\nExplore further\n---------------\n\n\nFor detailed documentation that includes this code sample, see the following:\n\n- [Document AI Toolbox client libraries](/document-ai/docs/toolbox)\n- [Handle processing response](/document-ai/docs/handle-response)\n\nCode sample\n-----------\n\n### Python\n\n\nFor more information, see the\n[Document AI Python API\nreference documentation](/python/docs/reference/documentai/latest).\n\n\nTo authenticate to Document AI, set up Application Default Credentials.\nFor more information, see\n\n[Set up authentication for a local development environment](/docs/authentication/set-up-adc-local-dev-environment).\n\n\n from google.cloud.documentai_toolbox import document\n\n # TODO(developer): Uncomment these variables before running the sample.\n # Given a document.proto or sharded document.proto in path gs://bucket/path/to/folder\n # document_path = \"path/to/local/document.json\"\n # document_title = \"your-document-title\"\n\n\n def convert_document_to_hocr_sample(document_path: str, document_title: str) -\u003e str:\n wrapped_document = document.Document.from_document_path(document_path=document_path)\n\n # Converting wrapped_document to hOCR format\n hocr_string = wrapped_document.export_hocr_str(title=document_title)\n\n print(\"Document converted to hOCR!\")\n return hocr_string\n\nWhat's next\n-----------\n\n\nTo search and filter code samples for other Google Cloud products, see the\n[Google Cloud sample browser](/docs/samples?product=documentai)."]]