Toolbox - Convert Document to hOCR
Convert `Document` output from Document AI to an hOCR XML string.
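For orientation, hOCR represents OCR results as HTML/XML markup: elements carry classes such as `ocr_page`, `ocr_line`, and `ocrx_word`, and each element's `title` attribute holds properties like `bbox x0 y0 x1 y1`. The following is a minimal sketch of reading word boxes back out of such a string using only the Python standard library. The embedded snippet is a hand-written illustration, not actual Toolbox output, and `WordBox`, `parse_bbox`, and `extract_words` are hypothetical names introduced here.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import List, Optional, Tuple

BBox = Tuple[int, int, int, int]


@dataclass
class WordBox:
    """One recognized word and its bounding box (x0, y0, x1, y1) in pixels."""
    text: str
    bbox: BBox


def parse_bbox(title: str) -> Optional[BBox]:
    """Return the bbox from an hOCR title attribute, or None if absent."""
    # Properties in the title attribute are separated by semicolons.
    for prop in title.split(";"):
        parts = prop.split()
        if len(parts) == 5 and parts[0] == "bbox":
            x0, y0, x1, y1 = (int(p) for p in parts[1:])
            return (x0, y0, x1, y1)
    return None


class _WordCollector(HTMLParser):
    """Collects the text and bbox of every element with class ocrx_word."""

    def __init__(self) -> None:
        super().__init__()
        self.words: List[WordBox] = []
        self._depth = 0  # nesting depth inside the current word element
        self._bbox: Optional[BBox] = None
        self._text: List[str] = []

    def handle_starttag(self, tag, attrs):
        if self._depth:
            # Nested markup inside a word. Assumes every nested tag is closed.
            self._depth += 1
            return
        attr = dict(attrs)
        if "ocrx_word" in (attr.get("class") or "").split():
            self._depth = 1
            self._bbox = parse_bbox(attr.get("title") or "")
            self._text = []

    def handle_data(self, data):
        if self._depth:
            self._text.append(data)

    def handle_endtag(self, tag):
        if not self._depth:
            return
        self._depth -= 1
        if self._depth == 0:
            text = "".join(self._text).strip()
            if text and self._bbox is not None:
                self.words.append(WordBox(text=text, bbox=self._bbox))


def extract_words(hocr: str) -> List[WordBox]:
    """Parse an hOCR string and return its words in document order."""
    collector = _WordCollector()
    collector.feed(hocr)
    collector.close()
    return collector.words


# Hand-written hOCR fragment for illustration only (not Toolbox output).
SAMPLE_HOCR = """<div class='ocr_page' id='page_1' title='bbox 0 0 600 800'>
  <span class='ocr_line' id='line_1_1' title='bbox 36 92 300 116'>
    <span class='ocrx_word' id='word_1_1' title='bbox 36 92 96 116'>Hello</span>
    <span class='ocrx_word' id='word_1_2' title='bbox 104 92 180 116; x_wconf 93'>world</span>
  </span>
</div>"""

if __name__ == "__main__":
    for word in extract_words(SAMPLE_HOCR):
        print(word.text, word.bbox)
```

The sketch uses `html.parser` rather than an XML parser so that it does not depend on whether the input declares an XHTML namespace. The exact elements and properties that Document AI Toolbox emits may differ from this fragment.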
Explore further
For detailed documentation that includes this code sample, see the following:
- [Document AI Toolbox client libraries](/document-ai/docs/toolbox)
- [Handle processing response](/document-ai/docs/handle-response)
Code sample
Python
For more information, see the [Document AI Python API reference documentation](/python/docs/reference/documentai/latest).
To authenticate to Document AI, set up Application Default Credentials. For more information, see [Set up authentication for a local development environment](/docs/authentication/set-up-adc-local-dev-environment).

    from google.cloud.documentai_toolbox import document

    # TODO(developer): Uncomment these variables before running the sample.
    # Given a document.proto or sharded document.proto in path gs://bucket/path/to/folder
    # document_path = "path/to/local/document.json"
    # document_title = "your-document-title"


    def convert_document_to_hocr_sample(document_path: str, document_title: str) -> str:
        wrapped_document = document.Document.from_document_path(document_path=document_path)

        # Converting wrapped_document to hOCR format
        hocr_string = wrapped_document.export_hocr_str(title=document_title)

        print("Document converted to hOCR!")
        return hocr_string

What's next
To search and filter code samples for other Google Cloud products, see the [Google Cloud sample browser](/docs/samples?product=documentai).
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.