Options for Process API
JSON representation |
---|
{ "ocrConfig": { object (OcrConfig) } } |
Fields | |
---|---|
ocrConfig | Only applicable to "Document OCR Processor". Returns an error if set on other processor types. |
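
As a minimal sketch (not an official client call), the options object described above can be assembled as a plain dictionary and serialized to JSON. The helper name `build_process_options` is hypothetical; only the `ocrConfig` key and the `enableNativePdfParsing` field come from this reference.

```python
import json


def build_process_options(ocr_config=None):
    """Return a dict shaped like the process options object (hypothetical helper).

    Only the "ocrConfig" key is taken from the reference; it is omitted
    entirely when no OCR config is supplied, since setting it is only
    valid for a Document OCR processor.
    """
    options = {}
    if ocr_config is not None:
        options["ocrConfig"] = ocr_config
    return options


# Example: options for a Document OCR processor with native PDF parsing on.
process_options = build_process_options({"enableNativePdfParsing": True})
print(json.dumps(process_options, indent=2))
```

Leaving `ocrConfig` out for other processor types avoids the error mentioned in the field description.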
OcrConfig
Config for Document OCR.
JSON representation |
---|
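{ "hints": { object (Hints) }, "enableNativePdfParsing": boolean, "enableImageQualityScores": boolean, "advancedOcrOptions": [ string ], "enableSymbol": boolean } |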
Fields | |
---|---|
hints | Hints for the OCR model. |
enableNativePdfParsing | Enables special handling for PDFs with existing text information. Results in better text extraction quality for such PDF inputs. |
enableImageQualityScores | Enables intelligent document quality scores after OCR. Can help diagnose why OCR responses are of poor quality for a given input. Adds latency to the process call comparable to that of regular OCR. |
advancedOcrOptions[] | A list of advanced OCR options to further fine-tune OCR behavior. Current valid values are: |
enableSymbol | Includes symbol-level OCR information if set to true. |
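
The fields above can be combined into a single config object. The following is a sketch only: `build_ocr_config` and `KNOWN_OCR_FIELDS` are hypothetical names, and the client-side check for unknown field names is an illustrative assumption, not behavior of the service.

```python
import json

# Field names as listed in the OcrConfig fields table.
KNOWN_OCR_FIELDS = {
    "hints",
    "enableNativePdfParsing",
    "enableImageQualityScores",
    "advancedOcrOptions",
    "enableSymbol",
}


def build_ocr_config(**fields):
    """Build an OcrConfig-shaped dict, rejecting unknown names (hypothetical helper)."""
    unknown = set(fields) - KNOWN_OCR_FIELDS
    if unknown:
        raise ValueError(f"unknown OcrConfig fields: {sorted(unknown)}")
    # Fields passed as None are simply left out of the resulting JSON.
    return {name: value for name, value in fields.items() if value is not None}


ocr_config = build_ocr_config(
    enableNativePdfParsing=True,
    enableImageQualityScores=False,  # skip the extra latency of quality scoring
    enableSymbol=True,
)
print(json.dumps(ocr_config, indent=2))
```

Keeping `enableImageQualityScores` off by default reflects the latency cost noted in its description; turning it on is mainly useful when diagnosing poor OCR output.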
Hints
Hints for the OCR engine.
JSON representation |
---|
{ "languageHints": [ string ] } |
Fields | |
---|---|
languageHints[] | List of BCP-47 language codes to use for OCR. In most cases, leaving this unset yields the best results, because it enables automatic language detection. For languages based on the Latin alphabet, setting hints is not needed. In rare cases, when the language of the text in the image is known, setting a hint helps get better results (although a wrong hint is a significant hindrance). |
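
One way to follow this guidance is to send hints only when the language is actually known. The sketch below assumes a hypothetical helper, `build_hints`, that returns nothing when no codes are given, so that automatic language detection stays in effect; it does not validate that the codes are well-formed BCP-47 tags.

```python
def build_hints(language_codes=None):
    """Return a Hints-shaped dict, or None when no hint should be sent.

    Hypothetical helper: an empty or missing list means "do not set
    languageHints", which leaves automatic language detection enabled.
    """
    if not language_codes:
        return None
    # Remove duplicates while keeping the caller's order.
    unique_codes = list(dict.fromkeys(code.strip() for code in language_codes))
    return {"languageHints": unique_codes}


# No known language: omit hints entirely.
print(build_hints())

# Known language(s): pass the codes through, de-duplicated.
print(build_hints(["ja", "ja", "en"]))
```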