DocumentProcessingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)
A singleton resource of DataStore. If it is empty when the DataStore is created and the DataStore is set to DataStore.ContentConfig.CONTENT_REQUIRED, the default parser is the digital parser.
Attributes | |
---|---|
Name | Description |
name |
str
The full resource name of the Document Processing Config. Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig .
|
chunking_config |
google.cloud.discoveryengine_v1beta.types.DocumentProcessingConfig.ChunkingConfig
Whether chunking mode is enabled. |
default_parsing_config |
google.cloud.discoveryengine_v1beta.types.DocumentProcessingConfig.ParsingConfig
Configuration for the default Document parser. If not specified, it is configured as the default DigitalParsingConfig, and that default parsing config is applied to all file types for Document parsing. |
parsing_config_overrides |
MutableMapping[str, google.cloud.discoveryengine_v1beta.types.DocumentProcessingConfig.ParsingConfig]
Map from file type to a parsing configuration that overrides the default for that file type. Supported keys:
- pdf : Override parsing config for PDF files; digital parsing, OCR parsing, or layout parsing is supported.
- html : Override parsing config for HTML files; only digital parsing and layout parsing are supported.
- docx : Override parsing config for DOCX files; only digital parsing and layout parsing are supported.
- pptx : Override parsing config for PPTX files; only digital parsing and layout parsing are supported.
- xlsm : Override parsing config for XLSM files; only digital parsing and layout parsing are supported.
- xlsx : Override parsing config for XLSX files; only digital parsing and layout parsing are supported.
|
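The per-file-type rules for parsing_config_overrides can be checked locally before a config is built. The following is a minimal sketch in plain Python and is not part of the client library: `SUPPORTED_PARSERS`, `validate_overrides`, and the parser labels `"digital"`, `"ocr"`, and `"layout"` are hypothetical names chosen for illustration. Only the file-type keys and the parser kinds allowed for each key are taken from the table above.

```python
# Hypothetical lookup table: which parser kinds the table above lists
# as supported for each override key.
SUPPORTED_PARSERS = {
    "pdf": {"digital", "ocr", "layout"},
    "html": {"digital", "layout"},
    "docx": {"digital", "layout"},
    "pptx": {"digital", "layout"},
    "xlsm": {"digital", "layout"},
    "xlsx": {"digital", "layout"},
}


def validate_overrides(overrides):
    """Return a list of problems found in a {file_type: parser_kind} mapping.

    An empty list means every entry matches the supported combinations.
    """
    problems = []
    for file_type, parser in overrides.items():
        allowed = SUPPORTED_PARSERS.get(file_type)
        if allowed is None:
            problems.append(f"unsupported file type: {file_type}")
        elif parser not in allowed:
            problems.append(f"{parser} parsing is not supported for {file_type}")
    return problems
```

A call such as `validate_overrides({"docx": "ocr"})` reports one problem, because OCR parsing is listed only for PDF files. Translating a validated mapping into actual ParsingConfig messages is left out here, since that step depends on the client library itself.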
Classes
ChunkingConfig
ChunkingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)
Configuration for chunking.
ParsingConfig
ParsingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)
Related configurations applied to a specific type of document parser.
This message has oneof fields (mutually exclusive fields).
For each oneof, at most one member field can be set at the same time.
Setting any member of the oneof automatically clears all other members.
See https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
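The oneof rule described above (at most one member set, and setting one member clears the rest) can be illustrated with a toy class. This is only a sketch of the semantics in plain Python, not how proto-plus implements them: `OneofSketch`, its methods, and the member names `"digital"`, `"ocr"`, and `"layout"` are hypothetical and do not correspond to the real ParsingConfig field names.

```python
class OneofSketch:
    """Toy stand-in for a message with a single oneof group."""

    # Hypothetical member names, for illustration only.
    _MEMBERS = ("digital", "ocr", "layout")

    def __init__(self):
        self._set_member = None  # name of the member currently set, if any
        self._value = None       # value held by that member

    def set(self, member, value):
        """Set one member; any previously set member is cleared."""
        if member not in self._MEMBERS:
            raise KeyError(member)
        self._set_member = member
        self._value = value

    def get(self, member):
        """Return the member's value, or None if that member is not set."""
        if member not in self._MEMBERS:
            raise KeyError(member)
        return self._value if member == self._set_member else None

    def which(self):
        """Return the name of the member currently set, or None."""
        return self._set_member
```

For example, after `cfg = OneofSketch()`, calling `cfg.set("digital", {})` and then `cfg.set("ocr", {})` leaves only the `"ocr"` member set; `cfg.get("digital")` then returns `None`.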
ParsingConfigOverridesEntry
ParsingConfigOverridesEntry(mapping=None, *, ignore_unknown_fields=False, **kwargs)
The abstract base class for a message.
Parameters | |
---|---|
Name | Description |
kwargs |
dict
Keys and values corresponding to the fields of the message. |
mapping |
Union[dict,
A dictionary or message to be used to determine the values for this message. |
ignore_unknown_fields |
Optional(bool)
If True, do not raise errors for unknown fields. Only applied if |