Discovery Engine v1 API - Class DocumentProcessingConfig (1.5.0)

public sealed class DocumentProcessingConfig : IMessage<DocumentProcessingConfig>, IEquatable<DocumentProcessingConfig>, IDeepCloneable<DocumentProcessingConfig>, IBufferMessage, IMessage

Reference documentation and code samples for the Discovery Engine v1 API class DocumentProcessingConfig.

A singleton resource of [DataStore][google.cloud.discoveryengine.v1.DataStore]. If it is empty when the [DataStore][google.cloud.discoveryengine.v1.DataStore] is created and the [DataStore][google.cloud.discoveryengine.v1.DataStore] is set to [DataStore.ContentConfig.CONTENT_REQUIRED][google.cloud.discoveryengine.v1.DataStore.ContentConfig.CONTENT_REQUIRED], the digital parser is used as the default parser.

Inheritance

object > DocumentProcessingConfig

Namespace

Google.Cloud.DiscoveryEngine.V1

Assembly

Google.Cloud.DiscoveryEngine.V1.dll

Constructors

DocumentProcessingConfig()

public DocumentProcessingConfig()

DocumentProcessingConfig(DocumentProcessingConfig)

public DocumentProcessingConfig(DocumentProcessingConfig other)
Parameter
Name Description
other DocumentProcessingConfig

Properties

ChunkingConfig

public DocumentProcessingConfig.Types.ChunkingConfig ChunkingConfig { get; set; }

Whether chunking mode is enabled.

Property Value
Type Description
DocumentProcessingConfig.Types.ChunkingConfig

DefaultParsingConfig

public DocumentProcessingConfig.Types.ParsingConfig DefaultParsingConfig { get; set; }

Configuration for the default Document parser. If not specified, it is set to the default DigitalParsingConfig, and that default parsing config is applied to all file types for Document parsing.

Property Value
Type Description
DocumentProcessingConfig.Types.ParsingConfig

DocumentProcessingConfigName

public DocumentProcessingConfigName DocumentProcessingConfigName { get; set; }

DocumentProcessingConfigName-typed view over the Name resource name property.

Property Value
Type Description
DocumentProcessingConfigName

Name

public string Name { get; set; }

The full resource name of the Document Processing Config. Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.

Property Value
Type Description
string

ParsingConfigOverrides

public MapField<string, DocumentProcessingConfig.Types.ParsingConfig> ParsingConfigOverrides { get; }

Map from file type to the parsing configuration that overrides the default for that file type. Supported keys:

  • pdf: Override parsing config for PDF files. Digital parsing, OCR parsing, and layout parsing are supported.
  • html: Override parsing config for HTML files. Only digital parsing and layout parsing are supported.
  • docx: Override parsing config for DOCX files. Only digital parsing and layout parsing are supported.
  • pptx: Override parsing config for PPTX files. Only digital parsing and layout parsing are supported.
  • xlsm: Override parsing config for XLSM files. Only digital parsing and layout parsing are supported.
  • xlsx: Override parsing config for XLSX files. Only digital parsing and layout parsing are supported.

Property Value
Type Description
MapField<string, DocumentProcessingConfig.Types.ParsingConfig>
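
Example

The following is a minimal sketch, not taken from the official samples, of how the properties above might be combined in an object initializer. The project, location, collection, and data store IDs in the resource name are hypothetical placeholders. The DigitalParsingConfig and OcrParsingConfig members of ParsingConfig (and their nested types) are not listed on this page and are assumed here; check the ParsingConfig reference before relying on them. Because ParsingConfigOverrides has a getter only, entries are added through a nested initializer rather than by assigning a new map.

```csharp
using System;
using Google.Cloud.DiscoveryEngine.V1;
// Alias for the nested message type shared by DefaultParsingConfig and ParsingConfigOverrides.
using ParsingConfig = Google.Cloud.DiscoveryEngine.V1.DocumentProcessingConfig.Types.ParsingConfig;

var config = new DocumentProcessingConfig
{
    // Follows the documented format; every ID segment below is a hypothetical placeholder.
    Name = "projects/my-project/locations/global/collections/default_collection"
         + "/dataStores/my-data-store/documentProcessingConfig",

    // Assumption: ParsingConfig exposes a DigitalParsingConfig member.
    DefaultParsingConfig = new ParsingConfig
    {
        DigitalParsingConfig = new ParsingConfig.Types.DigitalParsingConfig()
    },

    // The property has no setter, so the existing map is populated in place.
    ParsingConfigOverrides =
    {
        // Assumption: ParsingConfig exposes an OcrParsingConfig member.
        // "pdf" is the only documented key for which OCR parsing is supported.
        ["pdf"] = new ParsingConfig
        {
            OcrParsingConfig = new ParsingConfig.Types.OcrParsingConfig()
        }
    }
};

// Inspect which file types carry an override.
foreach (var fileType in config.ParsingConfigOverrides.Keys)
{
    Console.WriteLine(fileType);
}
```

File types without an entry in ParsingConfigOverrides fall back to DefaultParsingConfig, so the map only needs to list the exceptions.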