Discovery Engine v1beta API - Class DocumentProcessingConfig (1.0.0-beta11)

public sealed class DocumentProcessingConfig : IMessage<DocumentProcessingConfig>, IEquatable<DocumentProcessingConfig>, IDeepCloneable<DocumentProcessingConfig>, IBufferMessage, IMessage

Reference documentation and code samples for the Discovery Engine v1beta API class DocumentProcessingConfig.

A singleton resource of [DataStore][google.cloud.discoveryengine.v1beta.DataStore]. It's empty when [DataStore][google.cloud.discoveryengine.v1beta.DataStore] is created, which defaults to digital parser. The first call to [DataStoreService.UpdateDocumentProcessingConfig][] method will initialize the config.

Inheritance

object > DocumentProcessingConfig

Namespace

Google.Cloud.DiscoveryEngine.V1Beta

Assembly

Google.Cloud.DiscoveryEngine.V1Beta.dll

Constructors

DocumentProcessingConfig()

public DocumentProcessingConfig()

DocumentProcessingConfig(DocumentProcessingConfig)

public DocumentProcessingConfig(DocumentProcessingConfig other)
Parameter
NameDescription
otherDocumentProcessingConfig

Properties

DefaultParsingConfig

public DocumentProcessingConfig.Types.ParsingConfig DefaultParsingConfig { get; set; }

Configurations for default Document parser. If not specified, we will configure it as default DigitalParsingConfig, and the default parsing config will be applied to all file types for Document parsing.

Property Value
TypeDescription
DocumentProcessingConfigTypesParsingConfig

DocumentProcessingConfigName

public DocumentProcessingConfigName DocumentProcessingConfigName { get; set; }

DocumentProcessingConfigName-typed view over the Name resource name property.

Property Value
TypeDescription
DocumentProcessingConfigName

Name

public string Name { get; set; }

The full resource name of the Document Processing Config. Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.

Property Value
TypeDescription
string

ParsingConfigOverrides

public MapField<string, DocumentProcessingConfig.Types.ParsingConfig> ParsingConfigOverrides { get; }

Map from file type to override the default parsing configuration based on the file type. Supported keys:

  • pdf: Override parsing config for PDF files, either digital parsing, ocr parsing or layout parsing is supported.
  • html: Override parsing config for HTML files, only digital parsing and or layout parsing are supported.
  • docx: Override parsing config for DOCX files, only digital parsing and or layout parsing are supported.
Property Value
TypeDescription
MapFieldstringDocumentProcessingConfigTypesParsingConfig