Discovery Engine v1beta API - Class GcsSource (1.0.0-beta13)

public sealed class GcsSource : IMessage<GcsSource>, IEquatable<GcsSource>, IDeepCloneable<GcsSource>, IBufferMessage, IMessage

Reference documentation and code samples for the Discovery Engine v1beta API class GcsSource.

Cloud Storage location for input content.

Inheritance

object > GcsSource

Namespace

Google.Cloud.DiscoveryEngine.V1Beta

Assembly

Google.Cloud.DiscoveryEngine.V1Beta.dll

Constructors

GcsSource()

public GcsSource()

GcsSource(GcsSource)

public GcsSource(GcsSource other)

Parameter
Name   Description
other  GcsSource

Properties

DataSchema

public string DataSchema { get; set; }

The schema to use when parsing the data from the source.

Supported values for document imports:

  • document (default): One JSON Document per line. Each document must have a valid Document.id.
  • content: Unstructured data (for example, PDF or HTML). Each file matched by input_uris becomes a document, with the ID set to the first 128 bits of SHA256(URI) encoded as a hex string.
  • custom: One custom data JSON per row in arbitrary format that conforms to the defined Schema of the data store. This can only be used by the GENERIC Data Store vertical.
  • csv: A CSV file with a header conforming to the defined Schema of the data store. Each entry after the header is imported as a Document. This can only be used by the GENERIC Data Store vertical.
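
The ID rule for the content schema can be illustrated with a short sketch. ContentDocumentId and FromUri are hypothetical names, not part of the client library. The sketch assumes the URI is hashed as UTF-8 bytes and that the hex string is lowercase; the description above specifies neither detail, so treat the result as illustrative rather than as the exact ID the service assigns.

```csharp
using System;
using System.Security.Cryptography;
using System.Text;

// Hypothetical helper, not part of Google.Cloud.DiscoveryEngine.V1Beta.
public static class ContentDocumentId
{
    // Returns the first 128 bits (16 bytes) of SHA-256(uri) as 32 hex characters.
    // Assumptions: UTF-8 encoding of the URI and lowercase hex output.
    public static string FromUri(string uri)
    {
        using (SHA256 sha256 = SHA256.Create())
        {
            byte[] digest = sha256.ComputeHash(Encoding.UTF8.GetBytes(uri));
            var hex = new StringBuilder(32);
            for (int i = 0; i < 16; i++)
            {
                hex.Append(digest[i].ToString("x2"));
            }
            return hex.ToString();
        }
    }
}
```

Calling ContentDocumentId.FromUri("gs://bucket/directory/file.pdf") yields a 32-character hex string, and the same URI always maps to the same string.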

Supported values for user event imports:

  • user_event (default): One JSON UserEvent per line.

Property Value
Type     Description
string
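
A client could check a DataSchema value against the lists above before sending a request. The following is a minimal sketch under stated assumptions: ImportKind, DataSchemaValues, and Resolve are hypothetical names, not library APIs, and treating an empty value as "use the default" is an assumption based on the "(default)" markers above. The service remains the authority on which values it accepts.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical types for illustration; not part of the client library.
public enum ImportKind { Documents, UserEvents }

public static class DataSchemaValues
{
    // Values copied from the lists in the DataSchema description.
    private static readonly HashSet<string> DocumentSchemas =
        new HashSet<string>(StringComparer.Ordinal) { "document", "content", "custom", "csv" };

    private static readonly HashSet<string> UserEventSchemas =
        new HashSet<string>(StringComparer.Ordinal) { "user_event" };

    // Returns the schema to use, or throws if the value is not listed for this import kind.
    // Assumption: a null or empty value means the documented default applies.
    public static string Resolve(ImportKind kind, string dataSchema)
    {
        if (string.IsNullOrEmpty(dataSchema))
        {
            return kind == ImportKind.Documents ? "document" : "user_event";
        }

        HashSet<string> allowed = kind == ImportKind.Documents ? DocumentSchemas : UserEventSchemas;
        if (!allowed.Contains(dataSchema))
        {
            throw new ArgumentException(
                "Unsupported data schema '" + dataSchema + "' for " + kind + " imports.",
                nameof(dataSchema));
        }
        return dataSchema;
    }
}
```

For example, DataSchemaValues.Resolve(ImportKind.Documents, "csv") returns "csv", while passing "user_event" for a document import throws an ArgumentException.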

InputUris

public RepeatedField<string> InputUris { get; }

Required. Cloud Storage URIs to input files. Each URI can be up to 2000 characters long. A URI can match the full object path (for example, gs://bucket/directory/object.json) or be a pattern matching one or more files, such as gs://bucket/directory/*.json.

A request can contain at most 100 files (or 100,000 files if data_schema is content). Each file can be up to 2 GB (or 100 MB if data_schema is content).

Property Value
Type                   Description
RepeatedField<string>
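
The URI and file-count limits described for InputUris can be checked locally before a request is sent. This is a sketch only: GcsSourceLimits and its members are hypothetical names, not library APIs. It assumes every URI uses the gs:// scheme shown in the examples and that "characters" corresponds to string.Length. The file-count limit applies to matched files, which a wildcard pattern can only resolve on the server, so the sketch merely exposes that limit as a number.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical pre-flight helper; the service performs the authoritative validation.
public static class GcsSourceLimits
{
    public const int MaxUriLength = 2000;

    // At most 100 files per request, or 100,000 when data_schema is "content".
    public static int MaxFilesPerRequest(string dataSchema)
    {
        return dataSchema == "content" ? 100000 : 100;
    }

    // Returns one message per problem found; an empty list means no problem was detected.
    public static IReadOnlyList<string> FindProblems(IEnumerable<string> inputUris)
    {
        var problems = new List<string>();
        foreach (string uri in inputUris)
        {
            // Assumption: all input URIs use the gs:// scheme, as in the examples.
            if (!uri.StartsWith("gs://", StringComparison.Ordinal))
            {
                problems.Add("Not a Cloud Storage URI: " + uri);
            }
            // Assumption: the 2000-character limit is measured with string.Length.
            if (uri.Length > MaxUriLength)
            {
                problems.Add("URI longer than " + MaxUriLength + " characters: " + uri.Substring(0, 40) + "...");
            }
        }
        return problems;
    }
}
```

A caller might run GcsSourceLimits.FindProblems over the intended URIs and add them to InputUris only when the returned list is empty.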