public sealed class GcsSource : IMessage<GcsSource>, IEquatable<GcsSource>, IDeepCloneable<GcsSource>, IBufferMessage, IMessage
Reference documentation and code samples for the Discovery Engine v1 API class GcsSource.
Cloud Storage location for input content.
Implements
IMessageGcsSource, IEquatableGcsSource, IDeepCloneableGcsSource, IBufferMessage, IMessageNamespace
Google.Cloud.DiscoveryEngine.V1Assembly
Google.Cloud.DiscoveryEngine.V1.dll
Constructors
GcsSource()
public GcsSource()
GcsSource(GcsSource)
public GcsSource(GcsSource other)
Parameter | |
---|---|
Name | Description |
other | GcsSource |
Properties
DataSchema
public string DataSchema { get; set; }
The schema to use when parsing the data from the source.
Supported values for document imports:
document
(default): One JSON [Document][google.cloud.discoveryengine.v1.Document] per line. Each document must have a valid [Document.id][google.cloud.discoveryengine.v1.Document.id].content
: Unstructured data (e.g. PDF, HTML). Each file matched byinput_uris
becomes a document, with the ID set to the first 128 bits of SHA256(URI) encoded as a hex string.custom
: One custom data JSON per row in arbitrary format that conforms to the defined [Schema][google.cloud.discoveryengine.v1.Schema] of the data store. This can only be used by Gen App Builder.csv
: A CSV file with header conforming to the defined [Schema][google.cloud.discoveryengine.v1.Schema] of the data store. Each entry after the header is imported as a Document. This can only be used by Gen App Builder.
Supported values for user even imports:
user_event
(default): One JSON [UserEvent][google.cloud.discoveryengine.v1.UserEvent] per line.
Property Value | |
---|---|
Type | Description |
string |
InputUris
public RepeatedField<string> InputUris { get; }
Required. Cloud Storage URIs to input files. URI can be up to
2000 characters long. URIs can match the full object path (for example,
gs://bucket/directory/object.json
) or a pattern matching one or more
files, such as gs://bucket/directory/*.json
.
A request can contain at most 100 files (or 100,000 files if data_schema
is content
). Each file can be up to 2 GB (or 100 MB if data_schema
is
content
).
Property Value | |
---|---|
Type | Description |
RepeatedFieldstring |