Class Document

public sealed class Document : IMessage<Document>, IEquatable<Document>, IDeepCloneable<Document>, IBufferMessage, IMessage

Document represents the canonical document resource in Document Understanding AI. It is an interchange format that provides insights into documents and allows for collaboration between users and Document Understanding AI to iterate and optimize for quality.

Inheritance

Object > Document

Namespace

Google.Cloud.DocumentAI.V1Beta2

Assembly

Google.Cloud.DocumentAI.V1Beta2.dll

Constructors

Document()

public Document()

Document(Document)

public Document(Document other)
Parameter
TypeNameDescription
Documentother

Properties

Content

public ByteString Content { get; set; }

Inline document content, represented as a stream of bytes. Note: As with all bytes fields, protobuffers use a pure binary representation, whereas JSON representations use base64.

Property Value
TypeDescription
ByteString

Entities

public RepeatedField<Document.Types.Entity> Entities { get; }

A list of entities detected on [Document.text][google.cloud.documentai.v1beta2.Document.text]. For document shards, entities in this list may cross shard boundaries.

Property Value
TypeDescription
RepeatedField<Document.Types.Entity>

EntityRelations

public RepeatedField<Document.Types.EntityRelation> EntityRelations { get; }

Relationship among [Document.entities][google.cloud.documentai.v1beta2.Document.entities].

Property Value
TypeDescription
RepeatedField<Document.Types.EntityRelation>

Error

public Status Error { get; set; }

Any error that occurred while processing this document.

Property Value
TypeDescription
Status

Labels

public RepeatedField<Document.Types.Label> Labels { get; }

[Label][google.cloud.documentai.v1beta2.Document.Label]s for this document.

Property Value
TypeDescription
RepeatedField<Document.Types.Label>

MimeType

public string MimeType { get; set; }

An IANA published MIME type (also referred to as media type). For more information, see https://www.iana.org/assignments/media-types/media-types.xhtml.

Property Value
TypeDescription
String

Pages

public RepeatedField<Document.Types.Page> Pages { get; }

Visual page layout for the [Document][google.cloud.documentai.v1beta2.Document].

Property Value
TypeDescription
RepeatedField<Document.Types.Page>

ShardInfo

public Document.Types.ShardInfo ShardInfo { get; set; }

Information about the sharding if this document is sharded part of a larger document. If the document is not sharded, this message is not specified.

Property Value
TypeDescription
Document.Types.ShardInfo

SourceCase

public Document.SourceOneofCase SourceCase { get; }
Property Value
TypeDescription
Document.SourceOneofCase

Text

public string Text { get; set; }

UTF-8 encoded text in reading order from the document.

Property Value
TypeDescription
String

TextStyles

public RepeatedField<Document.Types.Style> TextStyles { get; }

Styles for the [Document.text][google.cloud.documentai.v1beta2.Document.text].

Property Value
TypeDescription
RepeatedField<Document.Types.Style>

Uri

public string Uri { get; set; }

Currently supports Google Cloud Storage URI of the form gs://bucket_name/object_name. Object versioning is not supported. See Google Cloud Storage Request URIs for more info.

Property Value
TypeDescription
String

Implements