Enum EncodingType (2.2.0)

public enum EncodingType

Represents the text encoding that the caller uses to process the output. Providing an EncodingType is recommended because the API provides the beginning offsets for various outputs, such as tokens and mentions, and languages that natively use different text encodings may access offsets differently.

Namespace

Google.Cloud.Language.V1

Assembly

Google.Cloud.Language.V1.dll

Fields

Name

Description
None

If EncodingType is not specified, encoding-dependent information (such as begin_offset) is set to -1.

Utf16

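Encoding-dependent information (such as begin_offset) is calculated based on the UTF-16 encoding of the input. Java and JavaScript are examples of languages that use this encoding natively. .NET strings are also UTF-16, so offsets computed with this value correspond to indices into a C# string.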

Utf32

Encoding-dependent information (such as begin_offset) is calculated based on the UTF-32 encoding of the input. Python is an example of a language that uses this encoding natively.

Utf8

Encoding-dependent information (such as begin_offset) is calculated based on the UTF-8 encoding of the input. C++ and Go are examples of languages that use this encoding natively.
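
To illustrate why the three encodings give different offsets for the same text, the following Python sketch counts how many code units precede a word under each encoding. It does not call the API or the client library; the helper `begin_offset` and the sample text are hypothetical and only mimic how such an offset would be counted.

```python
# Illustrative only: begin_offset is a hypothetical helper, not part of
# Google.Cloud.Language.V1 or any Google client library.

# Size in bytes of one code unit for each encoding. The "-le" codec
# variants are used so that no byte order mark is added to the output.
CODE_UNIT_SIZE = {"utf-8": 1, "utf-16-le": 2, "utf-32-le": 4}


def begin_offset(text: str, index: int, encoding: str) -> int:
    """Return the number of code units of `encoding` that precede
    the character at code point position `index` in `text`."""
    prefix_bytes = text[:index].encode(encoding)
    return len(prefix_bytes) // CODE_UNIT_SIZE[encoding]


# Sample text with a two-byte character (U+00EF), an emoji outside the
# Basic Multilingual Plane (U+1F600), and then the word of interest.
text = "na\u00efve \U0001F600 caf\u00e9"
word = "caf\u00e9"
index = text.index(word)  # code point position of the word

offsets = {
    "Utf8": begin_offset(text, index, "utf-8"),
    "Utf16": begin_offset(text, index, "utf-16-le"),
    "Utf32": begin_offset(text, index, "utf-32-le"),
}
print(offsets)  # {'Utf8': 12, 'Utf16': 9, 'Utf32': 8}

# Python strings are indexed by code point, so the Utf32 offset can be
# used directly as a string index.
recovered = text[offsets["Utf32"]:offsets["Utf32"] + len(word)]
print(recovered == word)  # True
```

The word starts at 12 in UTF-8 (bytes), 9 in UTF-16 (the emoji takes two code units), and 8 in UTF-32 (one unit per code point). Requesting the encoding that matches the caller's string type avoids having to convert between these values.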