Interface WordInfoOrBuilder (4.46.0)

public interface WordInfoOrBuilder extends MessageOrBuilder

Implements

MessageOrBuilder

Methods

getConfidence()

public abstract float getConfidence()

The confidence estimate between 0.0 and 1.0. A higher number indicates an estimated greater likelihood that the recognized words are correct. This field is set only for the top alternative of a non-streaming result or, of a streaming result where is_final=true. This field is not guaranteed to be accurate and users should not rely on it to be always provided. The default of 0.0 is a sentinel value indicating confidence was not set.

float confidence = 4;

Returns
Type Description
float

The confidence.

getEndTime()

public abstract Duration getEndTime()

Time offset relative to the beginning of the audio, and corresponding to the end of the spoken word. This field is only set if enable_word_time_offsets=true and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary.

.google.protobuf.Duration end_time = 2;

Returns
Type Description
Duration

The endTime.

getEndTimeOrBuilder()

public abstract DurationOrBuilder getEndTimeOrBuilder()

Time offset relative to the beginning of the audio, and corresponding to the end of the spoken word. This field is only set if enable_word_time_offsets=true and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary.

.google.protobuf.Duration end_time = 2;

Returns
Type Description
DurationOrBuilder

getSpeakerLabel()

public abstract String getSpeakerLabel()

Output only. A label value assigned for every unique speaker within the audio. This field specifies which speaker was detected to have spoken this word. For some models, like medical_conversation this can be actual speaker role, for example "patient" or "provider", but generally this would be a number identifying a speaker. This field is only set if enable_speaker_diarization = 'true' and only for the top alternative.

string speaker_label = 6 [(.google.api.field_behavior) = OUTPUT_ONLY];

Returns
Type Description
String

The speakerLabel.

getSpeakerLabelBytes()

public abstract ByteString getSpeakerLabelBytes()

Output only. A label value assigned for every unique speaker within the audio. This field specifies which speaker was detected to have spoken this word. For some models, like medical_conversation this can be actual speaker role, for example "patient" or "provider", but generally this would be a number identifying a speaker. This field is only set if enable_speaker_diarization = 'true' and only for the top alternative.

string speaker_label = 6 [(.google.api.field_behavior) = OUTPUT_ONLY];

Returns
Type Description
ByteString

The bytes for speakerLabel.

getSpeakerTag() (deprecated)

public abstract int getSpeakerTag()

Deprecated. google.cloud.speech.v1.WordInfo.speaker_tag is deprecated. See google/cloud/speech/v1/cloud_speech.proto;l=974

Output only. A distinct integer value is assigned for every speaker within the audio. This field specifies which one of those speakers was detected to have spoken this word. Value ranges from '1' to diarization_speaker_count. speaker_tag is set if enable_speaker_diarization = 'true' and only for the top alternative. Note: Use speaker_label instead.

int32 speaker_tag = 5 [deprecated = true, (.google.api.field_behavior) = OUTPUT_ONLY];

Returns
Type Description
int

The speakerTag.

getStartTime()

public abstract Duration getStartTime()

Time offset relative to the beginning of the audio, and corresponding to the start of the spoken word. This field is only set if enable_word_time_offsets=true and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary.

.google.protobuf.Duration start_time = 1;

Returns
Type Description
Duration

The startTime.

getStartTimeOrBuilder()

public abstract DurationOrBuilder getStartTimeOrBuilder()

Time offset relative to the beginning of the audio, and corresponding to the start of the spoken word. This field is only set if enable_word_time_offsets=true and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary.

.google.protobuf.Duration start_time = 1;

Returns
Type Description
DurationOrBuilder

getWord()

public abstract String getWord()

The word corresponding to this set of information.

string word = 3;

Returns
Type Description
String

The word.

getWordBytes()

public abstract ByteString getWordBytes()

The word corresponding to this set of information.

string word = 3;

Returns
Type Description
ByteString

The bytes for word.

hasEndTime()

public abstract boolean hasEndTime()

Time offset relative to the beginning of the audio, and corresponding to the end of the spoken word. This field is only set if enable_word_time_offsets=true and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary.

.google.protobuf.Duration end_time = 2;

Returns
Type Description
boolean

Whether the endTime field is set.

hasStartTime()

public abstract boolean hasStartTime()

Time offset relative to the beginning of the audio, and corresponding to the start of the spoken word. This field is only set if enable_word_time_offsets=true and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary.

.google.protobuf.Duration start_time = 1;

Returns
Type Description
boolean

Whether the startTime field is set.