StreamingRecognitionResult

A streaming speech recognition result corresponding to a portion of the audio that is currently being processed.

JSON representation
{
  "alternatives": [
    {
      object (SpeechRecognitionAlternative)
    }
  ],
  "isFinal": boolean,
  "stability": number,
  "resultEndOffset": string,
  "channelTag": integer,
  "languageCode": string
}
Fields
alternatives[]

object (SpeechRecognitionAlternative)

May contain one or more recognition hypotheses. These alternatives are ordered in terms of accuracy, with the top (first) alternative being the most probable, as ranked by the recognizer.

isFinal

boolean

If false, this StreamingRecognitionResult represents an interim result that may change. If true, this is the final time the speech service will return this particular StreamingRecognitionResult, the recognizer will not return any further hypotheses for this portion of the transcript and corresponding audio.

stability

number

An estimate of the likelihood that the recognizer will not change its guess about this interim result. Values range from 0.0 (completely unstable) to 1.0 (completely stable). This field is only provided for interim results (isFinal=false). The default of 0.0 is a sentinel value indicating stability was not set.

resultEndOffset

string (Duration format)

Time offset of the end of this result relative to the beginning of the audio.

A duration in seconds with up to nine fractional digits, ending with 's'. Example: "3.5s".

channelTag

integer

For multi-channel audio, this is the channel number corresponding to the recognized result for the audio from that channel. For audioChannelCount = N, its output values can range from 1 to N.

languageCode

string

Output only. The BCP-47 language tag of the language in this result. This language code was detected to have the most likelihood of being spoken in the audio.