StreamingRecognitionResult

JSON representation

A streaming speech recognition result corresponding to a portion of the audio that is currently being processed.

JSON representation
{ "alternatives": [ { object (`SpeechRecognitionAlternative`) } ], "isFinal": boolean, "stability": number, "resultEndOffset": string, "channelTag": integer, "languageCode": string }

Fields
`alternatives[]`	`object (SpeechRecognitionAlternative)` May contain one or more recognition hypotheses. These alternatives are ordered in terms of accuracy, with the top (first) alternative being the most probable, as ranked by the recognizer.
`isFinal`	`boolean` If `false`, this `StreamingRecognitionResult` represents an interim result that may change. If `true`, this is the final time the speech service will return this particular `StreamingRecognitionResult`, the recognizer will not return any further hypotheses for this portion of the transcript and corresponding audio.
`stability`	`number` An estimate of the likelihood that the recognizer will not change its guess about this interim result. Values range from 0.0 (completely unstable) to 1.0 (completely stable). This field is only provided for interim results (`isFinal`=`false`). The default of 0.0 is a sentinel value indicating `stability` was not set.
`resultEndOffset`	`string (Duration format)` Time offset of the end of this result relative to the beginning of the audio. A duration in seconds with up to nine fractional digits, ending with '`s`'. Example: `"3.5s"`.
`channelTag`	`integer` For multi-channel audio, this is the channel number corresponding to the recognized result for the audio from that channel. For `audioChannelCount` = `N`, its output values can range from `1` to `N`.
`languageCode`	`string` Output only. The BCP-47 language tag of the language in this result. This language code was detected to have the most likelihood of being spoken in the audio.