- 1.37.0 (latest)
- 1.36.0
- 1.35.0
- 1.34.1
- 1.33.0
- 1.32.1
- 1.31.1
- 1.30.1
- 1.29.0
- 1.28.0
- 1.27.0
- 1.26.0
- 1.25.0
- 1.24.0
- 1.23.1
- 1.22.0
- 1.21.0
- 1.20.1
- 1.19.0
- 1.18.0
- 1.17.1
- 1.16.0
- 1.15.0
- 1.14.1
- 1.13.5
- 1.12.1
- 1.11.0
- 1.10.0
- 1.9.1
- 1.8.0
- 1.7.0
- 1.6.0
- 1.5.0
- 1.4.0
- 1.3.2
- 1.2.0
- 1.1.1
- 1.0.0
- 0.8.2
- 0.7.1
- 0.6.0
- 0.5.0
- 0.4.1
- 0.3.0
- 0.2.0
- 0.1.1
InputAudioConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)
Instructs the speech recognizer on how to process the audio content.
Attributes:
audio_encoding (google.cloud.dialogflowcx_v3beta1.types.AudioEncoding):
Required. Audio encoding of the audio content
to process.
sample_rate_hertz (int):
Sample rate (in Hertz) of the audio content sent in the
query. Refer to Cloud Speech API
documentation <https://cloud.google.com/speech-to-text/docs/basics>
__
for more details.
enable_word_info (bool):
Optional. If true
, Dialogflow returns
[SpeechWordInfo][google.cloud.dialogflow.cx.v3beta1.SpeechWordInfo]
in
[StreamingRecognitionResult][google.cloud.dialogflow.cx.v3beta1.StreamingRecognitionResult]
with information about the recognized speech words, e.g.
start and end time offsets. If false or unspecified, Speech
doesn't return any word-level information.
phrase_hints (Sequence[str]):
Optional. A list of strings containing words and phrases
that the speech recognizer should recognize with higher
likelihood.
See `the Cloud Speech
documentation <https://cloud.google.com/speech-to-text/docs/basics#phrase-hints>`__
for more details.
model (str):
Optional. Which Speech model to select for the given
request. Select the model best suited to your domain to get
best results. If a model is not explicitly specified, then
we auto-select a model based on the parameters in the
InputAudioConfig. If enhanced speech model is enabled for
the agent and an enhanced version of the specified model for
the language does not exist, then the speech is recognized
using the standard version of the specified model. Refer to
`Cloud Speech API
documentation <https://cloud.google.com/speech-to-text/docs/basics#select-model>`__
for more details.
model_variant (google.cloud.dialogflowcx_v3beta1.types.SpeechModelVariant):
Optional. Which variant of the [Speech
model][google.cloud.dialogflow.cx.v3beta1.InputAudioConfig.model]
to use.
single_utterance (bool):
Optional. If ``false`` (default), recognition does not cease
until the client closes the stream. If ``true``, the
recognizer will detect a single spoken utterance in input
audio. Recognition ceases when it detects the audio's voice
has stopped or paused. In this case, once a detected intent
is received, the client should close the stream and start a
new request with a new stream as needed. Note: This setting
is relevant only for streaming methods.