public sealed class InputAudioConfig : IMessage<InputAudioConfig>, IEquatable<InputAudioConfig>, IDeepCloneable<InputAudioConfig>, IBufferMessage, IMessage
Reference documentation and code samples for the Dialogflow v3 API class InputAudioConfig.
Instructs the speech recognizer on how to process the audio content.
Implements
IMessage<InputAudioConfig>, IEquatable<InputAudioConfig>, IDeepCloneable<InputAudioConfig>, IBufferMessage, IMessage
Namespace
Google.Cloud.Dialogflow.Cx.V3
Assembly
Google.Cloud.Dialogflow.Cx.V3.dll
Constructors
InputAudioConfig()
public InputAudioConfig()
InputAudioConfig(InputAudioConfig)
public InputAudioConfig(InputAudioConfig other)
Parameter | |
---|---|
Name | Description |
other | InputAudioConfig |
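The two constructors can be compared with a short sketch. It assumes the Google.Cloud.Dialogflow.Cx.V3 NuGet package is referenced; the sample rates are illustrative values, not recommendations. As with other generated protobuf messages, the copy constructor yields an independent copy, so changing the copy leaves the original untouched.

```csharp
using System;
using Google.Cloud.Dialogflow.Cx.V3;

// Parameterless constructor plus an object initializer.
var original = new InputAudioConfig
{
    AudioEncoding = AudioEncoding.Linear16,
    SampleRateHertz = 16000, // illustrative value
};

// Copy constructor: the new message does not share state with the original.
var copy = new InputAudioConfig(original);
copy.SampleRateHertz = 8000;

Console.WriteLine(original.SampleRateHertz); // 16000
Console.WriteLine(copy.SampleRateHertz);     // 8000
Console.WriteLine(original.Equals(copy));    // False
```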
Properties
AudioEncoding
public AudioEncoding AudioEncoding { get; set; }
Required. Audio encoding of the audio content to process.
Property Value | |
---|---|
Type | Description |
AudioEncoding |
BargeInConfig
public BargeInConfig BargeInConfig { get; set; }
Configuration of barge-in behavior during the streaming of input audio.
Property Value | |
---|---|
Type | Description |
BargeInConfig |
EnableWordInfo
public bool EnableWordInfo { get; set; }
Optional. If true, Dialogflow returns SpeechWordInfo in StreamingRecognitionResult with information about the recognized speech words, e.g. start and end time offsets. If false or unspecified, Speech doesn't return any word-level information.
Property Value | |
---|---|
Type | Description |
bool |
Model
public string Model { get; set; }
Optional. Which Speech model to select for the given request. Select the model best suited to your domain for best results. If a model is not explicitly specified, Dialogflow auto-selects a model based on other parameters in the InputAudioConfig and the Agent settings. If the enhanced speech model is enabled for the agent and an enhanced version of the specified model does not exist for the language, the speech is recognized using the standard version of the specified model. Refer to the Cloud Speech API documentation for more details. If you specify a model, the following models typically have the best performance:
- phone_call (best for Agent Assist and telephony)
- latest_short (best for Dialogflow non-telephony)
- command_and_search
Leave this field unspecified to use Agent Speech settings for model selection.
Property Value | |
---|---|
Type | Description |
string |
ModelVariant
public SpeechModelVariant ModelVariant { get; set; }
Optional. Which variant of the Speech model (see Model) to use.
Property Value | |
---|---|
Type | Description |
SpeechModelVariant |
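A minimal sketch of setting Model together with ModelVariant, assuming the Google.Cloud.Dialogflow.Cx.V3 NuGet package is referenced. The telephony scenario, the 8000 Hz sample rate, and the choice of variant are assumptions made for illustration only.

```csharp
using System;
using Google.Cloud.Dialogflow.Cx.V3;

// Hypothetical telephony setup: explicit model, best available variant.
var config = new InputAudioConfig
{
    AudioEncoding = AudioEncoding.Linear16,
    SampleRateHertz = 8000, // illustrative value
    Model = "phone_call",
    ModelVariant = SpeechModelVariant.UseBestAvailable,
};

Console.WriteLine(config.Model); // phone_call

// Leaving Model unset (empty string) defers model selection
// to the Agent Speech settings.
var deferToAgent = new InputAudioConfig
{
    AudioEncoding = AudioEncoding.Linear16,
    SampleRateHertz = 8000,
};
Console.WriteLine(deferToAgent.Model.Length); // 0
```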
PhraseHints
public RepeatedField<string> PhraseHints { get; }
Optional. A list of strings containing words and phrases that the speech recognizer should recognize with higher likelihood.
See the Cloud Speech documentation for more details.
Property Value | |
---|---|
Type | Description |
RepeatedField<string> |
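Because PhraseHints has no setter, hints are added to the existing collection rather than assigned. The sketch below assumes the Google.Cloud.Dialogflow.Cx.V3 NuGet package is referenced; the phrases themselves are made-up examples.

```csharp
using System;
using Google.Cloud.Dialogflow.Cx.V3;

var config = new InputAudioConfig
{
    AudioEncoding = AudioEncoding.Linear16,
    SampleRateHertz = 16000, // illustrative value
    // A collection initializer calls Add on the existing RepeatedField.
    PhraseHints = { "order status", "tracking number" },
};

// Hints can also be appended after construction.
config.PhraseHints.Add("return label");
config.PhraseHints.AddRange(new[] { "refund", "exchange" });

Console.WriteLine(config.PhraseHints.Count); // 5
```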
SampleRateHertz
public int SampleRateHertz { get; set; }
Sample rate (in Hertz) of the audio content sent in the query. Refer to Cloud Speech API documentation for more details.
Property Value | |
---|---|
Type | Description |
int |
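The sample rate and encoding should describe the audio bytes that are actually sent. The following sketch pairs a config with raw audio in an AudioInput and a QueryInput, assuming the Google.Cloud.Dialogflow.Cx.V3 NuGet package is referenced. The silent buffer and the "en-US" language code are placeholders, not real input.

```csharp
using System;
using Google.Cloud.Dialogflow.Cx.V3;
using Google.Protobuf;

// Placeholder audio: 100 ms of silence as 16-bit mono PCM at 16 kHz
// (16000 samples/s * 2 bytes * 0.1 s = 3200 bytes).
byte[] pcm = new byte[3200];

var audioInput = new AudioInput
{
    Config = new InputAudioConfig
    {
        AudioEncoding = AudioEncoding.Linear16, // matches the PCM buffer
        SampleRateHertz = 16000,                // matches the PCM buffer
    },
    Audio = ByteString.CopyFrom(pcm),
};

var queryInput = new QueryInput
{
    Audio = audioInput,
    LanguageCode = "en-US", // placeholder language
};

Console.WriteLine(queryInput.Audio.Audio.Length); // 3200
```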
SingleUtterance
public bool SingleUtterance { get; set; }
Optional. If false (default), recognition does not cease until the client closes the stream.
If true, the recognizer will detect a single spoken utterance in input audio. Recognition ceases when it detects the audio's voice has stopped or paused. In this case, once a detected intent is received, the client should close the stream and start a new request with a new stream as needed.
Note: This setting is relevant only for streaming methods.
Property Value | |
---|---|
Type | Description |
bool |
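As a rough illustration of a streaming-oriented configuration, the sketch below enables SingleUtterance and EnableWordInfo and wraps the config in a QueryInput that carries no audio bytes yet. It assumes the Google.Cloud.Dialogflow.Cx.V3 NuGet package is referenced; the variable names and the "en-US" language code are hypothetical, and the streaming call itself is omitted.

```csharp
using System;
using Google.Cloud.Dialogflow.Cx.V3;

var streamingConfig = new InputAudioConfig
{
    AudioEncoding = AudioEncoding.Linear16,
    SampleRateHertz = 16000, // illustrative value
    SingleUtterance = true,  // stop recognizing after one utterance
    EnableWordInfo = true,   // request word-level timing information
};

// Config-only query input; audio chunks would follow in later messages.
var firstQueryInput = new QueryInput
{
    Audio = new AudioInput { Config = streamingConfig },
    LanguageCode = "en-US", // placeholder language
};

Console.WriteLine(firstQueryInput.Audio.Config.SingleUtterance); // True
```

With SingleUtterance set to true, the client is expected to close the stream once a detected intent arrives and to open a new stream for the next utterance.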