Dialogflow voice agents use Speech-to-Text for speech recognition, which is included in Dialogflow pricing. Dialogflow automatically selects a speech recognition model for you, but you can optionally specify the model.
Available models
All available models are listed at Speech-to-Text models. Select a model that is best suited to your domain and supports your agent language and speech features.
If a model is not explicitly specified, then Dialogflow auto-selects a model based on the audio configuration in API requests and agent settings.
The following models typically have the best performance:
- telephony_short (best for telephony Dialogflow)
- telephony (best for Agent Assist; also good for telephony Dialogflow when advanced timeout-based end of speech sensitivity is enabled)
- phone_call (good for Agent Assist and telephony Dialogflow)
- latest_short (best for non-telephony Dialogflow)
- command_and_search (best for languages where other models are not available)
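The recommendations above can be captured as a small lookup, sketched here in Python. This is only an illustration: the function name, the `product` labels, and the `available` parameter are hypothetical, and the set of models supported for your agent language has to come from the Speech-to-Text models list.

```python
from typing import Optional, Set

# Recommendations copied from the list above.
# The (product, telephony) keys are hypothetical labels used only in this sketch.
RECOMMENDED_MODELS = {
    ("dialogflow", True): "telephony_short",
    ("dialogflow", False): "latest_short",
    ("agent_assist", True): "telephony",
}

# Suggested above for languages where other models are not available.
FALLBACK_MODEL = "command_and_search"


def recommend_model(product: str, telephony: bool, available: Set[str]) -> Optional[str]:
    """Return a recommended model name, or None to leave the choice to Dialogflow.

    `available` is the set of models that support the agent language; the
    caller is assumed to have looked this up beforehand.
    """
    preferred = RECOMMENDED_MODELS.get((product, telephony))
    if preferred in available:
        return preferred
    if FALLBACK_MODEL in available:
        return FALLBACK_MODEL
    # Nothing suitable: do not specify a model and let Dialogflow auto-select.
    return None
```

Returning `None` mirrors the default behavior described earlier: when no model is specified, Dialogflow auto-selects one.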
Specify a model
You can supply the model for an agent, flow, or page with the model selection setting. You can also supply the model when calling the Sessions.detectIntent or Sessions.streamingDetectIntent methods, or in a ConversationProfile for Agent Assist.
Select a protocol and version for the Session reference:
Protocol | V3 | V3beta1 |
---|---|---|
REST | Session resource | Session resource |
RPC | Session interface | Session interface |
C++ | SessionsClient | Not available |
C# | SessionsClient | Not available |
Go | SessionsClient | Not available |
Java | SessionsClient | SessionsClient |
Node.js | SessionsClient | SessionsClient |
PHP | Not available | Not available |
Python | SessionsClient | SessionsClient |
Ruby | Not available | Not available |
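As a rough illustration of supplying the model in a detect intent request, the following Python sketch builds a REST request body using only the standard library. The helper name `build_detect_intent_body` is hypothetical, and the JSON field names (`queryInput`, `audio`, `config`, `audioEncoding`, `sampleRateHertz`, `model`, `languageCode`) are assumed from the V3 REST Session reference; check them against the reference for your protocol and version before relying on them.

```python
import base64
import json


def build_detect_intent_body(audio_bytes: bytes, language_code: str, model: str,
                             sample_rate_hertz: int = 16000) -> dict:
    """Build a detectIntent request body that names a speech model explicitly.

    Field names are assumptions based on the V3 REST reference.
    """
    return {
        "queryInput": {
            "audio": {
                "config": {
                    # Assumed encoding for this sketch: 16-bit linear PCM.
                    "audioEncoding": "AUDIO_ENCODING_LINEAR_16",
                    "sampleRateHertz": sample_rate_hertz,
                    # The speech recognition model, for example "telephony_short".
                    "model": model,
                },
                # JSON carries binary audio as a base64-encoded string.
                "audio": base64.b64encode(audio_bytes).decode("ascii"),
            },
            "languageCode": language_code,
        }
    }


if __name__ == "__main__":
    body = build_detect_intent_body(b"\x00\x01", "en", "telephony_short")
    print(json.dumps(body, indent=2))
```

The body would then be sent to the session's detectIntent endpoint with an authenticated HTTP client; that step is omitted here because it depends on your project, location, and credentials.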
Specifying the model in a detect intent or conversation profile API call will
override any model selections applied to the agent, flow, or page, unless you
enable the Override request-level speech
model setting.
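The precedence described above can be sketched as a small Python function. This is a simplified model of the documented behavior, not Dialogflow's implementation: the function and parameter names are hypothetical, the agent, flow, and page selections are collapsed into a single `settings_model` value, and the behavior when the override setting is enabled but no model is configured in settings is an assumption.

```python
from typing import Optional


def effective_model(request_model: Optional[str],
                    settings_model: Optional[str],
                    override_request_level: bool = False) -> Optional[str]:
    """Return the model that would take effect, or None for auto-selection.

    request_model: model named in the detect intent or conversation profile call.
    settings_model: model selected on the agent, flow, or page (collapsed here).
    override_request_level: whether the "Override request-level speech model"
        setting is enabled.
    """
    if override_request_level and settings_model:
        # With the override enabled, the configured selection wins.
        return settings_model
    # Otherwise the request-level model takes priority when present.
    # Assumption: with the override enabled but nothing configured,
    # the request-level model still applies.
    return request_model or settings_model
```

For example, `effective_model("telephony", "latest_short")` yields `"telephony"`, while passing `override_request_level=True` yields `"latest_short"`. A result of `None` corresponds to Dialogflow auto-selecting a model.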