public interface InputAudioConfigOrBuilder extends MessageOrBuilder
Implements
MessageOrBuilder
Methods
getAudioEncoding()
public abstract AudioEncoding getAudioEncoding()
Required. Audio encoding of the audio content to process.
.google.cloud.dialogflow.cx.v3.AudioEncoding audio_encoding = 1 [(.google.api.field_behavior) = REQUIRED];
Returns | |
---|---|
Type | Description |
AudioEncoding | The audioEncoding. |
getAudioEncodingValue()
public abstract int getAudioEncodingValue()
Required. Audio encoding of the audio content to process.
.google.cloud.dialogflow.cx.v3.AudioEncoding audio_encoding = 1 [(.google.api.field_behavior) = REQUIRED];
Returns | |
---|---|
Type | Description |
int | The enum numeric value on the wire for audioEncoding. |
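The difference between the enum getter and its `Value` counterpart matters mostly when the numeric value on the wire has no matching constant in the client's version of the enum. In protobuf-generated Java code for proto3 enums, the enum getter then reports `UNRECOGNIZED`, while the `Value` getter still exposes the raw number. The following is a minimal sketch of that behavior. The `Encoding` enum and the `toEnum` helper are hypothetical stand-ins written for this example, not the real `AudioEncoding` class, and only two values are listed for brevity.

```java
final class EncodingValueSketch {
    // Hypothetical stand-in for a generated proto3 enum. It is NOT the real
    // AudioEncoding class, and it lists only two of the values for brevity.
    enum Encoding {
        UNSPECIFIED(0),
        LINEAR_16(1),
        UNRECOGNIZED(-1);

        private final int number;

        Encoding(int number) {
            this.number = number;
        }

        // Returns the constant for a wire number, or null if none matches.
        static Encoding forNumber(int wireValue) {
            for (Encoding e : values()) {
                if (e != UNRECOGNIZED && e.number == wireValue) {
                    return e;
                }
            }
            return null;
        }
    }

    // Mimics an enum getter: unknown wire numbers collapse to UNRECOGNIZED.
    static Encoding toEnum(int wireValue) {
        Encoding known = Encoding.forNumber(wireValue);
        return known == null ? Encoding.UNRECOGNIZED : known;
    }

    public static void main(String[] args) {
        int fromNewerServer = 99; // a number this stand-in does not know
        System.out.println(toEnum(1));
        System.out.println(toEnum(fromNewerServer) + " (raw value " + fromNewerServer + ")");
    }
}
```

Code that only logs or forwards the encoding can work with the numeric value, whereas code that branches on the encoding usually reads the enum and handles `UNRECOGNIZED` explicitly.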
getEnableWordInfo()
public abstract boolean getEnableWordInfo()
Optional. If true, Dialogflow returns SpeechWordInfo in StreamingRecognitionResult with information about the recognized speech words, e.g. start and end time offsets. If false or unspecified, Speech doesn't return any word-level information.
bool enable_word_info = 13;
Returns | |
---|---|
Type | Description |
boolean | The enableWordInfo. |
getModel()
public abstract String getModel()
Optional. Which Speech model to select for the given request. Select the model best suited to your domain to get best results. If a model is not explicitly specified, then we auto-select a model based on the parameters in the InputAudioConfig. If enhanced speech model is enabled for the agent and an enhanced version of the specified model for the language does not exist, then the speech is recognized using the standard version of the specified model. Refer to Cloud Speech API documentation for more details. If you specify a model, the following models typically have the best performance:
- phone_call (best for Agent Assist and telephony)
- latest_short (best for Dialogflow non-telephony)
- command_and_search (best for very short utterances and commands)
string model = 7;
Returns | |
---|---|
Type | Description |
String | The model. |
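The model guidance above can be summarized as a small lookup. This is only a sketch: the `UseCase` enum and both helper methods are hypothetical names introduced for illustration, and the only facts taken from the description are the three model strings and the rule that an unspecified model is auto-selected by the service. It also assumes that an unset proto3 string field reads back as an empty string.

```java
final class ModelChoiceSketch {
    // Hypothetical use-case labels, introduced only for this example.
    enum UseCase { TELEPHONY, NON_TELEPHONY, SHORT_COMMANDS }

    // Maps a use case to the model name the description lists for it.
    static String recommendedModel(UseCase useCase) {
        switch (useCase) {
            case TELEPHONY:
                return "phone_call";
            case SHORT_COMMANDS:
                return "command_and_search";
            default:
                return "latest_short";
        }
    }

    // Assumption: an empty string means no model was explicitly specified.
    static String describe(String model) {
        return model.isEmpty() ? "(auto-selected by the service)" : model;
    }

    public static void main(String[] args) {
        for (UseCase useCase : UseCase.values()) {
            System.out.println(useCase + " -> " + recommendedModel(useCase));
        }
        System.out.println("unset -> " + describe(""));
    }
}
```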
getModelBytes()
public abstract ByteString getModelBytes()
Optional. Which Speech model to select for the given request. Select the model best suited to your domain to get best results. If a model is not explicitly specified, then we auto-select a model based on the parameters in the InputAudioConfig. If enhanced speech model is enabled for the agent and an enhanced version of the specified model for the language does not exist, then the speech is recognized using the standard version of the specified model. Refer to Cloud Speech API documentation for more details. If you specify a model, the following models typically have the best performance:
- phone_call (best for Agent Assist and telephony)
- latest_short (best for Dialogflow non-telephony)
- command_and_search (best for very short utterances and commands)
string model = 7;
Returns | |
---|---|
Type | Description |
ByteString | The bytes for model. |
getModelVariant()
public abstract SpeechModelVariant getModelVariant()
Optional. Which variant of the Speech model to use.
.google.cloud.dialogflow.cx.v3.SpeechModelVariant model_variant = 10;
Returns | |
---|---|
Type | Description |
SpeechModelVariant | The modelVariant. |
getModelVariantValue()
public abstract int getModelVariantValue()
Optional. Which variant of the Speech model to use.
.google.cloud.dialogflow.cx.v3.SpeechModelVariant model_variant = 10;
Returns | |
---|---|
Type | Description |
int | The enum numeric value on the wire for modelVariant. |
getPhraseHints(int index)
public abstract String getPhraseHints(int index)
Optional. A list of strings containing words and phrases that the speech recognizer should recognize with higher likelihood. See the Cloud Speech documentation for more details.
repeated string phrase_hints = 4;
Parameter | |
---|---|
Name | Description |
index | int The index of the element to return. |
Returns | |
---|---|
Type | Description |
String | The phraseHints at the given index. |
getPhraseHintsBytes(int index)
public abstract ByteString getPhraseHintsBytes(int index)
Optional. A list of strings containing words and phrases that the speech recognizer should recognize with higher likelihood. See the Cloud Speech documentation for more details.
repeated string phrase_hints = 4;
Parameter | |
---|---|
Name | Description |
index | int The index of the value to return. |
Returns | |
---|---|
Type | Description |
ByteString | The bytes of the phraseHints at the given index. |
getPhraseHintsCount()
public abstract int getPhraseHintsCount()
Optional. A list of strings containing words and phrases that the speech recognizer should recognize with higher likelihood. See the Cloud Speech documentation for more details.
repeated string phrase_hints = 4;
Returns | |
---|---|
Type | Description |
int | The count of phraseHints. |
getPhraseHintsList()
public abstract List<String> getPhraseHintsList()
Optional. A list of strings containing words and phrases that the speech recognizer should recognize with higher likelihood. See the Cloud Speech documentation for more details.
repeated string phrase_hints = 4;
Returns | |
---|---|
Type | Description |
List<String> | A list containing the phraseHints. |
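The repeated `phrase_hints` field is exposed through three accessors: a count, an indexed getter, and a list view. The sketch below shows how they relate. `PhraseHintsView` is a hypothetical stand-in interface that copies only these three method signatures so that the example runs without the client library, and `of` and `collectByIndex` are illustrative helpers rather than part of any real API.

```java
import java.util.ArrayList;
import java.util.List;

final class PhraseHintsSketch {
    // Hypothetical stand-in with the same three accessor signatures.
    interface PhraseHintsView {
        int getPhraseHintsCount();

        String getPhraseHints(int index);

        List<String> getPhraseHintsList();
    }

    // Builds an immutable in-memory view for the example.
    static PhraseHintsView of(String... hints) {
        final List<String> copy = List.of(hints);
        return new PhraseHintsView() {
            @Override
            public int getPhraseHintsCount() {
                return copy.size();
            }

            @Override
            public String getPhraseHints(int index) {
                return copy.get(index);
            }

            @Override
            public List<String> getPhraseHintsList() {
                return copy;
            }
        };
    }

    // Walks the field with the count and indexed getters.
    static List<String> collectByIndex(PhraseHintsView view) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < view.getPhraseHintsCount(); i++) {
            out.add(view.getPhraseHints(i));
        }
        return out;
    }

    public static void main(String[] args) {
        PhraseHintsView view = of("checking account", "wire transfer");
        // Both access styles yield the same elements in the same order.
        System.out.println(collectByIndex(view).equals(view.getPhraseHintsList()));
    }
}
```

In most code the list view is the more convenient choice; the indexed form is mainly useful when only one element is needed.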
getSampleRateHertz()
public abstract int getSampleRateHertz()
Sample rate (in Hertz) of the audio content sent in the query. Refer to Cloud Speech API documentation for more details.
int32 sample_rate_hertz = 2;
Returns | |
---|---|
Type | Description |
int | The sampleRateHertz. |
getSingleUtterance()
public abstract boolean getSingleUtterance()
Optional. If false (default), recognition does not cease until the client closes the stream. If true, the recognizer will detect a single spoken utterance in input audio. Recognition ceases when it detects the audio's voice has stopped or paused. In this case, once a detected intent is received, the client should close the stream and start a new request with a new stream as needed.
Note: This setting is relevant only for streaming methods.
bool single_utterance = 8;
Returns | |
---|---|
Type | Description |
boolean | The singleUtterance. |
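The client-side rule described for `single_utterance` can be written as a small predicate. This is a sketch of the decision only, not of a real streaming call: `shouldCloseStream` and its `intentReceived` parameter are hypothetical names, and the code assumes the caller already tracks whether a detected intent has arrived on the current stream.

```java
final class SingleUtteranceSketch {
    // With single_utterance enabled, the client closes the stream once a
    // detected intent arrives. Otherwise the stream stays open until the
    // client decides to close it for its own reasons.
    static boolean shouldCloseStream(boolean singleUtterance, boolean intentReceived) {
        return singleUtterance && intentReceived;
    }

    public static void main(String[] args) {
        System.out.println(shouldCloseStream(true, true));
        System.out.println(shouldCloseStream(true, false));
        System.out.println(shouldCloseStream(false, true));
    }
}
```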