public final class InputAudioConfig extends GeneratedMessageV3 implements InputAudioConfigOrBuilder
Instructs the speech recognizer on how to process the audio content.
Protobuf type google.cloud.dialogflow.cx.v3.InputAudioConfig
Static Fields
public static final int AUDIO_ENCODING_FIELD_NUMBER
Field Value |
Type |
Description |
int |
|
public static final int ENABLE_WORD_INFO_FIELD_NUMBER
Field Value |
Type |
Description |
int |
|
public static final int MODEL_FIELD_NUMBER
Field Value |
Type |
Description |
int |
|
public static final int MODEL_VARIANT_FIELD_NUMBER
Field Value |
Type |
Description |
int |
|
public static final int PHRASE_HINTS_FIELD_NUMBER
Field Value |
Type |
Description |
int |
|
public static final int SAMPLE_RATE_HERTZ_FIELD_NUMBER
Field Value |
Type |
Description |
int |
|
public static final int SINGLE_UTTERANCE_FIELD_NUMBER
Field Value |
Type |
Description |
int |
|
Static Methods
public static InputAudioConfig getDefaultInstance()
public static final Descriptors.Descriptor getDescriptor()
public static InputAudioConfig.Builder newBuilder()
public static InputAudioConfig.Builder newBuilder(InputAudioConfig prototype)
public static InputAudioConfig parseDelimitedFrom(InputStream input)
public static InputAudioConfig parseDelimitedFrom(InputStream input, ExtensionRegistryLite extensionRegistry)
public static InputAudioConfig parseFrom(byte[] data)
Parameter |
Name |
Description |
data |
byte[]
|
public static InputAudioConfig parseFrom(byte[] data, ExtensionRegistryLite extensionRegistry)
public static InputAudioConfig parseFrom(ByteString data)
public static InputAudioConfig parseFrom(ByteString data, ExtensionRegistryLite extensionRegistry)
public static InputAudioConfig parseFrom(CodedInputStream input)
public static InputAudioConfig parseFrom(CodedInputStream input, ExtensionRegistryLite extensionRegistry)
public static InputAudioConfig parseFrom(InputStream input)
public static InputAudioConfig parseFrom(InputStream input, ExtensionRegistryLite extensionRegistry)
public static InputAudioConfig parseFrom(ByteBuffer data)
public static InputAudioConfig parseFrom(ByteBuffer data, ExtensionRegistryLite extensionRegistry)
public static Parser<InputAudioConfig> parser()
Methods
public boolean equals(Object obj)
Parameter |
Name |
Description |
obj |
Object
|
Overrides
public AudioEncoding getAudioEncoding()
Required. Audio encoding of the audio content to process.
.google.cloud.dialogflow.cx.v3.AudioEncoding audio_encoding = 1 [(.google.api.field_behavior) = REQUIRED];
public int getAudioEncodingValue()
Required. Audio encoding of the audio content to process.
.google.cloud.dialogflow.cx.v3.AudioEncoding audio_encoding = 1 [(.google.api.field_behavior) = REQUIRED];
Returns |
Type |
Description |
int |
The enum numeric value on the wire for audioEncoding.
|
public InputAudioConfig getDefaultInstanceForType()
public boolean getEnableWordInfo()
Optional. If true
, Dialogflow returns
SpeechWordInfo in
StreamingRecognitionResult
with information about the recognized speech words, e.g. start and end time
offsets. If false or unspecified, Speech doesn't return any word-level
information.
bool enable_word_info = 13;
Returns |
Type |
Description |
boolean |
The enableWordInfo.
|
Optional. Which Speech model to select for the given request. Select the
model best suited to your domain to get best results. If a model is not
explicitly specified, then we auto-select a model based on the parameters
in the InputAudioConfig.
If enhanced speech model is enabled for the agent and an enhanced
version of the specified model for the language does not exist, then the
speech is recognized using the standard version of the specified model.
Refer to
Cloud Speech API
documentation
for more details.
If you specify a model, the following models typically have the best
performance:
- phone_call (best for Agent Assist and telephony)
- latest_short (best for Dialogflow non-telephony)
- command_and_search (best for very short utterances and commands)
string model = 7;
Returns |
Type |
Description |
String |
The model.
|
public ByteString getModelBytes()
Optional. Which Speech model to select for the given request. Select the
model best suited to your domain to get best results. If a model is not
explicitly specified, then we auto-select a model based on the parameters
in the InputAudioConfig.
If enhanced speech model is enabled for the agent and an enhanced
version of the specified model for the language does not exist, then the
speech is recognized using the standard version of the specified model.
Refer to
Cloud Speech API
documentation
for more details.
If you specify a model, the following models typically have the best
performance:
- phone_call (best for Agent Assist and telephony)
- latest_short (best for Dialogflow non-telephony)
- command_and_search (best for very short utterances and commands)
string model = 7;
Returns |
Type |
Description |
ByteString |
The bytes for model.
|
public SpeechModelVariant getModelVariant()
Optional. Which variant of the Speech
model to use.
.google.cloud.dialogflow.cx.v3.SpeechModelVariant model_variant = 10;
public int getModelVariantValue()
Optional. Which variant of the Speech
model to use.
.google.cloud.dialogflow.cx.v3.SpeechModelVariant model_variant = 10;
Returns |
Type |
Description |
int |
The enum numeric value on the wire for modelVariant.
|
public Parser<InputAudioConfig> getParserForType()
Overrides
public String getPhraseHints(int index)
Optional. A list of strings containing words and phrases that the speech
recognizer should recognize with higher likelihood.
See the Cloud Speech
documentation
for more details.
repeated string phrase_hints = 4;
Parameter |
Name |
Description |
index |
int
The index of the element to return.
|
Returns |
Type |
Description |
String |
The phraseHints at the given index.
|
public ByteString getPhraseHintsBytes(int index)
Optional. A list of strings containing words and phrases that the speech
recognizer should recognize with higher likelihood.
See the Cloud Speech
documentation
for more details.
repeated string phrase_hints = 4;
Parameter |
Name |
Description |
index |
int
The index of the value to return.
|
Returns |
Type |
Description |
ByteString |
The bytes of the phraseHints at the given index.
|
public int getPhraseHintsCount()
Optional. A list of strings containing words and phrases that the speech
recognizer should recognize with higher likelihood.
See the Cloud Speech
documentation
for more details.
repeated string phrase_hints = 4;
Returns |
Type |
Description |
int |
The count of phraseHints.
|
public ProtocolStringList getPhraseHintsList()
Optional. A list of strings containing words and phrases that the speech
recognizer should recognize with higher likelihood.
See the Cloud Speech
documentation
for more details.
repeated string phrase_hints = 4;
public int getSampleRateHertz()
Returns |
Type |
Description |
int |
The sampleRateHertz.
|
public int getSerializedSize()
Returns |
Type |
Description |
int |
|
Overrides
public boolean getSingleUtterance()
Optional. If false
(default), recognition does not cease until the
client closes the stream.
If true
, the recognizer will detect a single spoken utterance in input
audio. Recognition ceases when it detects the audio's voice has
stopped or paused. In this case, once a detected intent is received, the
client should close the stream and start a new request with a new stream as
needed.
Note: This setting is relevant only for streaming methods.
bool single_utterance = 8;
Returns |
Type |
Description |
boolean |
The singleUtterance.
|
public final UnknownFieldSet getUnknownFields()
Overrides
Returns |
Type |
Description |
int |
|
Overrides
protected GeneratedMessageV3.FieldAccessorTable internalGetFieldAccessorTable()
Overrides
public final boolean isInitialized()
Overrides
public InputAudioConfig.Builder newBuilderForType()
protected InputAudioConfig.Builder newBuilderForType(GeneratedMessageV3.BuilderParent parent)
Overrides
protected Object newInstance(GeneratedMessageV3.UnusedPrivateParameter unused)
Returns |
Type |
Description |
Object |
|
Overrides
public InputAudioConfig.Builder toBuilder()
public void writeTo(CodedOutputStream output)
Overrides