Instructs the speech synthesizer how to generate the output audio content. If this audio config is supplied in a request, it overrides all existing text-to-speech settings applied to the agent.
Required. Audio encoding of the synthesized audio content.
sampleRateHertz
integer
The synthesis sample rate (in hertz) for this audio. If not provided, then the synthesizer will use the default sample rate based on the audio encoding. If this is different from the voice's natural sample rate, then the synthesizer will honor this request by converting to the desired sample rate (which might result in worse audio quality).
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-03-05 UTC."],[[["This content defines the JSON structure for configuring audio output from a speech synthesizer, overriding existing agent settings."],["The `audioEncoding` field specifies the required audio encoding format for the synthesized speech."],["`sampleRateHertz` determines the audio's sample rate, defaulting to the encoding's standard rate if unspecified."],["The `synthesizeSpeechConfig` field allows detailed customization of how the speech is generated."]]],[]]