Instructs the speech synthesizer how to generate the output audio content. If this audio config is supplied in a request, it overrides all existing text-to-speech settings applied to the agent.
audioEncoding
enum (OutputAudioEncoding)
Required. Audio encoding of the synthesized audio content.
sampleRateHertz
integer
The synthesis sample rate (in hertz) for this audio. If omitted, the synthesizer uses the default sample rate for the chosen audio encoding. If the value differs from the voice's natural sample rate, the synthesizer converts the audio to the requested rate, which might reduce audio quality.
synthesizeSpeechConfig
object (SynthesizeSpeechConfig)
Allows detailed customization of how the speech is generated.
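As a rough illustration of the fields described above, the following Python sketch assembles a request fragment with the required `audioEncoding` and an optional `sampleRateHertz`. The helper name `build_output_audio_config` is hypothetical, and the enum value `OUTPUT_AUDIO_ENCODING_LINEAR_16` and the 16000 Hz rate are assumptions chosen for illustration; check the `OutputAudioEncoding` reference for the values your API version accepts.

```python
import json
from typing import Optional


def build_output_audio_config(audio_encoding: str,
                              sample_rate_hertz: Optional[int] = None) -> dict:
    """Build a dict shaped like OutputAudioConfig (hypothetical helper)."""
    # audioEncoding is documented as required, so reject empty values early.
    if not audio_encoding:
        raise ValueError("audioEncoding is required")
    config = {"audioEncoding": audio_encoding}
    # sampleRateHertz is optional; when it is left out, the synthesizer
    # falls back to the default rate for the chosen encoding.
    if sample_rate_hertz is not None:
        config["sampleRateHertz"] = sample_rate_hertz
    return config


# Assumed enum value and sample rate, for illustration only.
config = build_output_audio_config("OUTPUT_AUDIO_ENCODING_LINEAR_16", 16000)
print(json.dumps(config, indent=2))
```

Leaving `sample_rate_hertz` unset omits the key entirely rather than sending a placeholder, which matches the documented fallback to the encoding's default rate.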
Last updated 2025-06-27 UTC.