Audio encoding of the output audio format in Text-To-Speech.
Values:
OUTPUT_AUDIO_ENCODING_UNSPECIFIED (0):
Not specified.
OUTPUT_AUDIO_ENCODING_LINEAR_16 (1):
Uncompressed 16-bit signed little-endian
samples (Linear PCM). Audio content returned as
LINEAR16 also contains a WAV header.
OUTPUT_AUDIO_ENCODING_MP3 (2):
MP3 audio at 32kbps.
OUTPUT_AUDIO_ENCODING_MP3_64_KBPS (4):
MP3 audio at 64kbps.
OUTPUT_AUDIO_ENCODING_OGG_OPUS (3):
Opus encoded audio wrapped in an ogg
container. The result will be a file which can
be played natively on Android, and in browsers
(at least Chrome and Firefox). The quality of
the encoding is considerably higher than MP3
while using approximately the same bitrate.
OUTPUT_AUDIO_ENCODING_MULAW (5):
8-bit samples that compand 14-bit audio
samples using G.711 PCMU/mu-law.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2024-12-04 UTC."],[],[]]