Class SynthesizeSpeechResponse (2.2.0)

SynthesizeSpeechResponse(mapping=None, *, ignore_unknown_fields=False, **kwargs)

The message returned to the client by the SynthesizeSpeech method.

Attributes

NameDescription
audio_content bytes
The audio data bytes encoded as specified in the request, including the header for encodings that are wrapped in containers (e.g. MP3, OGG_OPUS). For LINEAR16 audio, we include the WAV header. Note: as with all bytes fields, protobuffers use a pure binary representation, whereas JSON representations use base64.
timepoints Sequence[.cloud_tts.Timepoint]
A link between a position in the original request input and a corresponding time in the output audio. It's only supported via of SSML input.
audio_config .cloud_tts.AudioConfig
The audio metadata of audio_content.