- JSON representation
- MultiSpeakerMarkup
- Turn
- CustomPronunciations
- CustomPronunciationParams
- PhoneticEncoding
Contains text input to be synthesized. Either text
or ssml
must be supplied. Supplying both or neither returns google.rpc.Code.INVALID_ARGUMENT
. The input size is limited to 5000 bytes.
JSON representation |
---|
{ "customPronunciations": { object ( |
Fields | |
---|---|
custom |
Optional. The pronunciation customizations are applied to the input. If this is set, the input is synthesized using the given pronunciation customizations. The initial support is for English, French, Italian, German, and Spanish (EFIGS) languages, as provided in
. Journey and Instant Clone voices aren't supported. In order to customize the pronunciation of a phrase, there must be an exact match of the phrase in the input types. If using SSML, the phrase must not be inside a phoneme tag. |
Union field input_source . The input source, which is either plain text or SSML. input_source can be only one of the following: |
|
text |
The raw text to be synthesized. |
ssml |
The SSML document to be synthesized. The SSML document must be valid and well-formed. Otherwise the RPC will fail and return |
multi |
The multi-speaker input to be synthesized. Only applicable for multi-speaker synthesis. |
MultiSpeakerMarkup
A collection of turns for multi-speaker synthesis.
JSON representation |
---|
{
"turns": [
{
object ( |
Fields | |
---|---|
turns[] |
Required. Speaker turns. |
Turn
A multi-speaker turn.
JSON representation |
---|
{ "speaker": string, "text": string } |
Fields | |
---|---|
speaker |
Required. The speaker of the turn, for example, 'O' or 'Q'. Please refer to documentation for available speakers. |
text |
Required. The text to speak. |
CustomPronunciations
A collection of pronunciation customizations.
JSON representation |
---|
{
"pronunciations": [
{
object ( |
Fields | |
---|---|
pronunciations[] |
The pronunciation customizations are applied. |
CustomPronunciationParams
Pronunciation customization for a phrase.
JSON representation |
---|
{
"phrase": string,
"phoneticEncoding": enum ( |
Fields | |
---|---|
phrase |
The phrase to which the customization is applied. The phrase can be multiple words, such as proper nouns, but shouldn't span the length of the sentence. |
phonetic |
The phonetic encoding of the phrase. |
pronunciation |
The pronunciation of the phrase. This must be in the phonetic encoding specified above. |
PhoneticEncoding
The phonetic encoding of the phrase.
Enums | |
---|---|
PHONETIC_ENCODING_UNSPECIFIED |
Not specified. |
PHONETIC_ENCODING_IPA |
IPA, such as apple -> ˈæpəl. https://en.wikipedia.org/wiki/International_Phonetic_Alphabet |
PHONETIC_ENCODING_X_SAMPA |
X-SAMPA, such as apple -> "{p@l". https://en.wikipedia.org/wiki/X-SAMPA |