Text-to-Speech is priced based on the number of characters sent to the service each month to be synthesized into audio. See the voices page for a complete list of supported voices and languages.
The total number of characters in the input string is counted for billing purposes, including spaces. Speech Synthesis Markup Language (SSML) tags are also included in the character count. For example, this input string counts as 79 characters, including the SSML tags, newlines, and spaces:
<speak> <say-as interpret-as="cardinal">12345</say-as> and one more </speak>
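To estimate the billable length of a request before sending it, one option is to measure the whole input string, markup included. The sketch below assumes that Python's `len()` (which counts Unicode code points) approximates how the service counts characters; the text above does not confirm the exact counting rule. The sample string and variable names are hypothetical.

```python
# Hypothetical SSML input, chosen for illustration only.
ssml = "<speak>Hi there</speak>"
plain_text = "Hi there"

# Assumption: one billed character per Unicode code point, which is what len() returns.
billable_chars = len(ssml)        # tags and spaces are part of the count
spoken_chars = len(plain_text)    # the text that is actually spoken
markup_overhead = billable_chars - spoken_chars

print(billable_chars, spoken_chars, markup_overhead)  # 23 8 15
```

In this sample the SSML tags account for 15 of the 23 counted characters, so heavily marked-up input can cost noticeably more than the spoken text alone suggests.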
| Feature | Free per month | Price after free quota is reached |
|---|---|---|
| Standard (non-WaveNet) voices | 0 to 4 million characters | $4.00 USD / 1 million characters |
| WaveNet voices | 0 to 1 million characters | $16.00 USD / 1 million characters |
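The table can be turned into a rough monthly estimate. The sketch below uses the free quotas and rates listed above and makes two assumptions the table does not state: the free quota is subtracted first, and usage beyond it is billed proportionally per character rather than rounded up to the next million. The names `PRICING` and `estimate_monthly_cost` are hypothetical.

```python
from decimal import Decimal

# Free quotas and rates copied from the pricing table above.
PRICING = {
    "standard": {"free_chars": 4_000_000, "usd_per_million": Decimal("4.00")},
    "wavenet": {"free_chars": 1_000_000, "usd_per_million": Decimal("16.00")},
}


def estimate_monthly_cost(voice_type: str, chars_used: int) -> Decimal:
    """Estimate the monthly charge in USD for one voice type.

    Assumption: billing is prorated per character after the free quota.
    """
    tier = PRICING[voice_type]
    # Characters inside the free quota cost nothing.
    billable = max(0, chars_used - tier["free_chars"])
    cost = Decimal(billable) * tier["usd_per_million"] / Decimal(1_000_000)
    return cost.quantize(Decimal("0.01"))


print(estimate_monthly_cost("standard", 5_000_000))  # 4.00
print(estimate_monthly_cost("wavenet", 1_500_000))   # 8.00
print(estimate_monthly_cost("standard", 3_000_000))  # 0.00
```

`Decimal` is used instead of `float` so that currency amounts round predictably to cents.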
Google Cloud Platform costs
If you use other Google Cloud Platform resources in tandem with Text-to-Speech, such as Google App Engine instances, you will also be billed for the use of those services. See the Google Cloud Platform Pricing Calculator to estimate other costs based on current rates.