Release notes

This page documents production updates to Text-to-Speech. You can periodically check this page for announcements about new or updated features, bug fixes, known issues, and deprecated functionality.

You can see the latest product updates for all of Google Cloud on the Google Cloud page, or browse and filter all release notes in the Google Cloud Console.

To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly: https://cloud.google.com/feeds/tts-release-notes.xml

April 09, 2021

Text-to-Speech now offers voices in the following new languages. See the supported voices page for a complete list of voices and audio samples.

  • es-US (Spanish, US)
  • af-ZA (Afrikaans, South Africa)
  • bg-BG (Bulgarian, Bulgaria)
  • ca-ES (Catalan, Spain)
  • is-IS (Icelandic, Iceland)
  • lv-LV (Latvian, Latvia)
  • sr-RS (Serbian, Cyrillic)

April 07, 2021

Text-to-Speech now supports MULAW and ALAW audio encodings. See the AudioEncoding reference documentation for details.

March 01, 2021

Text-to-Speech has launched Beta support of new SSML tags: <phoneme>, <mark>, <lang>, <voice>, and <say-as interpret-as="duration"> to specify durations. See the phonemes for a list of phonemes available for your language.

Support for the <prosody> SSML tag has been enhanced to produce continuous TTS when possible.

  • Text-to-speech has resolved an issue that affected how volume changes are calculated, resulting in different but correct behavior.
  • Text-to-speech has resolved an issue that affected how pitch changes are calculated, resulting in different but correct behavior.

Text-to-Speech has improved the continuity of mixed-media results. Now when you mix text and sounds within a <s>/<s> block, Text-to-Speech generates a much shorter pause and better transition between the synthesized speech and the sound.

Text-to-Speech has improved its handling of speech synthesis requests sent using SSML markup.

Text-to-Speech has improved the verbalization and pacing of phone numbers.

January 22, 2021

New language: Text-to-Speech now supports Romanian (ro-RO). See the supported voices page for details and audio samples.

New voice: Text-to-Speech now offers 2 new Bengali (bn-IN) WaveNet voices. See the supported voices page for details and audio samples.

August 24, 2020

Text-to-Speech now offers four new English (US) voices, available as both WaveNet and Standard models. See the supported voices and languages page for more details.

Text-to-Speech now offers four new Chinese (Hong Kong) voices, available as Standard models. See the supported voices and languages page for more details.

May 01, 2020

Cloud Text-to-Speech now offers 36 new voices (both Standard and WaveNet) in the following languages. See the Supported Voices and Languages page for complete details.

  • Arabic
  • Bengali (India)
  • English (India)
  • French (France)
  • German (Germany)
  • Gujarati (India)
  • Hindi (India)
  • Indonesian (Indonesia)
  • Kannada (India)
  • Malayalam (India)
  • Mandarin Chinese
  • Russian (Russia)
  • Tamil (India)
  • Telugu (India)
  • Thai (Thailand)

August 27, 2019

Cloud Text-to-Speech now offers 76 new voices, both standard and WaveNet, in the following languages:

  • Arabic
  • Czech
  • Dutch
  • English (India)
  • Filipino
  • Finnish
  • Greek
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Japanese
  • Mandarin Chinese
  • Norwegian
  • Vietnamese

February 05, 2019

The audio profile feature is generally available for use in new applications. Cloud Text-to-Speech API now allows developers to specify an audio profile for the audio generated from Cloud Text-to-Speech API. Audio profiles are optimized for specific hardware used for playback, from headphones to car stereos.

Added new Standard and WaveNet voices in the following languages and variants:

  • Danish (Denmark)
  • Polish (Poland)
  • Portuguese (Brazil)
  • Russian (Russia)
  • Slovak (Slovakia)
  • Turkish (Turkey)
  • Ukrainian (Ukraine)

Review the voices list for complete details.

August 28, 2018

Cloud Text-to-Speech API general availability (GA) release.

This release includes the public availability of the v1 API endpoint, both in REST and RPC.

July 24, 2018

Added new WaveNet voices in the following languages and variants:

  • Dutch (Netherlands)
  • English (Australia)
  • English (UK)
  • German
  • Italian
  • Japanese

Review the voices list for complete details.

Cloud Text-to-Speech API now allows developers to specify an audio profile for the audio generated from Cloud Text-to-Speech API. Audio profiles are optimized for specific hardware used for playback, from headphones to car stereos.

June 01, 2018

Added Korean (ko-KR) voice. Review the voices list for complete details.

March 27, 2018

Cloud Text-to-Speech API is now available in beta.

March 20, 2018

The names of voices provided for speech synthesis in the Text-to-Speech API have changed. Previous versions of the voice names do not work. To see the list of voices provided in the Text-to-Speech API including the correct names, see the voice list.

March 02, 2018

The gender field changed to ssmlGender.

February 02, 2018

Add voices in the following languages:

  • English (US)
  • French (Canada)
  • Dutch
  • Portuguese (Brazil)
  • Swedish
  • Turkish

See the voices list for complete details.

November 10, 2017

Cloud Text-to-Speech API Alpha release.