Text-to-Speech release notes

This page documents production updates to Text-to-Speech. You can periodically check this page for announcements about new or updated features, bug fixes, known issues, and deprecated functionality.

You can see the latest product updates for all of Google Cloud on the Google Cloud page, browse and filter all release notes in the Google Cloud console, or programmatically access release notes in BigQuery.

To get the latest product updates delivered to you, add the URL of this page to your feed reader, or add the feed URL directly.

September 15, 2025

Chirp 3: HD voices is available on the asia-northeast1 endpoint. For more information, see Chirp 3: HD voices.

August 27, 2025

Chirp 3: HD voices is available on the europe-west2 endpoint. For more information, see Chirp 3: HD voices.

Chirp 3: instant custom voice supports the Chirp 3: HD voice controls for pace control, pause control, and custom pronunciations. For more information, see the Chirp 3: instant custom voice page.

August 21, 2025

Chirp 3: Instant custom voice supports new input audio encodings PCM, MP3, and M4A, with any sample rate. For more information, see the Chirp 3: Instant custom voice page.

July 24, 2025

Chirp 3: HD voices now offers General Availability (GA) support for four additional Nordic languages: Danish (da-DK), Finnish (fi-FI), Norwegian Bokmål (nb-NO), and Swedish (sv-SE). For more information, see Chirp 3: HD voices.

June 18, 2025

Chirp 3: Instant Custom Voice now extends support to ja-JP, now supporting more than 30 locales. For more information, check the Chirp 3: Instant Custom Voice documentation.

May 07, 2025

We just released three new voice features for Chirp 3: HD Voices. Pace control is available across all locales; pause control is available across all locales; custom pronunciations is available across all locales except bn-in, gu-in, nl-be, sw-ke, th-th, uk-ua, ur-in, and vi-vn. Be sure to check our Chirp 3: HD Voices documentation for more information.

April 16, 2025

The polyglot voices feature is only supported in multi-regions.

April 02, 2025

Chirp 3: HD voices with 8 speakers and 31 locales is now GA. It offers real-time streaming and batch processing capabilities and is accessible in global, us, eu, and asia-southeast1 regions.

Explore the latest Chirp 3: HD voices capabilities. Find out their full potential by visiting our updated documentation, specifically the voice controls section.

March 27, 2025

Chirp 3: HD voices in en-US now support experimental features for pace and pause controls.

March 17, 2025

Chirp 3: HD voices are only available in the global, us, eu, and asia-southeast1 regions. To use these voices, switch your endpoint to a supported region.

March 06, 2025

Chirp 3: HD voices now supports 8 new speakers in 31 new locales: ar-XA, bn-IN, cmn-CN, de-DE, en-AU, en-GB, en-IN, en-US, es-ES, es-US, fr-CA, fr-FR, gu-IN, hi-IN, id-ID, it-IT, ja-JP, kn-IN, ko-KR, ml-IN, mr-IN, nl-NL, pl-PL, pt-BR, ru-RU, sw-KE, ta-IN, te-IN, th-TH, tr-TR, and vi-VN.

February 10, 2025

Journey voices have been rebranded as Chirp HD voices.

December 03, 2024

Journey Voices now supports the Journey-O speaker for de-de, en-au, en-in, en-gb, es-es, es-us, fr-ca, fr-fr, and it-it.

November 22, 2024

Cloud TTS Journey voices have been updated to improve the accuracy of generated speech. This means you should notice fewer instances of dropped words.

November 11, 2024

Journey Voices now supports the de-de, en-gb, en-in, es-us, fr-ca, fr-fr, and it-it locales.

October 30, 2024

Studio Voices now support synthesis with multiple speakers to generate audios for interviews, interactive storytelling, video games, e-learning platforms, and accessibility solutions.

October 18, 2024

Journey Voices and streaming synthesis now support the de-de, en-gb, en-in, es-us, fr-ca, fr-fr, and it-it locales.

September 10, 2024

Journey Voices is now in Preview and supports text streaming.

May 14, 2024

Cloud Text-to-Speech now offers updated Journey voices with an additional speaker, en-us-Journey-O.

April 19, 2024

Cloud Text-to-Speech now offers es-ES Studio voices: es-ES-Studio-C and es-ES-Studio-F

February 26, 2024

Studio voices are now GA.

Casual voices are now in preview.

December 29, 2023

Journey voices are now in experimental.

November 29, 2023

Cloud Text-to-Speech now offers de-DE and fr-FR Studio voices: de-DE-Studio-B, de-DE-Studio-C, fr-FR-Studio-A, and fr-FR-Studio-D.

November 06, 2023

As of November 13 2023, speaker en-US-Studio-M will no longer be available. All requests sent to en-US-Studio-M will be routed to speaker en-US-Studio-Q. There is no action needed.

November 03, 2023

Cloud Text-to-Speech now offers en-GB Studio voices: en-GB-Studio-B and en-GB-Studio-C.

October 25, 2023

Styles are now supported in Neural2 voices through SSML. The following styles are supported

<google:emotion name="apologetic">
<google:emotion name="calm">
<google:emotion name="empathetic">
<google:emotion name="firm">
<google:emotion name="lively">

for the following voices:

en-us-Neural2-F
en-us-Neural2-J

October 24, 2023

Studio voices now support 5,000 bytes of either text or SSML input per synthesis request.

Long Audio Synthesis now supports Studio voices.

Long Audio Synthesis now supports SSML inputs.

October 16, 2023

The Long Audio Synthesis API now supports the following languages: English, Spanish, French, German, Japanese, Hindi, Italian, Korean, Portuguese, Thai, Vietnamese, Danish, Filipino.

There is no longer billing differentiation for Cloud Text-to-Speech Offline Custom Voice API calls. See the <ReportedUsage> documentation for more details.

June 28, 2023

Studio voices now support SSML, except for the following tags: <mark>, <emphasis>, <prosody>, and <lang>

March 16, 2023

Cloud Text-to-Speech now offers Long Audio Synthesis. This new API can be used to synthesize texts longer than 5 KB. For more information about API usage using the command line, see Create long audio from text by using the command line.

March 06, 2023

Text-to-Speech now offers a Spanish Studio voice, es-US-Studio-B, in addition to its existing English Studio voices.

February 16, 2023

Text-to-Speech offers these new voices. See the supported voices page for a complete list of voices and audio samples.

eu-ES-Standard-A
gl-ES-Standard-A

February 08, 2023

Text-to-Speech now offers Studio voices. This voice type is designed specifically for use with long-form texts such as narration and news reading. See the supported voices page for a complete list of voices and audio samples.

en-US-Studio-M
en-US-Studio-O

January 10, 2023

On or after July 9th, 2023, Cloud Text-to-Speech will replace the following voices with new voices of similar quality and accent. The new voices are available to try now. No action will be needed from you to switch to the new voice on July 9th, 2023. However, you are free to switch to the new voice at any time.

Removing ml-IN-Standard-A
- Redirecting to ml-IN-Standard-C
Removing ml-IN-Wavenet-A
- Redirecting ml-IN-Wavenet-C
Removing ml-IN-Standard-B
- Redirecting to ml-IN-Standard-D
Removing ml-IN-Wavenet-B
- Redirecting ml-IN-Wavenet-D
Removing bn-IN-Standard-A
- Redirecting to bn-IN-Standard-C
Removing bn-IN-Wavenet-A
- Redirecting bn-IN-Wavenet-C
Removing bn-IN-Standard-B
- Redirecting to bn-IN-Standard-D
Removing bn-IN-Wavenet-B
- Redirecting bn-IN-Wavenet-D
Removing kn-IN-Standard-A
- Redirecting to kn-IN-Standard-C
Removing kn-IN-Wavenet-A
- Redirecting kn-IN-Wavenet-C
Removing kn-IN-Standard-B
- Redirecting to kn-IN-Standard-D
Removing kn-IN-Wavenet-B
- Redirecting kn-IN-Wavenet-D
Removing gu-IN-Standard-A
- Redirecting to gu-IN-Standard-C
Removing gu-IN-Wavenet-A
- Redirecting gu-IN-Wavenet-C
Removing gu-IN-Standard-B
- Redirecting to gu-IN-Standard-D
Removing gu-IN-Wavenet-B
- Redirecting gu-IN-Wavenet-D
Removing it-IT-Standard-A
- Redirecting to it-IT-Standard-B
Removing it-IT-Wavenet-A
- Redirecting to it-IT-Wavenet-B
Removing es-ES-Standard-A
- Redirecting to es-ES-Standard-C

December 22, 2022

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

ml-IN-Wavenet-C
ml-IN-Wavenet-D

Note these voices are bilingual with en-IN.

Text-to-Speech now offers these new news reading voices. See the supported voices page for a complete list of voices and audio samples.

es-US-News-D
es-US-News-E
es-US-News-F
es-US-News-G
en-AU-News-E
en-AU-News-F
en-AU-News-G
en-GB-News-G
en-GB-News-H
en-GB-News-I
en-GB-News-J
en-GB-News-K
en-GB-News-L
en-GB-News-M

November 29, 2022

Text-to-Speech now offers additional Neural2 voices across 9 locales with 40+ speakers. Voices are available in the us-central1, us, and eu endpoints. See the supported voices page for a complete list of voices and audio samples.

November 10, 2022

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

en-US-News-K
en-US-News-L
en-US-News-M
en-US-News-N

October 24, 2022

Text-to-Speech improved the quality of these voices. See the supported voices page for a complete list of voices and audio samples.

en-GB-Wavenet-A
en-GB-Wavenet-B
en-GB-Wavenet-C
en-GB-Wavenet-D
en-GB-Wavenet-F
es-ES-Wavenet-B
es-ES-Wavenet-C
es-ES-Wavenet-D
hi-IN-Wavenet-A
hi-IN-Wavenet-B
hi-IN-Wavenet-C
hi-IN-Wavenet-D

October 07, 2022

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

mr-IN-Wavenet-A
mr-IN-Standard-A
mr-IN-Wavenet-B
mr-IN-Standard-B
mr-IN-Wavenet-C
mr-IN-Standard-C

On or after April 8th, 2023, Cloud Text-to-Speech will replace the following voices with new voices of similar quality and accent. The new voices are available to try now. No action will be needed from you to switch to the new voice on April 8th, 2023. However, you are free to switch to the new voice at anytime

Removing ta-IN-Standard-A
1. Redirecting to ta-IN-Standard-C
Removing ta-IN-Wavenet-A
1. Redirecting to ta-IN-Wavenet-C
Removing ta-IN-Standard-B
1. Redirecting to ta-IN-Standard-D
Removing ta-IN-Wavenet-B
1. Redirecting to ta-IN-Wavenet-D
Removing pt-BR-Standard-A
1. Redirecting to pt-BR-Standard-C
Removing pt-BR-Wavenet-A
1. Redirecting to pt-BR-Wavenet-C
Removing ja-JP-Standard-A
1. Redirecting to ja-JP-Standard-B
Removing ja-JP-Wavenet-A
1. Redirecting to ja-JP-Wavenet-B

September 01, 2022

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

ta-IN-Wavenet-C
ta-IN-Standard-C
ta-IN-Wavenet-D
ta-IN-Standard-D

August 19, 2022

Text-to-Speech has improved the quality of these voices

pt-br-Standard-A
pt-br-Standard-B

August 05, 2022

Text-to-Speech now offers these new voices. See the supported voices page for a complete list of voices and audio samples.

pt-BR-Standard-C
pt-BR-Wavenet-C

June 27, 2022

Cloud Text-to-Speech now supports Neural2 voices in addition to Standard and WaveNet voice generation models. Neural2 uses Custom Voice technology without the need to train a unique voice. Neural2 voices are in Preview and are currently available in a single region for a limited number of languages.

March 09, 2022

Text-to-Speech now offers regional endpoints for the following places. See the How-to guides for more information on how to use these endpoints. - Europe - https://eu-texttospeech.googleapis.com - US - https://us-texttospeech.googleapis.com

June 17, 2021

Text-to-Speech now offers voices in the following new languages. See the supported voices page for a complete list of voices and audio samples.

ms-MY (Malay, Malaysia)
nl-BE (Dutch, Belgium)

April 09, 2021

Text-to-Speech now offers voices in the following new languages. See the supported voices page for a complete list of voices and audio samples.

es-US (Spanish, US)
af-ZA (Afrikaans, South Africa)
bg-BG (Bulgarian, Bulgaria)
ca-ES (Catalan, Spain)
is-IS (Icelandic, Iceland)
lv-LV (Latvian, Latvia)
sr-RS (Serbian, Cyrillic)

April 07, 2021

Text-to-Speech now supports MULAW and ALAW audio encodings. See the AudioEncoding reference documentation for details.

March 01, 2021

Text-to-Speech has launched Beta support of new SSML tags: <phoneme>, <mark>, <lang>, <voice>, and <say-as interpret-as="duration"> to specify durations. See the phonemes for a list of phonemes available for your language.

Support for the <prosody> SSML tag has been enhanced to produce continuous TTS when possible.

Text-to-speech has resolved an issue that affected how volume changes are calculated, resulting in different but correct behavior.
Text-to-speech has resolved an issue that affected how pitch changes are calculated, resulting in different but correct behavior.

Text-to-Speech has improved the continuity of mixed-media results. Now when you mix text and sounds within a <s>/<s> block, Text-to-Speech generates a much shorter pause and better transition between the synthesized speech and the sound.

Text-to-Speech has improved its handling of speech synthesis requests sent using SSML markup.

Text-to-Speech has improved the verbalization and pacing of phone numbers.

January 22, 2021

New language: Text-to-Speech now supports Romanian (ro-RO). See the supported voices page for details and audio samples.

New voice: Text-to-Speech now offers 2 new Bengali (bn-IN) WaveNet voices. See the supported voices page for details and audio samples.

August 24, 2020

Text-to-Speech now offers four new English (US) voices, available as both WaveNet and Standard models. See the supported voices and languages page for more details.

Text-to-Speech now offers four new Chinese (Hong Kong) voices, available as Standard models. See the supported voices and languages page for more details.

May 01, 2020

Cloud Text-to-Speech now offers 36 new voices (both Standard and WaveNet) in the following languages. See the Supported Voices and Languages page for complete details.

Arabic
Bengali (India)
English (India)
French (France)
German (Germany)
Gujarati (India)
Hindi (India)
Indonesian (Indonesia)
Kannada (India)
Malayalam (India)
Mandarin Chinese
Russian (Russia)
Tamil (India)
Telugu (India)
Thai (Thailand)

August 27, 2019

Cloud Text-to-Speech now offers 76 new voices, both standard and WaveNet, in the following languages:

Arabic
Czech
Dutch
English (India)
Filipino
Finnish
Greek
Hindi
Hungarian
Indonesian
Italian
Japanese
Mandarin Chinese
Norwegian
Vietnamese

February 05, 2019

The audio profile feature is generally available for use in new applications. Cloud Text-to-Speech API now allows developers to specify an audio profile for the audio generated from Cloud Text-to-Speech API. Audio profiles are optimized for specific hardware used for playback, from headphones to car stereos.

Added new Standard and WaveNet voices in the following languages and variants:

Danish (Denmark)
Polish (Poland)
Portuguese (Brazil)
Russian (Russia)
Slovak (Slovakia)
Turkish (Turkey)
Ukrainian (Ukraine)

Review the voices list for complete details.

August 28, 2018

Cloud Text-to-Speech API general availability (GA) release.

This release includes the public availability of the v1 API endpoint, both in REST and RPC.

July 24, 2018

Added new WaveNet voices in the following languages and variants:

Dutch (Netherlands)
English (Australia)
English (UK)
German
Italian
Japanese

Review the voices list for complete details.

Cloud Text-to-Speech API now allows developers to specify an audio profile for the audio generated from Cloud Text-to-Speech API. Audio profiles are optimized for specific hardware used for playback, from headphones to car stereos.

June 01, 2018

Added Korean (ko-KR) voice. Review the voices list for complete details.

March 27, 2018

Cloud Text-to-Speech API is now available in beta.

March 20, 2018

The names of voices provided for speech synthesis in the Text-to-Speech API have changed. Previous versions of the voice names do not work. To see the list of voices provided in the Text-to-Speech API including the correct names, see the voice list.

March 02, 2018

The gender field changed to ssmlGender.

February 02, 2018

Add voices in the following languages:

English (US)
French (Canada)
Dutch
Portuguese (Brazil)
Swedish
Turkish

See the voices list for complete details.

November 10, 2017

Cloud Text-to-Speech API Alpha release.

Text-to-Speech release notes Stay organized with collections Save and categorize content based on your preferences.

September 15, 2025

August 27, 2025

August 21, 2025

July 24, 2025

June 18, 2025

May 07, 2025

April 16, 2025

April 02, 2025

March 27, 2025

March 17, 2025

March 06, 2025

February 10, 2025

December 03, 2024

November 22, 2024

November 11, 2024

October 30, 2024

October 18, 2024

September 10, 2024

May 14, 2024

April 19, 2024

February 26, 2024

December 29, 2023

November 29, 2023

November 06, 2023

November 03, 2023

October 25, 2023

October 24, 2023

October 16, 2023

June 28, 2023

March 16, 2023

March 06, 2023

February 16, 2023

February 08, 2023

January 10, 2023

December 22, 2022

November 29, 2022

November 10, 2022

October 24, 2022

October 07, 2022

September 01, 2022

August 19, 2022

August 05, 2022

June 27, 2022

March 09, 2022

June 17, 2021

April 09, 2021

April 07, 2021

March 01, 2021

January 22, 2021

August 24, 2020

May 01, 2020

August 27, 2019

February 05, 2019

August 28, 2018

July 24, 2018

June 01, 2018

March 27, 2018

March 20, 2018

March 02, 2018

February 02, 2018

November 10, 2017

Text-to-Speech release notes