
Text-to-Speech AI

Convert text into natural-sounding speech using an API powered by the best of Google’s AI technologies.

New customers get $300 in free credits to spend on Text-to-Speech.

  • Improve customer interactions with intelligent, lifelike responses

  • Engage users with a voice user interface in your devices and applications

  • Personalize your communication based on each user's preferred voice and language

Benefits

High fidelity speech

Deploy Google’s groundbreaking technologies to generate speech with humanlike intonation. Built on DeepMind’s speech synthesis expertise, the API delivers voices of near-human quality.

Widest voice selection

Choose from a set of 380+ voices across 50+ languages and variants, including Mandarin, Hindi, Spanish, Arabic, Russian, and more. Pick the voice that works best for your user and application.

One-of-a-kind voice

Create a unique voice to represent your brand across all your customer touchpoints, instead of using a common voice shared with other organizations.

Demo

Put Text-to-Speech into action

Type the text you want to hear, select a language, then click “Speak It” to listen.

Key features

Neural2 voices

Internationalize your voice experience with ready-to-use voices powered by the latest research behind Custom Voice.

Studio voices (Preview)

Dazzle your listeners with professionally narrated content recorded in a studio-quality environment. Make sure to put your headphones on!

Custom Voice

Train a custom voice model using your own audio recordings to create a unique, more natural-sounding voice for your organization. You can define and choose the voice profile that suits your organization and quickly adapt as your voice needs change, without recording new phrases.

Voice tuning

Personalize the pitch of your selected voice by up to 20 semitones above or below the default. Adjust the speaking rate to be up to 4x faster or slower than the normal rate.
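
As a rough illustration of the tuning limits described above, the following Python sketch assembles a request body for the REST text:synthesize method and rejects values outside the stated ranges. The helper name build_synthesize_request is hypothetical, and the field names (input, voice, audioConfig, pitch, speakingRate) reflect our understanding of the public REST reference, so check them against the current documentation before relying on them. The sketch only builds the JSON; it does not send a request or handle authentication.

```python
import json

# Limits taken from the text above: pitch within 20 semitones of the default,
# speaking rate between 4x slower (0.25) and 4x faster (4.0) than normal.
PITCH_RANGE = (-20.0, 20.0)
RATE_RANGE = (0.25, 4.0)


def build_synthesize_request(text, language_code="en-US", pitch=0.0, speaking_rate=1.0):
    """Return a JSON-serializable request body (hypothetical helper).

    Field names are assumed to match the REST text:synthesize method.
    """
    if not PITCH_RANGE[0] <= pitch <= PITCH_RANGE[1]:
        raise ValueError(f"pitch must be within {PITCH_RANGE}, got {pitch}")
    if not RATE_RANGE[0] <= speaking_rate <= RATE_RANGE[1]:
        raise ValueError(f"speaking_rate must be within {RATE_RANGE}, got {speaking_rate}")
    return {
        "input": {"text": text},
        "voice": {"languageCode": language_code},
        "audioConfig": {
            "audioEncoding": "MP3",
            "pitch": pitch,
            "speakingRate": speaking_rate,
        },
    }


if __name__ == "__main__":
    # Slightly lower and faster than the default voice.
    body = build_synthesize_request("Hello, world!", pitch=-4.0, speaking_rate=1.25)
    print(json.dumps(body, indent=2))
```

Validating on the client side is a design choice, not a requirement: it surfaces an out-of-range value before a network round trip, at the cost of duplicating limits that the service also enforces.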

Text and SSML support

Customize your speech with SSML tags that let you add pauses, control how numbers, dates, and times are read, and give other pronunciation instructions.
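
To make the SSML features above concrete, here is a minimal Python sketch that builds an SSML string containing a pause, a number, and a date. The function build_ssml and its parameters are hypothetical; the tags used (speak, break, say-as with interpret-as set to cardinal or date) are standard SSML that the API is understood to accept, but the exact set of supported attributes should be confirmed in the SSML reference. Building the markup with an XML library, not string concatenation, means special characters in the input text are escaped automatically.

```python
import xml.etree.ElementTree as ET


def build_ssml(greeting, pause_ms, count, date_yyyymmdd):
    """Build an SSML document as a string (hypothetical helper)."""
    speak = ET.Element("speak")
    speak.text = greeting

    # Pause for the requested number of milliseconds.
    pause = ET.SubElement(speak, "break", {"time": f"{pause_ms}ms"})
    pause.tail = " You have "

    # Ask for the value to be read as a cardinal number.
    number = ET.SubElement(speak, "say-as", {"interpret-as": "cardinal"})
    number.text = str(count)
    number.tail = " new messages as of "

    # Ask for the value to be read as a date in year-month-day order.
    date = ET.SubElement(
        speak, "say-as", {"interpret-as": "date", "format": "yyyymmdd"}
    )
    date.text = date_yyyymmdd
    date.tail = "."

    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml("Hello & welcome.", 500, 3, "20240115"))
```

The resulting string would be supplied as SSML input in place of plain text when making a synthesis request.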
