Convert text into natural-sounding speech using an API powered by the best of Google’s AI technologies.
New customers get $300 in free credits to spend on Text-to-Speech.
Improve customer interactions with intelligent, lifelike responses
Engage users with a voice user interface in your devices and applications
Personalize your communication based on each user's preferred voice and language
Deploy Google’s groundbreaking technologies to generate speech with humanlike intonation. Built on DeepMind’s speech synthesis expertise, the API delivers voices that are near human quality.
Choose from a set of 380+ voices across 50+ languages and variants, including Mandarin, Hindi, Spanish, Arabic, Russian, and more. Pick the voice that works best for your user and application.
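As a rough illustration of picking a voice, the sketch below builds the JSON body for a synthesis request. The field names (`input`, `voice`, `languageCode`, `name`, `audioConfig`, `audioEncoding`) follow the public REST interface as we understand it, and the helper `build_synthesis_request` is a hypothetical name introduced here; check the current API reference before relying on it. No network call is made.

```python
import json


def build_synthesis_request(text, language_code, voice_name=None):
    """Return a request body dict for a text-to-speech synthesis call.

    Hypothetical helper: field names are assumed from the REST API and
    should be verified against the current reference.
    """
    # A language code (for example "es-ES") is enough for the service to
    # choose a voice; a specific voice name is optional.
    voice = {"languageCode": language_code}
    if voice_name is not None:
        voice["name"] = voice_name
    return {
        "input": {"text": text},
        "voice": voice,
        "audioConfig": {"audioEncoding": "MP3"},
    }


if __name__ == "__main__":
    body = build_synthesis_request("Hola, mundo", "es-ES")
    print(json.dumps(body, indent=2, ensure_ascii=False))
```

Leaving `voice_name` unset keeps the request portable across languages; pass a concrete voice name only after confirming it exists in the voice list for that language.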
Internationalize your voice experience with ready-to-use voices powered by the latest research behind Custom Voice.
Dazzle your listeners with professionally narrated content recorded in a studio-quality environment. Make sure to put your headphones on!
Train a custom voice model using your own audio recordings to create a unique and more natural-sounding voice for your organization. You can define and choose the voice profile that suits your organization and quickly adjust to changes in voice needs without recording new phrases.
Personalize the pitch of your selected voice, up to 20 semitones above or below the default. Adjust the speaking rate to be up to 4x faster or slower than the normal rate.
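The pitch and speaking-rate limits above can be checked before a request is sent. The following is a minimal sketch, assuming the audio settings are passed as `pitch` (in semitones) and `speakingRate` (a multiplier where 1.0 is normal) and that "4x slower" corresponds to a multiplier of 0.25. The helper name `build_audio_config` is hypothetical.

```python
# Limits taken from the description above: +/-20 semitones, 4x faster or slower.
PITCH_RANGE = (-20.0, 20.0)
RATE_RANGE = (0.25, 4.0)  # assumption: 0.25 means 4x slower, 4.0 means 4x faster


def build_audio_config(pitch=0.0, speaking_rate=1.0, encoding="MP3"):
    """Return an audio configuration dict, rejecting out-of-range values.

    Hypothetical helper: field names are assumed from the REST API.
    """
    if not PITCH_RANGE[0] <= pitch <= PITCH_RANGE[1]:
        raise ValueError(f"pitch must be within {PITCH_RANGE}, got {pitch}")
    if not RATE_RANGE[0] <= speaking_rate <= RATE_RANGE[1]:
        raise ValueError(
            f"speaking_rate must be within {RATE_RANGE}, got {speaking_rate}"
        )
    return {
        "audioEncoding": encoding,
        "pitch": pitch,
        "speakingRate": speaking_rate,
    }


if __name__ == "__main__":
    # Slightly lower and slower than the default voice.
    print(build_audio_config(pitch=-2.0, speaking_rate=0.9))
```

Validating locally gives a clearer error message than waiting for the service to reject the request.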
Customize your speech with SSML tags that allow you to add pauses, numbers, date and time formatting, and other pronunciation instructions.
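As an illustration of the SSML tags mentioned above, this sketch assembles a small SSML document with a pause and a date reading. The `<speak>`, `<break>`, and `<say-as>` elements come from the SSML standard; the `format="mdy"` attribute value and the helper name `build_ssml` are assumptions made for this example, so confirm the supported attributes in the SSML reference for the service.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def build_ssml(greeting, date_mdy, pause_ms=500):
    """Return an SSML string with a pause and a date read aloud as a date.

    Hypothetical helper for illustration only.
    """
    ssml = (
        "<speak>"
        # Escape user text so characters like & and < stay valid XML.
        f"{escape(greeting)}"
        f'<break time="{int(pause_ms)}ms"/>'
        "Your appointment is on "
        f'<say-as interpret-as="date" format="mdy">{escape(date_mdy)}</say-as>.'
        "</speak>"
    )
    # Parsing raises xml.etree.ElementTree.ParseError if the markup is malformed.
    ET.fromstring(ssml)
    return ssml


if __name__ == "__main__":
    print(build_ssml("Hello & welcome.", "10/12/2024"))
```

The resulting string would be sent in place of plain text, under an `ssml` input field rather than a `text` field, assuming the request shape of the REST interface.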