Media Translation API translates streaming audio in real time, whether the input comes from a microphone or a prerecorded audio file, and the integration is optimized for low latency.
The API accurately punctuates your translation results (e.g., commas, periods, question marks).
Media Translation API comes with two enhanced models (video, phone call), so you can optimize accuracy for your specific audio use case.
Media Translation API supports 12 languages.
"At OnePlus, we aim to share the best technology with the world, hand in hand with our users. One important feature for our product is face-to-face communication across countries, time zones, and even languages. With Google Cloud’s Media Translation API, we are now able to provide real-time streaming translation for video chat with a simple API integration and ensure our customers feel effortlessly connected with minimal latency."
Gary Chen, Head of Software Product, OnePlus
Basics: Guide to the basics of using Media Translation API.
Supported languages: Media Translation API supports 12 languages.
Best practices: Recommendations on how to provide audio data to Media Translation API.
Client libraries: Media Translation API client libraries are built on Google Cloud Client Libraries.
Translating streaming audio: Code samples demonstrating how to translate streaming audio into text.
Release notes: Latest product updates for Media Translation API.
Real-time video translation with AR subtitles: Learn how to add translated subtitles on top of any video in real time.
Create real-time translation overlays: Learn how to overlay translations as subtitles over a live video feed, using a video mixer and a luma keyer.
Media Translation API is billed monthly based on the amount of audio successfully translated by the service and on the model used for translation. Usage is measured in 15-second increments, rounded up.
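As a rough illustration of the 15-second rounding, the sketch below estimates billable seconds for a set of audio durations. The function names are hypothetical, and the sketch assumes that rounding is applied to each request separately; the text here does not say whether rounding happens per request or on the monthly total, and no per-model rates are given, so the sketch stops at billable seconds rather than cost.

```python
import math

# Billing increment in seconds, taken from the pricing description above.
INCREMENT_SECONDS = 15


def billable_seconds(audio_seconds: float) -> int:
    """Round one audio duration up to the next 15-second increment."""
    if audio_seconds < 0:
        raise ValueError("audio duration cannot be negative")
    return math.ceil(audio_seconds / INCREMENT_SECONDS) * INCREMENT_SECONDS


def monthly_billable_seconds(durations: list[float]) -> int:
    """Sum billable seconds, assuming each request is rounded on its own."""
    return sum(billable_seconds(d) for d in durations)
```

Under these assumptions, `billable_seconds(31)` returns 45, because 31 seconds spills into a third 15-second increment.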