-
Create and use Recognizers
Store and reuse recognition configuration using Recognizers.
-
Automatically detect spoken language
Provide multiple language codes for audio transcription requests sent to Cloud Speech-to-Text.
-
Select a transcription model
Select a specialized machine learning model for audio transcription.
-
Transcribe audio with multiple channels
Transcribe audio files that include more than one channel.
-
Get automatic punctuation
Include punctuation in transcription results from Speech-to-Text.
-
Enable word-level confidence
Specify that transcriptions should contain an accuracy indication for individual words.
-
Enable spoken punctuation and spoken emojis
Detect spoken punctuation and emoji terms in audio and replace them with the corresponding symbols in the transcript.
-
Encrypt Speech-to-Text resources
Encrypt Speech-to-Text resources using customer-managed encryption keys (CMEK).