Transcribe speech to text by using the gcloud CLI
This page shows you how to send a speech recognition request to
Cloud Speech-to-Text using the
gcloud tool from the command
line.
Cloud Speech-to-Text enables easy integration of Google speech recognition technologies into developer applications. You can send audio data to the Cloud Speech-to-Text API, which then returns a text transcription of that audio file. For more information about the service, see Cloud STT basics.
Before you begin
Before you can send a request to the Cloud Speech-to-Text API, you must have completed the following actions. See the before you begin page for details.
- Enable Cloud Speech-to-Text on a GCP project.
- Make sure billing is enabled for Cloud Speech-to-Text.
-
Install the Google Cloud CLI. After installation, initialize the Google Cloud CLI by running the following command:
gcloud initIf you're using an external identity provider (IdP), you must first sign in to the gcloud CLI with your federated identity.
- (Optional) Create a new Google Cloud Storage bucket to store your audio data.
Make an audio transcription request
Now you can use Cloud STT to transcribe an audio file
to text. Use the following code sample to send a
recognize request to the Cloud Speech-to-Text API.
Open the command line shell and run the following command.
gcloud ml speech recognize gs://cloud-samples-tests/speech/brooklyn.flac \ --language-code=en-US
This command requests that Cloud STT transcribe the audio contained in a FLAC hosted at a publicly accessible location.
If the request is successful, the server returns a response in JSON format:
{
"results": [
{
"alternatives": [
{
"confidence": 0.9840146,
"transcript": "how old is the Brooklyn Bridge"
}
]
}
]
}Congratulations! You've sent your first request to Cloud STT.
If you receive an error or an empty response from Cloud STT, take a look at the troubleshooting and error mitigation steps.
Clean up
To avoid incurring charges to your Google Cloud account for the resources used on this page, follow these steps.
- Use the Google Cloud console to delete your project if you do not need it.
What's next
- Practice transcribing short audio files.
- Learn how to batch long audio files for speech recognition.
- Learn how to transcribe streaming audio like from a microphone.
- Get started with the Cloud STT in your language of choice by using a Cloud STT client library.
- Work through the sample applications.
- For best performance, accuracy, and other tips, see the best practices documentation.