This page shows you how to send a speech recognition request to Speech-to-Text in your favorite programming language using the Google Cloud Client Libraries.
Speech-to-Text enables easy integration of Google speech recognition technologies into developer applications. You can send audio data to the Speech-to-Text API, which then returns a text transcription of that audio file. For more information about the service, see Speech-to-Text basics.
Before you begin
-
Sign in to your Google Account.
If you don't already have one, sign up for a new account.
-
Set up a Cloud Console project.
Click to:
- Create or select a project.
- Enable the Cloud Speech-to-Text API for that project.
- Create a service account.
- Download a private key as JSON.
You can view and manage these resources at any time in the Cloud Console.
-
Set the environment variable
GOOGLE_APPLICATION_CREDENTIALS
to the path of the JSON file that contains your service account key. This variable only applies to your current shell session, so if you open a new session, set the variable again. - Install and initialize the Cloud SDK.
Install the client library
C#
Install-Package Google.Cloud.Speech.V1 -Pre
Go
go get -u cloud.google.com/go/speech/apiv1
Java
If you are using Maven, add
the following to your pom.xml
file. For more information about
BOMs, see The Google Cloud Platform Libraries BOM.
If you are using Gradle, add the following to your dependencies:
If you are using sbt, add the following to your dependencies:
If you're using IntelliJ or Eclipse, you can add client libraries to your project using the following IDE plugins:
The plugins provide additional functionality, such as key management for service accounts. Refer to each plugin's documentation for details.
Node.js
Before installing the library, make sure you've prepared your environment for Node.js development.
npm install --save @google-cloud/speech
PHP
composer require google/cloud-speech
Python
Before installing the library, make sure you've prepared your environment for Python development.
pip install --upgrade google-cloud-speech
Ruby
gem install google-cloud-speech
Make an audio transcription request
Now you can use Speech-to-Text to transcribe an audio file
to text. Use the following code to send a
recognize
request to the Speech-to-Text API.
C#
Go
Java
Node.js
Before running the example, make sure you've prepared your environment for Node.js development.
PHP
Python
Before running the example, make sure you've prepared your environment for Python development.
Ruby
Congratulations! You've sent your first request to Speech-to-Text.
If you receive an error or an empty response from Speech-to-Text, take a look at the troubleshooting and error mitigation steps.
Clean up
To avoid incurring charges to your Google Cloud account for the resources used in this quickstart, follow these steps.
- Use the Cloud Console to delete your project if you do not need it.
What's next
- Practice transcribing short audio files.
- Learn how to batch long audio files for speech recognition.
- Learn how to transcribe streaming audio like from a microphone.
- Get started with the Speech-to-Text in your language of choice by using a Speech-to-Text client library.
- Work through the sample applications.
- For best performance, accuracy, and other tips, see the best practices documentation.