Quickstart: Using the gcloud Tool

This page shows you how to send a speech recognition request to Cloud Speech-to-Text using the gcloud tool from the command line.

Cloud Speech-to-Text enables easy integration of Google speech recognition technologies into developer applications. You can send audio data to the Speech-to-Text API, which then returns a text transcription of that audio. For more information about the service, see Cloud Speech-to-Text basics.

Before you begin

  1. Sign in to your Google Account.

    If you don't already have one, sign up for a new account.

  2. Set up a GCP Console project.

    In the GCP Console:

    • Create or select a project.
    • Enable the Speech-to-Text API for that project.
    • Create a service account.
    • Download a private key as JSON.

    You can view and manage these resources at any time in the GCP Console.

  3. Install and initialize the Cloud SDK.
  4. Authorize the gcloud tool to use your service account by running the following command. Replace [PATH] with the location of the JSON key file that you downloaded.

    gcloud auth activate-service-account --key-file=[PATH]
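
Before activating the service account, it can help to confirm that [PATH] points at the right file. The following is a minimal sketch, not part of the gcloud tool: check_key_file is a hypothetical helper name, and the check assumes only that the downloaded JSON key contains a "type" field set to "service_account".

```shell
# Hypothetical helper (not part of gcloud): returns 0 if the file exists
# and looks like a service account key, 1 otherwise.
check_key_file() {
  key_file="$1"
  if [ ! -f "$key_file" ]; then
    echo "Key file not found: $key_file" >&2
    return 1
  fi
  # Assumption: a service account key downloaded as JSON contains
  # a "type" field whose value is "service_account".
  if ! grep -q '"type"[[:space:]]*:[[:space:]]*"service_account"' "$key_file"; then
    echo "File does not look like a service account key: $key_file" >&2
    return 1
  fi
  return 0
}

# Example use (KEY_PATH is a placeholder for your own path):
#   check_key_file "$KEY_PATH" && \
#     gcloud auth activate-service-account --key-file="$KEY_PATH"
```

The check is deliberately shallow: it does not validate the key itself, only that the path is not obviously wrong.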

Make an audio transcription request

Now you can use Speech-to-Text to transcribe an audio file to text. To send a recognize request to the Speech-to-Text API, open a command-line shell and run the following command.

gcloud ml speech recognize 'gs://cloud-samples-tests/speech/brooklyn.flac' \
    --language-code='en-US'

This command requests that Speech-to-Text transcribe the audio contained in a FLAC file hosted at a publicly accessible Cloud Storage location.

If the request is successful, the server returns a response in JSON format:

{
  "results": [
    {
      "alternatives": [
        {
          "confidence": 0.9840146,
          "transcript": "how old is the Brooklyn Bridge"
        }
      ]
    }
  ]
}
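
If you only need the transcribed text, you can filter the response in the shell. The sketch below assumes the response is pretty-printed with one field per line, as in the sample above. extract_transcripts is a hypothetical helper name, and a real JSON parser would be more robust than this line-based match.

```shell
# Hypothetical helper: reads a recognize response on stdin and prints the
# value of every "transcript" field, one per line.
# Assumption: the JSON is pretty-printed with one field per line.
# Escaped characters inside the transcript are printed as-is.
extract_transcripts() {
  sed -n 's/^[[:space:]]*"transcript":[[:space:]]*"\(.*\)",\{0,1\}[[:space:]]*$/\1/p'
}

# Example use with the request from this page:
#   gcloud ml speech recognize 'gs://cloud-samples-tests/speech/brooklyn.flac' \
#       --language-code='en-US' | extract_transcripts
```

If the response contains several results or alternatives, each transcript is printed on its own line in the order it appears.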

Congratulations! You've sent your first request to Cloud Speech-to-Text!
