Pricing

Cloud Speech-to-Text is priced monthly based on the amount of audio successfully processed by the service, measured in increments rounded up to 15 seconds.

Pricing Table

Feature 0-60 minutes Over 60 minutes, up to 1 million minutes
Speech Recognition (all models except video) Free $0.006 USD / 15 seconds*
Video Speech Recognition Free $0.012 USD / 15 seconds*

This pricing is for applications on personal systems (e.g., phones, tablets, laptops, desktops). Please contact us for approval and pricing to use the Speech-to-Text API on embedded devices (e.g., cars, TVs, appliances, or speakers).

* Each request is rounded up to the nearest increment of 15 seconds. For example, if you make three separate requests, each containing 7 seconds of audio, you are billed $0.018 USD for 45 seconds (3 × 15 seconds) of audio. Fractions of seconds are included when rounding up to the nearest increment of 15 seconds. That is, 15.14 seconds are rounded up and billed as 30 seconds.

Monthly usage is capped at 1 million minutes per month. For usage above 1 million minutes of audio per month, we would like to understand more about your needs. Please submit a Cloud Speech-to-Text Quota Request for your project.

Google Cloud Platform Costs

If you store audio files to be recognized in Google Cloud Storage, or use other Google Cloud Platform resources in tandem with Speech-to-Text , such as Google App Engine instances, then you will also be billed for the use of those services. See the Google Cloud Platform Pricing Calculator to determine other costs based on current rates.

To view your current billing status in the Cloud Console, including usage and your current bill, see the Billing page. For more details about managing your account, see the Cloud Billing Documentation or Billing and Payments Support.

Was this page helpful? Let us know how we did:

Send feedback about...

Cloud Speech API Documentation