Pricing

Speech-to-Text On-Prem is priced based on the amount of audio the service successfully processes each month. Usage is measured in 15-second increments, and each request is rounded up to the nearest increment.

You can view your current billing status, including usage and your current bill, in the Cloud Console. For more details about managing your account, see the Cloud billing documentation or billing and payments support.

Pricing Table

Feature               Standard models
                      0-60 minutes    Over 60 minutes, up to 1 million minutes
Speech Recognition    Free            $0.006 / 15 seconds **

** Each request is rounded up to the nearest increment of 15 seconds.

Pricing calculations

Each request is rounded up to the nearest increment of 15 seconds. For example, if you make three separate requests, each containing 7 seconds of audio, each request is billed as 15 seconds, so you are billed $0.018 USD (3 × $0.006) for 45 seconds (3 × 15 seconds) of audio. Fractions of a second are included when rounding up to the nearest increment of 15 seconds. That is, 15.14 seconds is rounded up and billed as 30 seconds.
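
The rounding rule above can be sketched in a few lines of Python. This is an illustrative estimate only, not an official billing tool: the function names billed_seconds and estimate_cost are hypothetical, the rate is taken from the pricing table, and the sketch deliberately ignores the free 0-60 minute tier, because how that tier is applied to rounded usage is not specified here.

```python
import math
from decimal import Decimal

INCREMENT_SECONDS = 15
# USD per 15-second increment, taken from the pricing table above.
PRICE_PER_INCREMENT = Decimal("0.006")


def billed_seconds(duration_seconds: float) -> int:
    """Round one request's audio duration up to the next 15-second increment."""
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    return math.ceil(duration_seconds / INCREMENT_SECONDS) * INCREMENT_SECONDS


def estimate_cost(durations_seconds) -> Decimal:
    """Estimate the charge for a list of requests, rounding each one separately.

    Assumption: the free tier is NOT applied; every increment is charged.
    """
    total_increments = sum(
        billed_seconds(d) // INCREMENT_SECONDS for d in durations_seconds
    )
    return total_increments * PRICE_PER_INCREMENT


if __name__ == "__main__":
    # Three 7-second requests are each billed as 15 seconds.
    print(estimate_cost([7, 7, 7]))
    # 15.14 seconds crosses into the second increment.
    print(billed_seconds(15.14))
```

Because rounding happens per request, many short requests cost more than one long request carrying the same total audio: three 7-second requests are billed as 45 seconds, while a single 21-second request would be billed as 30 seconds.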

Google Cloud Platform Costs

If you store audio files to be recognized in Google Cloud Storage, or use other Google Cloud Platform resources in tandem with Speech-to-Text On-Prem, such as Google Compute Engine instances, then you will also be billed for the use of those services. See the Google Cloud Platform pricing calculator to determine other costs based on current rates. Usage of Speech-to-Text On-Prem with Anthos clusters may also incur additional Anthos licensing costs.