Speech-to-Text API v2 adopts the latest Google Cloud API design, helping customers meet enterprise security and regulatory requirements out of the box.
These requirements are realized through the following:
Data Residency: Speech-to-Text v2 offers the broad range of our existing transcription models in Google Cloud regions such as Belgium or Singapore, so you can invoke them through a fully regionalized service.
Recognizer Resourcefulness: Recognizers are reusable recognition configurations that can contain a combination of model, language, and features. This resource-based implementation eliminates the need for dedicated service accounts for authentication and authorization.
Logging: Resource creation and transcriptions generate logs available in the Google Cloud console, allowing for better telemetry and debugging.
Encryption: Speech-to-Text v2 supports customer-managed encryption keys (CMEK) for all resources as well as for batch transcription.
Audio Auto-Detect: Speech-to-Text v2 can automatically detect the sample rate, channel count, and format of your audio files, so you don't need to provide that information in the request configuration.
Migrating from v1 to v2
Migration from the v1 API to the v2 API does not happen automatically. Minimal implementation changes are required to take advantage of the feature set.
Migrating in API
Similar to Speech-to-Text v1, to transcribe audio, you need to create a RecognitionConfig by selecting the language of your audio and the recognition model of your choice:
Python
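A minimal sketch of this step, assuming the google-cloud-speech client library is installed and application default credentials are configured. The function names, the "en-US" language code, and the "long" model are illustrative choices, not requirements. The recognizer ID "_" refers to the implicit default recognizer, so no recognizer resource has to exist beforehand.

```python
from typing import List


def recognizer_path(project_id: str, location: str = "global",
                    recognizer_id: str = "_") -> str:
    """Build a v2 recognizer resource name ("_" = implicit default recognizer)."""
    return f"projects/{project_id}/locations/{location}/recognizers/{recognizer_id}"


def transcribe_v2(project_id: str, audio_path: str,
                  language_code: str = "en-US", model: str = "long") -> List[str]:
    """Transcribe a local audio file with an inline RecognitionConfig."""
    # Imported lazily so recognizer_path() stays usable without the library.
    from google.cloud.speech_v2 import SpeechClient
    from google.cloud.speech_v2.types import cloud_speech

    with open(audio_path, "rb") as f:
        audio_content = f.read()

    client = SpeechClient()

    # Language and model are chosen here; the audio encoding is auto-detected.
    config = cloud_speech.RecognitionConfig(
        auto_decoding_config=cloud_speech.AutoDetectDecodingConfig(),
        language_codes=[language_code],
        model=model,
    )

    request = cloud_speech.RecognizeRequest(
        recognizer=recognizer_path(project_id),
        config=config,
        content=audio_content,
    )

    response = client.recognize(request=request)
    # Keep the top alternative of each result.
    return [r.alternatives[0].transcript for r in response.results if r.alternatives]
```

Calling transcribe_v2("my-project", "audio.wav") with a hypothetical project ID and file name would return one transcript string per recognized segment.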
If needed, select a region in which you want to use the Speech-to-Text API, and check the language and model availability in that region:
Python
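A minimal sketch of selecting a region, assuming the google-cloud-speech client library and the regional endpoint pattern LOCATION-speech.googleapis.com. The helper names are hypothetical, and "us-central1" in the usage note is only an example location. Language and model availability is not checked in code here; it still has to be confirmed against the supported-languages documentation for the chosen region.

```python
def regional_endpoint(location: str) -> str:
    """Return the API endpoint for a location ("global" uses the default host)."""
    if location == "global":
        return "speech.googleapis.com"
    return f"{location}-speech.googleapis.com"


def regional_recognizer_path(project_id: str, location: str,
                             recognizer_id: str = "_") -> str:
    """Recognizer resource name that matches the chosen location."""
    return f"projects/{project_id}/locations/{location}/recognizers/{recognizer_id}"


def make_regional_client(location: str):
    """Create a SpeechClient bound to the endpoint of the given location."""
    # Imported lazily so the string helpers work without the library installed.
    from google.api_core.client_options import ClientOptions
    from google.cloud.speech_v2 import SpeechClient

    return SpeechClient(
        client_options=ClientOptions(api_endpoint=regional_endpoint(location))
    )
```

With this sketch, make_regional_client("us-central1") pairs with regional_recognizer_path("my-project", "us-central1") in the request, since the location in the recognizer name is expected to match the endpoint the client talks to.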
Optionally, create a recognizer resource if you need to reuse a specific recognition configuration across many transcription requests:
Python
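A minimal sketch of creating a reusable recognizer, assuming the google-cloud-speech client library, configured credentials, and the global location. The function names and default values are illustrative. Recognizer creation is a long-running operation, so the sketch waits for its result before returning the resource name.

```python
def recognizer_parent(project_id: str, location: str = "global") -> str:
    """Parent resource under which the recognizer is created."""
    return f"projects/{project_id}/locations/{location}"


def create_recognizer(project_id: str, recognizer_id: str,
                      language_code: str = "en-US", model: str = "long") -> str:
    """Create a recognizer that stores a default recognition configuration."""
    # Imported lazily so recognizer_parent() works without the library installed.
    from google.cloud.speech_v2 import SpeechClient
    from google.cloud.speech_v2.types import cloud_speech

    client = SpeechClient()

    request = cloud_speech.CreateRecognizerRequest(
        parent=recognizer_parent(project_id),
        recognizer_id=recognizer_id,
        recognizer=cloud_speech.Recognizer(
            # Stored once and reused by every request that names this recognizer.
            default_recognition_config=cloud_speech.RecognitionConfig(
                auto_decoding_config=cloud_speech.AutoDetectDecodingConfig(),
                language_codes=[language_code],
                model=model,
            ),
        ),
    )

    # create_recognizer returns a long-running operation; block until it finishes.
    operation = client.create_recognizer(request=request)
    recognizer = operation.result()
    return recognizer.name
```

The returned name can then be passed as the recognizer field of later recognition requests, which may omit the inline configuration and rely on the stored defaults.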
There are other differences in the requests and responses in the new v2 API. For more details, see the reference documentation.
Migrating in UI
To migrate using the Speech page in the Google Cloud console, follow these steps:
Go to the Speech page in the Google Cloud console.
Navigate to the Transcriptions page.
Click New Transcription and select your audio in the Audio configuration tab.
In the Transcription options tab, select V2.