Package com.google.cloud.speech.v2 (4.12.0)

A client to Cloud Speech-to-Text API

The interfaces provided are listed below, along with usage samples.

SpeechClient

Service Description: Enables speech transcription and resource management.

Sample for SpeechClient:


 // This snippet has been automatically generated and should be regarded as a code template only.
 // It will require modifications to work:
 // - It may require correct/in-range values for request initialization.
 // - It may require specifying regional endpoints when creating the service client as shown in
 // https://cloud.google.com/java/docs/setup#configure_endpoints_for_the_client_library
 try (SpeechClient speechClient = SpeechClient.create()) {
   RecognizerName name = RecognizerName.of("[PROJECT]", "[LOCATION]", "[RECOGNIZER]");
   Recognizer response = speechClient.getRecognizer(name);
 }

Classes

AutoDetectDecodingConfig

Automatically detected decoding parameters. Supported for the following encodings:

WAV_LINEAR16: 16-bit signed little-endian PCM samples in a WAV container.
WAV_MULAW: 8-bit companded mulaw samples in a WAV container.
WAV_ALAW: 8-bit companded alaw samples in a WAV container.
RFC4867_5_AMR: AMR frames with an rfc4867.5 header.
RFC4867_5_AMRWB: AMR-WB frames with an rfc4867.5 header.
FLAC: FLAC frames in the "native FLAC" container format.
MP3: MPEG audio frames with optional (ignored) ID3 metadata.
OGG_OPUS: Opus audio frames in an Ogg container.
WEBM_OPUS: Opus audio frames in a WebM container.

Protobuf type google.cloud.speech.v2.AutoDetectDecodingConfig

AutoDetectDecodingConfig.Builder

Automatically detected decoding parameters. Supported for the following encodings:

WAV_LINEAR16: 16-bit signed little-endian PCM samples in a WAV container.
WAV_MULAW: 8-bit companded mulaw samples in a WAV container.
WAV_ALAW: 8-bit companded alaw samples in a WAV container.
RFC4867_5_AMR: AMR frames with an rfc4867.5 header.
RFC4867_5_AMRWB: AMR-WB frames with an rfc4867.5 header.
FLAC: FLAC frames in the "native FLAC" container format.
MP3: MPEG audio frames with optional (ignored) ID3 metadata.
OGG_OPUS: Opus audio frames in an Ogg container.
WEBM_OPUS: Opus audio frames in a WebM container.

Protobuf type google.cloud.speech.v2.AutoDetectDecodingConfig

BatchRecognizeFileMetadata

Metadata about a single file in a batch for BatchRecognize.

Protobuf type google.cloud.speech.v2.BatchRecognizeFileMetadata

BatchRecognizeFileMetadata.Builder

Metadata about a single file in a batch for BatchRecognize.

Protobuf type google.cloud.speech.v2.BatchRecognizeFileMetadata

BatchRecognizeFileResult

Final results for a single file.

Protobuf type google.cloud.speech.v2.BatchRecognizeFileResult

BatchRecognizeFileResult.Builder

Final results for a single file.

Protobuf type google.cloud.speech.v2.BatchRecognizeFileResult

BatchRecognizeMetadata

Operation metadata for BatchRecognize.

Protobuf type google.cloud.speech.v2.BatchRecognizeMetadata

BatchRecognizeMetadata.Builder

Operation metadata for BatchRecognize.

Protobuf type google.cloud.speech.v2.BatchRecognizeMetadata

BatchRecognizeRequest

Request message for the BatchRecognize method.

Protobuf type google.cloud.speech.v2.BatchRecognizeRequest

BatchRecognizeRequest.Builder

Request message for the BatchRecognize method.

Protobuf type google.cloud.speech.v2.BatchRecognizeRequest

BatchRecognizeResponse

Response message for BatchRecognize that is packaged into a longrunning Operation.

Protobuf type google.cloud.speech.v2.BatchRecognizeResponse

BatchRecognizeResponse.Builder

Response message for BatchRecognize that is packaged into a longrunning Operation.

Protobuf type google.cloud.speech.v2.BatchRecognizeResponse

Output type for Cloud Storage of BatchRecognize transcripts. Though this proto isn't returned in this API anywhere, the Cloud Storage transcripts will be this proto serialized and should be parsed as such.

Protobuf type google.cloud.speech.v2.BatchRecognizeResults

BatchRecognizeResults.Builder

Protobuf type google.cloud.speech.v2.BatchRecognizeResults

BatchRecognizeTranscriptionMetadata

Metadata about transcription for a single file (for example, progress percent).

Protobuf type google.cloud.speech.v2.BatchRecognizeTranscriptionMetadata

BatchRecognizeTranscriptionMetadata.Builder

Metadata about transcription for a single file (for example, progress percent).

Protobuf type google.cloud.speech.v2.BatchRecognizeTranscriptionMetadata

CloudSpeechProto

Config

Message representing the config for the Speech-to-Text API. This includes an optional KMS key with which incoming data will be encrypted.

Protobuf type google.cloud.speech.v2.Config

Config.Builder

Message representing the config for the Speech-to-Text API. This includes an optional KMS key with which incoming data will be encrypted.

Protobuf type google.cloud.speech.v2.Config

ConfigName

ConfigName.Builder

Builder for projects/{project}/locations/{location}/config.

CreateCustomClassRequest

Request message for the CreateCustomClass method.

Protobuf type google.cloud.speech.v2.CreateCustomClassRequest

CreateCustomClassRequest.Builder

Request message for the CreateCustomClass method.

Protobuf type google.cloud.speech.v2.CreateCustomClassRequest

CreatePhraseSetRequest

Request message for the CreatePhraseSet method.

Protobuf type google.cloud.speech.v2.CreatePhraseSetRequest

CreatePhraseSetRequest.Builder

Request message for the CreatePhraseSet method.

Protobuf type google.cloud.speech.v2.CreatePhraseSetRequest

CreateRecognizerRequest

Request message for the CreateRecognizer method.

Protobuf type google.cloud.speech.v2.CreateRecognizerRequest

CreateRecognizerRequest.Builder

Request message for the CreateRecognizer method.

Protobuf type google.cloud.speech.v2.CreateRecognizerRequest

CustomClass

CustomClass for biasing in speech recognition. Used to define a set of words or phrases that represents a common concept or theme likely to appear in your audio, for example a list of passenger ship names.

Protobuf type google.cloud.speech.v2.CustomClass

CustomClass.Builder

Protobuf type google.cloud.speech.v2.CustomClass

CustomClass.ClassItem

An item of the class.

Protobuf type google.cloud.speech.v2.CustomClass.ClassItem

CustomClass.ClassItem.Builder

An item of the class.

Protobuf type google.cloud.speech.v2.CustomClass.ClassItem

CustomClassName

CustomClassName.Builder

Builder for projects/{project}/locations/{location}/customClasses/{custom_class}.

DeleteCustomClassRequest

Request message for the DeleteCustomClass method.

Protobuf type google.cloud.speech.v2.DeleteCustomClassRequest

DeleteCustomClassRequest.Builder

Request message for the DeleteCustomClass method.

Protobuf type google.cloud.speech.v2.DeleteCustomClassRequest

DeletePhraseSetRequest

Request message for the DeletePhraseSet method.

Protobuf type google.cloud.speech.v2.DeletePhraseSetRequest

DeletePhraseSetRequest.Builder

Request message for the DeletePhraseSet method.

Protobuf type google.cloud.speech.v2.DeletePhraseSetRequest

DeleteRecognizerRequest

Request message for the DeleteRecognizer method.

Protobuf type google.cloud.speech.v2.DeleteRecognizerRequest

DeleteRecognizerRequest.Builder

Request message for the DeleteRecognizer method.

Protobuf type google.cloud.speech.v2.DeleteRecognizerRequest

ExplicitDecodingConfig

Explicitly specified decoding parameters.

Protobuf type google.cloud.speech.v2.ExplicitDecodingConfig

ExplicitDecodingConfig.Builder

Explicitly specified decoding parameters.

Protobuf type google.cloud.speech.v2.ExplicitDecodingConfig

GcsOutputConfig

Output configurations for Cloud Storage.

Protobuf type google.cloud.speech.v2.GcsOutputConfig

GcsOutputConfig.Builder

Output configurations for Cloud Storage.

Protobuf type google.cloud.speech.v2.GcsOutputConfig

GetConfigRequest

Request message for the GetConfig method.

Protobuf type google.cloud.speech.v2.GetConfigRequest

GetConfigRequest.Builder

Request message for the GetConfig method.

Protobuf type google.cloud.speech.v2.GetConfigRequest

GetCustomClassRequest

Request message for the GetCustomClass method.

Protobuf type google.cloud.speech.v2.GetCustomClassRequest

GetCustomClassRequest.Builder

Request message for the GetCustomClass method.

Protobuf type google.cloud.speech.v2.GetCustomClassRequest

GetPhraseSetRequest

Request message for the GetPhraseSet method.

Protobuf type google.cloud.speech.v2.GetPhraseSetRequest

GetPhraseSetRequest.Builder

Request message for the GetPhraseSet method.

Protobuf type google.cloud.speech.v2.GetPhraseSetRequest

GetRecognizerRequest

Request message for the GetRecognizer method.

Protobuf type google.cloud.speech.v2.GetRecognizerRequest

GetRecognizerRequest.Builder

Request message for the GetRecognizer method.

Protobuf type google.cloud.speech.v2.GetRecognizerRequest

InlineOutputConfig

Output configurations for inline response.

Protobuf type google.cloud.speech.v2.InlineOutputConfig

InlineOutputConfig.Builder

Output configurations for inline response.

Protobuf type google.cloud.speech.v2.InlineOutputConfig

ListCustomClassesRequest

Request message for the ListCustomClasses method.

Protobuf type google.cloud.speech.v2.ListCustomClassesRequest

ListCustomClassesRequest.Builder

Request message for the ListCustomClasses method.

Protobuf type google.cloud.speech.v2.ListCustomClassesRequest

ListCustomClassesResponse

Response message for the ListCustomClasses method.

Protobuf type google.cloud.speech.v2.ListCustomClassesResponse

ListCustomClassesResponse.Builder

Response message for the ListCustomClasses method.

Protobuf type google.cloud.speech.v2.ListCustomClassesResponse

ListPhraseSetsRequest

Request message for the ListPhraseSets method.

Protobuf type google.cloud.speech.v2.ListPhraseSetsRequest

ListPhraseSetsRequest.Builder

Request message for the ListPhraseSets method.

Protobuf type google.cloud.speech.v2.ListPhraseSetsRequest

ListPhraseSetsResponse

Response message for the ListPhraseSets method.

Protobuf type google.cloud.speech.v2.ListPhraseSetsResponse

ListPhraseSetsResponse.Builder

Response message for the ListPhraseSets method.

Protobuf type google.cloud.speech.v2.ListPhraseSetsResponse

ListRecognizersRequest

Request message for the ListRecognizers method.

Protobuf type google.cloud.speech.v2.ListRecognizersRequest

ListRecognizersRequest.Builder

Request message for the ListRecognizers method.

Protobuf type google.cloud.speech.v2.ListRecognizersRequest

ListRecognizersResponse

Response message for the ListRecognizers method.

Protobuf type google.cloud.speech.v2.ListRecognizersResponse

ListRecognizersResponse.Builder

Response message for the ListRecognizers method.

Protobuf type google.cloud.speech.v2.ListRecognizersResponse

LocationName

LocationName.Builder

Builder for projects/{project}/locations/{location}.

OperationMetadata

Represents the metadata of a long-running operation.

Protobuf type google.cloud.speech.v2.OperationMetadata

OperationMetadata.Builder

Represents the metadata of a long-running operation.

Protobuf type google.cloud.speech.v2.OperationMetadata

PhraseSet

PhraseSet for biasing in speech recognition. A PhraseSet is used to provide "hints" to the speech recognizer to favor specific words and phrases in the results.

Protobuf type google.cloud.speech.v2.PhraseSet

PhraseSet.Builder

PhraseSet for biasing in speech recognition. A PhraseSet is used to provide "hints" to the speech recognizer to favor specific words and phrases in the results.

Protobuf type google.cloud.speech.v2.PhraseSet

PhraseSet.Phrase

A Phrase contains words and phrase "hints" so that the speech recognition is more likely to recognize them. This can be used to improve the accuracy for specific words and phrases, for example, if specific commands are typically spoken by the user. This can also be used to add additional words to the vocabulary of the recognizer. List items can also include CustomClass references containing groups of words that represent common concepts that occur in natural language.

Protobuf type google.cloud.speech.v2.PhraseSet.Phrase

PhraseSet.Phrase.Builder

Protobuf type google.cloud.speech.v2.PhraseSet.Phrase

PhraseSetName

PhraseSetName.Builder

Builder for projects/{project}/locations/{location}/phraseSets/{phrase_set}.

RecognitionConfig

Provides information to the Recognizer that specifies how to process the recognition request.

Protobuf type google.cloud.speech.v2.RecognitionConfig

RecognitionConfig.Builder

Provides information to the Recognizer that specifies how to process the recognition request.

Protobuf type google.cloud.speech.v2.RecognitionConfig

RecognitionFeatures

Available recognition features.

Protobuf type google.cloud.speech.v2.RecognitionFeatures

RecognitionFeatures.Builder

Available recognition features.

Protobuf type google.cloud.speech.v2.RecognitionFeatures

RecognitionOutputConfig

Configuration options for the output(s) of recognition.

Protobuf type google.cloud.speech.v2.RecognitionOutputConfig

RecognitionOutputConfig.Builder

Configuration options for the output(s) of recognition.

Protobuf type google.cloud.speech.v2.RecognitionOutputConfig

RecognitionResponseMetadata

Metadata about the recognition request and response.

Protobuf type google.cloud.speech.v2.RecognitionResponseMetadata

RecognitionResponseMetadata.Builder

Metadata about the recognition request and response.

Protobuf type google.cloud.speech.v2.RecognitionResponseMetadata

RecognizeRequest

Request message for the Recognize method. Either content or uri must be supplied. Supplying both or neither returns INVALID_ARGUMENT. See content limits.

Protobuf type google.cloud.speech.v2.RecognizeRequest

RecognizeRequest.Builder

Request message for the Recognize method. Either content or uri must be supplied. Supplying both or neither returns INVALID_ARGUMENT. See content limits.

Protobuf type google.cloud.speech.v2.RecognizeRequest

RecognizeResponse

Response message for the Recognize method.

Protobuf type google.cloud.speech.v2.RecognizeResponse

RecognizeResponse.Builder

Response message for the Recognize method.

Protobuf type google.cloud.speech.v2.RecognizeResponse

Recognizer

A Recognizer message. Stores recognition configuration and metadata.

Protobuf type google.cloud.speech.v2.Recognizer

Recognizer.Builder

A Recognizer message. Stores recognition configuration and metadata.

Protobuf type google.cloud.speech.v2.Recognizer

RecognizerName

RecognizerName.Builder

Builder for projects/{project}/locations/{location}/recognizers/{recognizer}.

SpeakerDiarizationConfig

Configuration to enable speaker diarization.

Protobuf type google.cloud.speech.v2.SpeakerDiarizationConfig

SpeakerDiarizationConfig.Builder

Configuration to enable speaker diarization.

Protobuf type google.cloud.speech.v2.SpeakerDiarizationConfig

SpeechAdaptation

Provides "hints" to the speech recognizer to favor specific words and phrases in the results. PhraseSets can be specified as an inline resource, or a reference to an existing PhraseSet resource.

Protobuf type google.cloud.speech.v2.SpeechAdaptation

SpeechAdaptation.AdaptationPhraseSet

A biasing PhraseSet, which can be either a string referencing the name of an existing PhraseSets resource, or an inline definition of a PhraseSet.

Protobuf type google.cloud.speech.v2.SpeechAdaptation.AdaptationPhraseSet

SpeechAdaptation.AdaptationPhraseSet.Builder

A biasing PhraseSet, which can be either a string referencing the name of an existing PhraseSets resource, or an inline definition of a PhraseSet.

Protobuf type google.cloud.speech.v2.SpeechAdaptation.AdaptationPhraseSet

SpeechAdaptation.Builder

Provides "hints" to the speech recognizer to favor specific words and phrases in the results. PhraseSets can be specified as an inline resource, or a reference to an existing PhraseSet resource.

Protobuf type google.cloud.speech.v2.SpeechAdaptation

SpeechClient

Service Description: Enables speech transcription and resource management.

This class provides the ability to make remote calls to the backing service through method calls that map to API methods. Sample code to get started:


 // This snippet has been automatically generated and should be regarded as a code template only.
 // It will require modifications to work:
 // - It may require correct/in-range values for request initialization.
 // - It may require specifying regional endpoints when creating the service client as shown in
 // https://cloud.google.com/java/docs/setup#configure_endpoints_for_the_client_library
 try (SpeechClient speechClient = SpeechClient.create()) {
   RecognizerName name = RecognizerName.of("[PROJECT]", "[LOCATION]", "[RECOGNIZER]");
   Recognizer response = speechClient.getRecognizer(name);
 }

Note: close() needs to be called on the SpeechClient object to clean up resources such as threads. In the example above, try-with-resources is used, which automatically calls close().

The surface of this class includes several types of Java methods for each of the API's methods:

A "flattened" method. With this type of method, the fields of the request type have been converted into function parameters. It may be the case that not all fields are available as parameters, and not every API method will have a flattened method entry point.
A "request object" method. This type of method only takes one parameter, a request object, which must be constructed before the call. Not every API method will have a request object method.
A "callable" method. This type of method takes no parameters and returns an immutable API callable object, which can be used to initiate calls to the service.

See the individual methods for example code.

Many parameters require resource names to be formatted in a particular way. To assist with these names, this class includes a format method for each type of name, and additionally a parse method to extract the individual identifiers contained within names that are returned.

This class can be customized by passing in a custom instance of SpeechSettings to create(). For example:

To customize credentials:


 // This snippet has been automatically generated and should be regarded as a code template only.
 // It will require modifications to work:
 // - It may require correct/in-range values for request initialization.
 // - It may require specifying regional endpoints when creating the service client as shown in
 // https://cloud.google.com/java/docs/setup#configure_endpoints_for_the_client_library
 SpeechSettings speechSettings =
     SpeechSettings.newBuilder()
         .setCredentialsProvider(FixedCredentialsProvider.create(myCredentials))
         .build();
 SpeechClient speechClient = SpeechClient.create(speechSettings);

To customize the endpoint:


 // This snippet has been automatically generated and should be regarded as a code template only.
 // It will require modifications to work:
 // - It may require correct/in-range values for request initialization.
 // - It may require specifying regional endpoints when creating the service client as shown in
 // https://cloud.google.com/java/docs/setup#configure_endpoints_for_the_client_library
 SpeechSettings speechSettings = SpeechSettings.newBuilder().setEndpoint(myEndpoint).build();
 SpeechClient speechClient = SpeechClient.create(speechSettings);

To use REST (HTTP1.1/JSON) transport (instead of gRPC) for sending and receiving requests over the wire:


 // This snippet has been automatically generated and should be regarded as a code template only.
 // It will require modifications to work:
 // - It may require correct/in-range values for request initialization.
 // - It may require specifying regional endpoints when creating the service client as shown in
 // https://cloud.google.com/java/docs/setup#configure_endpoints_for_the_client_library
 SpeechSettings speechSettings = SpeechSettings.newHttpJsonBuilder().build();
 SpeechClient speechClient = SpeechClient.create(speechSettings);

Please refer to the GitHub repository's samples for more quickstart code snippets.

SpeechClient.ListCustomClassesFixedSizeCollection

SpeechClient.ListCustomClassesPage

SpeechClient.ListCustomClassesPagedResponse

SpeechClient.ListLocationsFixedSizeCollection

SpeechClient.ListLocationsPage

SpeechClient.ListLocationsPagedResponse

SpeechClient.ListPhraseSetsFixedSizeCollection

SpeechClient.ListPhraseSetsPage

SpeechClient.ListPhraseSetsPagedResponse