Cloud Speech-to-Text V2 API - Module Google::Cloud::Speech::V2::ExplicitDecodingConfig::AudioEncoding (v0.15.0)

Reference documentation and code samples for the Cloud Speech-to-Text V2 API module Google::Cloud::Speech::V2::ExplicitDecodingConfig::AudioEncoding.

Supported audio data encodings.

Constants

AUDIO_ENCODING_UNSPECIFIED

value: 0
Default value. This value is unused.

LINEAR16

value: 1
Headerless 16-bit signed little-endian PCM samples.

MULAW

value: 2
Headerless 8-bit companded mulaw samples.

ALAW

value: 3
Headerless 8-bit companded alaw samples.

AMR

value: 4
AMR frames with an rfc4867.5 header.

AMR_WB

value: 5
AMR-WB frames with an rfc4867.5 header.

FLAC

value: 6
FLAC frames in the "native FLAC" container format.

MP3

value: 7
MPEG audio frames with optional (ignored) ID3 metadata.

OGG_OPUS

value: 8
Opus audio frames in an Ogg container.

WEBM_OPUS

value: 9
Opus audio frames in a WebM container.

MP4_AAC

value: 10
AAC audio frames in an MP4 container.

M4A_AAC

value: 11
AAC audio frames in an M4A container.

MOV_AAC

value: 12
AAC audio frames in an MOV container.