Make an audio transcription request (beta)
Transcribe a local audio file synchronously.
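The Python sample on this page passes a Cloud Storage URI rather than a file on disk. As a minimal sketch for the local-file case, assuming the audio argument accepts inline bytes through a `content` field (the counterpart of `uri` in `RecognitionAudio`), the request arguments could be built as follows. The helper name `build_local_request` and its parameters are hypothetical and not part of the client library.

```python
from pathlib import Path


def build_local_request(path, encoding, language_code="en-US", sample_rate_hertz=44100):
    """Hypothetical helper: build config and audio arguments for a local file."""
    # Read the whole file into memory; inline audio is sent as raw bytes.
    content = Path(path).read_bytes()
    config = {
        "language_code": language_code,
        "sample_rate_hertz": sample_rate_hertz,
        "encoding": encoding,
    }
    # "content" carries inline bytes; a Cloud Storage file would use "uri" instead.
    audio = {"content": content}
    return config, audio


# Assumed usage with the client library (requires credentials, so left commented):
# from google.cloud import speech_v1p1beta1 as speech
# config, audio = build_local_request(
#     "audio.mp3", speech.RecognitionConfig.AudioEncoding.MP3
# )
# response = speech.SpeechClient().recognize(config=config, audio=audio)
```

Keeping the file reading separate from the network call makes the request arguments easy to inspect before anything is sent.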
Code sample
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
### Python

To learn how to install and use the client library for Speech-to-Text, see
[Speech-to-Text client libraries](/speech-to-text/docs/client-libraries).

For more information, see the
[Speech-to-Text Python API reference documentation](/python/docs/reference/speech/latest).

To authenticate to Speech-to-Text, set up Application Default Credentials.
For more information, see
[Set up authentication for a local development environment](/docs/authentication/set-up-adc-local-dev-environment).

```python
from google.cloud import speech_v1p1beta1 as speech


def sample_recognize(storage_uri: str) -> speech.RecognizeResponse:
    """
    Performs synchronous speech recognition on an audio file.

    Args:
      storage_uri: URI for audio file in Cloud Storage, e.g. gs://[BUCKET]/[FILE]

    Returns:
      The response returned by the recognize call.
    """

    client = speech.SpeechClient()

    # storage_uri = 'gs://cloud-samples-data/speech/brooklyn_bridge.mp3'

    # The language of the supplied audio
    language_code = "en-US"

    # Sample rate in Hertz of the audio data sent
    sample_rate_hertz = 44100

    # Encoding of audio data sent. This sample sets this explicitly.
    # This field is optional for FLAC and WAV audio formats.
    encoding = speech.RecognitionConfig.AudioEncoding.MP3
    config = {
        "language_code": language_code,
        "sample_rate_hertz": sample_rate_hertz,
        "encoding": encoding,
    }
    audio = {"uri": storage_uri}

    response = client.recognize(config=config, audio=audio)

    for result in response.results:
        # First alternative is the most probable result
        alternative = result.alternatives[0]
        print(f"Transcript: {alternative.transcript}")

    # Return the response so the function matches its annotated return type
    return response
```

What's next
-----------

To search and filter code samples for other Google Cloud products, see the
[Google Cloud sample browser](/docs/samples?product=speech).
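The loop in the Python sample prints the first alternative of each result. A minimal sketch of collecting those transcripts into a list instead is shown here, assuming only the attribute layout the sample already uses (`response.results`, `result.alternatives`, `alternative.transcript`). The function name `top_transcripts` and the `SimpleNamespace` stand-in for a real response are hypothetical and exist only so the sketch runs without network access.

```python
from types import SimpleNamespace


def top_transcripts(response):
    """Return the first (most probable) transcript of each result."""
    transcripts = []
    for result in response.results:
        # Skip results without alternatives rather than raising IndexError.
        if not result.alternatives:
            continue
        transcripts.append(result.alternatives[0].transcript)
    return transcripts


# Stand-in object mimicking the shape of a response (illustrative data only).
fake_response = SimpleNamespace(
    results=[
        SimpleNamespace(alternatives=[SimpleNamespace(transcript="first segment")]),
        SimpleNamespace(alternatives=[]),
        SimpleNamespace(alternatives=[SimpleNamespace(transcript="second segment")]),
    ]
)

print(" ".join(top_transcripts(fake_response)))
```

Returning a list rather than printing inside the loop lets callers join, store, or post-process the text as needed.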