转录本地文件
使用集合让一切井井有条
根据您的偏好保存内容并对其进行分类。
转写短音频文件。
深入探索
如需查看包含此代码示例的详细文档,请参阅以下内容:
代码示例
如未另行说明,那么本页面中的内容已根据知识共享署名 4.0 许可获得了许可,并且代码示例已根据 Apache 2.0 许可获得了许可。有关详情,请参阅 Google 开发者网站政策。Java 是 Oracle 和/或其关联公司的注册商标。
[[["易于理解","easyToUnderstand","thumb-up"],["解决了我的问题","solvedMyProblem","thumb-up"],["其他","otherUp","thumb-up"]],[["很难理解","hardToUnderstand","thumb-down"],["信息或示例代码不正确","incorrectInformationOrSampleCode","thumb-down"],["没有我需要的信息/示例","missingTheInformationSamplesINeed","thumb-down"],["翻译问题","translationIssue","thumb-down"],["其他","otherDown","thumb-down"]],[],[],[],null,["# Transcribe a local file\n\nTranscribe a short audio file.\n\nExplore further\n---------------\n\n\nFor detailed documentation that includes this code sample, see the following:\n\n- [Send a transcription request to Cloud Speech-to-Text On-Prem](/speech-to-text/priv/docs/transcribe)\n\nCode sample\n-----------\n\n### Python\n\n\nTo learn how to install and use the client library for Speech-to-Text, see\n[Speech-to-Text client libraries](/speech-to-text/docs/client-libraries).\n\n\nFor more information, see the\n[Speech-to-Text Python API\nreference documentation](/python/docs/reference/speech/latest).\n\n\nTo authenticate to Speech-to-Text, set up Application Default Credentials.\nFor more information, see\n\n[Set up authentication for a local development environment](/docs/authentication/set-up-adc-local-dev-environment).\n\n def transcribe_onprem(\n local_file_path: str,\n api_endpoint: str,\n ) -\u003e speech_v1p1beta1.RecognizeResponse:\n \"\"\"\n Transcribe a short audio file using synchronous speech recognition on-prem\n\n Args:\n local_file_path: The path to local audio file, e.g. /path/audio.wav\n api_endpoint: Endpoint to call for speech recognition, e.g. 0.0.0.0:10000\n\n Returns:\n The speech recognition response\n {\n \"\"\"\n # api_endpoint = '0.0.0.0:10000'\n # local_file_path = '../resources/two_channel_16k.raw'\n\n # Create a gRPC channel to your server\n channel = grpc.insecure_channel(target=api_endpoint)\n transport = speech_v1p1beta1.services.speech.transports.SpeechGrpcTransport(\n channel=channel\n )\n\n client = speech_v1p1beta1.SpeechClient(transport=transport)\n\n # The language of the supplied audio\n language_code = \"en-US\"\n\n # Sample rate in Hertz of the audio data sent\n sample_rate_hertz = 16000\n\n # Encoding of audio data sent. This sample sets this explicitly.\n # This field is optional for FLAC and WAV audio formats.\n encoding = speech_v1p1beta1.RecognitionConfig.AudioEncoding.LINEAR16\n config = {\n \"encoding\": encoding,\n \"language_code\": language_code,\n \"sample_rate_hertz\": sample_rate_hertz,\n }\n with io.open(local_file_path, \"rb\") as f:\n content = f.read()\n audio = {\"content\": content}\n\n response = client.recognize(request={\"config\": config, \"audio\": audio})\n for result in response.results:\n # First alternative is the most probable result\n alternative = result.alternatives[0]\n print(f\"Transcript: {alternative.transcript}\")\n\n return response\n\nWhat's next\n-----------\n\n\nTo search and filter code samples for other Google Cloud products, see the\n[Google Cloud sample browser](/docs/samples?product=speech)."]]