Mantieni tutto organizzato con le raccolte
Salva e classifica i contenuti in base alle tue preferenze.
Trascrivere la voce in testo utilizzando gcloud CLI
Questa pagina mostra come inviare una richiesta di riconoscimento vocale a Speech-to-Text utilizzando lo strumento gcloud dalla riga di comando.
Speech-to-Text consente di integrare facilmente le tecnologie di riconoscimento vocale di Google nelle applicazioni degli sviluppatori. Puoi inviare
dati audio all'API Speech-to-Text, che restituisce una
trascrizione di testo del file audio. Per saperne di più sul servizio, consulta la pagina Nozioni di base di Speech-to-Text.
Prima di iniziare
Prima di poter inviare una richiesta all'API Speech-to-Text, devi aver completato
le seguenti azioni. Per informazioni dettagliate, consulta la pagina Prima di iniziare.
Abilita Speech-to-Text in un progetto GCP.
Assicurati che la fatturazione sia attivata per Speech-to-Text.
Installa Google Cloud CLI.
Dopo l'installazione,
inizializza Google Cloud CLI eseguendo il seguente comando:
(Facoltativo) Crea un nuovo bucket Google Cloud Storage per archiviare i dati audio.
Esecuzione di una richiesta di trascrizione audio
Ora puoi utilizzare Speech-to-Text per trascrivere un file audio in testo. Utilizza il seguente esempio di codice per inviare una richiesta
recognize all'API Speech-to-Text.
Apri la shell della riga di comando ed esegui il comando seguente.
[[["Facile da capire","easyToUnderstand","thumb-up"],["Il problema è stato risolto","solvedMyProblem","thumb-up"],["Altra","otherUp","thumb-up"]],[["Difficile da capire","hardToUnderstand","thumb-down"],["Informazioni o codice di esempio errati","incorrectInformationOrSampleCode","thumb-down"],["Mancano le informazioni o gli esempi di cui ho bisogno","missingTheInformationSamplesINeed","thumb-down"],["Problema di traduzione","translationIssue","thumb-down"],["Altra","otherDown","thumb-down"]],["Ultimo aggiornamento 2025-09-09 UTC."],[],[],null,["# Quickstart: Transcribe speech to text by using the gcloud CLI\n\nTranscribe speech to text by using the gcloud CLI\n=================================================\n\nThis page shows you how to send a speech recognition request to\nSpeech-to-Text using the\n[`gcloud`](/sdk/gcloud/reference/ml/speech) tool from the command\nline.\n\n\nSpeech-to-Text enables easy integration of Google speech\nrecognition technologies into developer applications. You can send\naudio data to the Speech-to-Text API, which then returns a\ntext transcription of that audio file. For more information about\nthe service, see\n[Speech-to-Text basics](/speech-to-text/docs/basics).\n\nBefore you begin\n----------------\n\nBefore you can send a request to the Speech-to-Text API, you must have completed\nthe following actions. See the\n[before you begin](/speech-to-text/docs/before-you-begin) page for details.\n\n- Enable Speech-to-Text on a GCP project.\n 1. Make sure billing is enabled for Speech-to-Text.\n-\n [Install](/sdk/docs/install) the Google Cloud CLI.\n\n After installation,\n [initialize](/sdk/docs/initializing) the Google Cloud CLI by running the following command:\n\n ```bash\n gcloud init\n ```\n\n\n If you're using an external identity provider (IdP), you must first\n [sign in to the gcloud CLI with your federated identity](/iam/docs/workforce-log-in-gcloud).\n- (Optional) Create a new Google Cloud Storage bucket to store your audio data.\n\nMake an audio transcription request\n-----------------------------------\n\nNow you can use Speech-to-Text to transcribe an audio file\nto text. Use the following code sample to send a\n[`recognize`](/speech-to-text/docs/reference/rest/v1/speech/recognize) request to the Speech-to-Text API.\n\nOpen the command line shell and run the following command. \n\n```bash\ngcloud ml speech recognize gs://cloud-samples-tests/speech/brooklyn.flac \\\n --language-code=en-US\n```\n\nThis command requests that Speech-to-Text transcribe\nthe audio contained in a [FLAC](/speech/docs/encoding) hosted at\na publicly accessible location.\n\nIf the request is successful, the server returns a response in\nJSON format: \n\n```text\n{\n \"results\": [\n {\n \"alternatives\": [\n {\n \"confidence\": 0.9840146,\n \"transcript\": \"how old is the Brooklyn Bridge\"\n }\n ]\n }\n ]\n}\n```\n\nCongratulations! You've sent your first request to Speech-to-Text.\n\n\nIf you receive an error or an empty response from\nSpeech-to-Text, take a look at the\n[troubleshooting](/speech-to-text/docs/troubleshooting) and\n[error mitigation](/speech-to-text/docs/error-messages) steps.\n\nClean up\n--------\n\n\nTo avoid incurring charges to your Google Cloud account for\nthe resources used on this page, follow these steps.\n\n- Use the [Google Cloud console](https://console.cloud.google.com/) to delete your project if you do not need it.\n\nWhat's next\n-----------\n\n\n- Practice [transcribing short audio files](/speech-to-text/docs/sync-recognize).\n- Learn how to [batch long audio files for speech recognition](/speech-to-text/docs/async-recognize).\n- Learn how to [transcribe streaming audio](/speech-to-text/docs/streaming-recognize) like from a microphone.\n- Get started with the Speech-to-Text in your language of choice by using a [Speech-to-Text client library](/speech-to-text/docs/reference/libraries).\n- Work through the [sample applications](/speech-to-text/docs/samples).\n- For best performance, accuracy, and other tips, see the [best practices](/speech-to-text/docs/best-practices) documentation.\n\n\u003cbr /\u003e"]]