A partire dal 29 aprile 2025, i modelli Gemini 1.5 Pro e Gemini 1.5 Flash non sono disponibili nei progetti che non li hanno mai utilizzati, inclusi i nuovi progetti. Per maggiori dettagli, vedi Versioni e ciclo di vita dei modelli.
[[["Facile da capire","easyToUnderstand","thumb-up"],["Il problema è stato risolto","solvedMyProblem","thumb-up"],["Altra","otherUp","thumb-up"]],[["Difficile da capire","hardToUnderstand","thumb-down"],["Informazioni o codice di esempio errati","incorrectInformationOrSampleCode","thumb-down"],["Mancano le informazioni o gli esempi di cui ho bisogno","missingTheInformationSamplesINeed","thumb-down"],["Problema di traduzione","translationIssue","thumb-down"],["Altra","otherDown","thumb-down"]],["Ultimo aggiornamento 2025-09-04 UTC."],[],[],null,["# Convert speech to text\n\nThis page shows you how to use Vertex AI Studio to convert speech to text.\n\nTo learn how to convert text to speech, see\n[Convert text to speech](/vertex-ai/generative-ai/docs/speech/text-to-speech).\n\nConvert speech to text\n----------------------\n\nTo convert speech to text, do the following:\n\n1. In the Vertex AI section of the Google Cloud console, go to\n the **Vertex AI Studio** page.\n\n [Go to Vertex AI Studio](https://console.cloud.google.com/vertex-ai/studio/overview)\n2. Click **Generate speech**.\n\n3. Select the **Speech-to-text** tab.\n\n4. In **Speech** , click **Browse** to select the audio file that you want to\n convert to text.\n\n5. In the **Language** selector box, select the language of the speech in the\n audio file.\n\n6. Click **Submit**.\n\n The converted text appears in **Text**.\n\nLimitations\n-----------\n\n- Audio files can be a maximum 60 seconds or 10 MB (whichever is less).\n- Files are transcribed with the [Chirp](https://cloud.google.com/speech-to-text/v2/docs/usm/usm-model) model.\n- Only 16-bit linear PCM WAV files are supported.\n\nYou can use the [Speech-to-Text UI](/speech-to-text/docs/transcribe-console) directly to overcome these limitations.\n\nWhat's next\n-----------\n\n- For more models, advanced features, and ability to transcribe files up to 8 hours, see [Speech-to-Text](/speech-to-text/docs/transcribe-console)."]]