Starting April 29, 2025, the Gemini 1.5 Pro and Gemini 1.5 Flash models are not available in projects that have never used them, including new projects. For details, see Model versions and lifecycle.
# Convert speech to text

Last updated 2025-09-04 UTC.

This page shows you how to use Vertex AI Studio to convert speech to text.

To learn how to convert text to speech, see
[Convert text to speech](/vertex-ai/generative-ai/docs/speech/text-to-speech).

Convert speech to text
----------------------

To convert speech to text, do the following:

1. In the Vertex AI section of the Google Cloud console, go to
   the **Vertex AI Studio** page.

   [Go to Vertex AI Studio](https://console.cloud.google.com/vertex-ai/studio/overview)

2. Click **Generate speech**.

3. Select the **Speech-to-text** tab.

4. In **Speech**, click **Browse** to select the audio file that you want to
   convert to text.

5. In the **Language** selector box, select the language of the speech in the
   audio file.

6. Click **Submit**.

   The converted text appears in **Text**.

Limitations
-----------

- Audio files can be at most 60 seconds long or 10 MB in size, whichever limit is reached first.
- Files are transcribed with the [Chirp](https://cloud.google.com/speech-to-text/v2/docs/usm/usm-model) model.
- Only 16-bit linear PCM WAV files are supported.

To work around these limitations, use the [Speech-to-Text UI](/speech-to-text/docs/transcribe-console) directly.

What's next
-----------

- For more models, advanced features, and the ability to transcribe files up to 8 hours long, see [Speech-to-Text](/speech-to-text/docs/transcribe-console).
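
Because the limitations listed on this page (60 seconds, 10 MB, 16-bit linear PCM WAV) are easy to trip over, it can help to check a file locally before uploading it. The following is a minimal sketch that uses only Python's standard `wave` and `os` modules. The function name `check_wav` and the constants are hypothetical and are not part of Vertex AI Studio, and reading "10 MB" as 10,000,000 bytes is an assumption, because the page does not say how the size is measured.

```python
import os
import wave

# Limits taken from the "Limitations" list on this page.
MAX_SECONDS = 60
# Assumption: "10 MB" is read as 10,000,000 bytes (the stricter interpretation).
MAX_BYTES = 10_000_000


def check_wav(path):
    """Return a list of problems found; an empty list means all checks passed.

    Hypothetical helper for illustration only. It does not call any Google API.
    """
    problems = []

    if os.path.getsize(path) > MAX_BYTES:
        problems.append("file is larger than 10 MB")

    try:
        # The standard wave module only opens uncompressed PCM WAV files,
        # so a successful open already implies linear PCM.
        with wave.open(path, "rb") as wav:
            if wav.getsampwidth() != 2:  # 2 bytes per sample = 16-bit
                problems.append("samples are not 16-bit")
            rate = wav.getframerate()
            if rate > 0:
                duration = wav.getnframes() / rate
                if duration > MAX_SECONDS:
                    problems.append("audio is longer than 60 seconds")
            else:
                problems.append("sample rate is missing or zero")
    except (wave.Error, EOFError) as exc:
        problems.append(f"not a readable PCM WAV file: {exc}")

    return problems
```

For example, `check_wav("speech.wav")` (with a hypothetical file name) returns an empty list when the file passes all three checks. A file that fails any check is a candidate for the Speech-to-Text UI instead.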