[[["容易理解","easyToUnderstand","thumb-up"],["確實解決了我的問題","solvedMyProblem","thumb-up"],["其他","otherUp","thumb-up"]],[["難以理解","hardToUnderstand","thumb-down"],["資訊或程式碼範例有誤","incorrectInformationOrSampleCode","thumb-down"],["缺少我需要的資訊/範例","missingTheInformationSamplesINeed","thumb-down"],["翻譯問題","translationIssue","thumb-down"],["其他","otherDown","thumb-down"]],["上次更新時間:2025-09-04 (世界標準時間)。"],[],[],null,["# Run LLM inference on Cloud Run GPUs with Hugging Face Transformers.js\n\nThe following codelab shows how to run a backend service that runs the [Transformers.js package](https://www.npmjs.com/package/@huggingface/transformers). The Transformers.js package is functionally equivalent to the [Hugging Face transformers python library](https://github.com/huggingface/transformers) together with Google's [Gemma 2](https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma/) model.\n\nSee the entire codelab at [How to Run Transformers.js on Cloud Run GPUs](https://codelabs.developers.google.com/codelabs/how-to-use-transformers-js-cloud-run-gpu#0)."]]