Genera segnalazione di bug NVIDIA per le GPU Blackwell
Mantieni tutto organizzato con le raccolte
Salva e classifica i contenuti in base alle tue preferenze.
Questo documento spiega come creare un report sui bug di NVIDIA per i tipi di macchina
che utilizzano le GPU NVIDIA Blackwell.
Se vuoi identificare quali dei tuoi tipi di macchine utilizzano la GPU NVIDIA Blackwell, consulta
Modelli di GPU.
Se il tuo tipo di macchina utilizza l'architettura GPU NVIDIA Blackwell, il report
sui bug NVIDIA generato non include dati hardware di basso livello critici quando esegui
lo script nvidia-bug-report.sh. Questi dati contengono informazioni come lo stato del livello fisico delle connessioni NVLink, i valori dei registri interni della GPU e i segmenti diagnostici non elaborati del firmware. Queste informazioni sono essenziali per diagnosticare i problemi, in particolare quelli
relativi a NVLink, che potrebbero causare
errori Xid della GPU
o un peggioramento irrisolto del rendimento.
Generare una segnalazione di bug NVIDIA
Per generare una segnalazione di bug, completa i seguenti passaggi:
Connettiti all'istanza GPU. Scegli una delle seguenti opzioni:
[[["Facile da capire","easyToUnderstand","thumb-up"],["Il problema è stato risolto","solvedMyProblem","thumb-up"],["Altra","otherUp","thumb-up"]],[["Difficile da capire","hardToUnderstand","thumb-down"],["Informazioni o codice di esempio errati","incorrectInformationOrSampleCode","thumb-down"],["Mancano le informazioni o gli esempi di cui ho bisogno","missingTheInformationSamplesINeed","thumb-down"],["Problema di traduzione","translationIssue","thumb-down"],["Altra","otherDown","thumb-down"]],["Ultimo aggiornamento 2025-09-03 UTC."],[],[],null,["# Generate NVIDIA bug report for Blackwell GPUs\n\n*** ** * ** ***\n\nThis document explains how to create an NVIDIA bug report for your machine types\nthat use [NVIDIA Blackwell GPUs](https://www.nvidia.com/en-us/data-center/technologies/blackwell-architecture/).\nIf you want to identify which of your machine types are using the NVIDIA Blackwell GPU, see\n[GPU models](/compute/docs/gpus#gpu-models).\n\nIf your machine type uses the NVIDIA Blackwell GPU architecture, the generated\nNVIDIA bug report doesn't include critical low-level hardware data when you run\nthe [nvidia-bug-report.sh](https://docs.nvidia.com/deploy/rma-process/index.html#topic_3_1)\nscript. This data contains information such as the physical layer status of NVLink\nconnections, internal GPU register values, and raw diagnostic segments from the\nfirmware. This information is essential for diagnosing issues, especially those\nrelated to NVLink, which could lead to\n[GPU Xid errors](https://docs.nvidia.com/deploy/xid-errors/index.html#xid-error-listing)\nor unresolved performance degradation.\n\nGenerate an NVIDIA bug report\n-----------------------------\n\nTo generate a bug report, complete the following steps:\n\n1. Connect to your GPU instance. Choose one of the following options:\n\n - [Connect to Linux instances](/compute/docs/connect/standard-ssh)\n - [Connect to Windows instances](/compute/docs/instances/connecting-to-windows)\n2. Download and install the MFT package by selecting one of the following options:\n\n### Container-Optimized OS\n\n\nIf your instance uses a Container-Optimized OS (COS) as the\nguest operating system, use the open source [GCE COS NVIDIA Bug Report Collector](https://github.com/GoogleCloudPlatform/cluster-toolkit/tree/main/community/gce-cos-nvidia-bug-report)\ntool to generate the bug report with MFT. This tool automatically\ninjects supported MST kernel modules that match the COS kernel,\ninstalls the userspace tool, generates the bug report, and\noptionally uploads the result to a Cloud Storage bucket.\n\n### Other OS\n\nFor other Linux OSes, complete the following steps:\n\n1. Download and install NVIDIA Firmware Tools (MFT) software version 4.32.0 or higher from the [NVIDIA website](https://network.nvidia.com/products/adapter-software/firmware-tools/).\n2. Install the tool. For more information, see [Compilation and installation](https://docs.nvidia.com/networking/display/mftv4320/compilation+and+installation) in the NVIDIA Firmware Tools (MFT) Documentation. After you install MFT, the [nvidia-bug-report.sh](https://docs.nvidia.com/deploy/rma-process/index.html#topic_3_1) script automatically uses the MFT tools to generate the report. You don't need to interact with MFT tools directly.\n3. Run the `nvidia-bug-report.sh` script to generate a bug report. This process takes about two minutes.\n4. Extract the report.\n5. Verify that the report includes MFT data by running the following command on your extracted bug report file: \n\n ```\n grep -m 1 -A 30 \"Starting GPU MST dump..\" PATH_TO_UNZIPPED_BUG_REPORT\n ```\n\n The output is similar to the following example: \n\n ```text\n Starting GPU MST dump..\n ... (additional MFT data) ...\n \n ```"]]