CVE-2026-24197: NVIDIA Display Driver for Linux DoS Flaw

CVE-2026-24197 Overview

CVE-2026-24197 affects the NVIDIA Display Driver for Linux. The vulnerability resides in the Multi-Instance GPU (MIG) partition management code. Insecure default initialization of memory subsystem routing resources can cause data corruption or a hang when partitions are reconfigured. A local authenticated attacker with low privileges can trigger the condition and induce a denial of service against the GPU subsystem.

The issue maps to CWE-1188: Insecure Default Initialization of Resource. Because the scope is changed during exploitation, impact extends beyond the attacker's own context to other tenants sharing the GPU.

Critical Impact
A local low-privileged user can corrupt GPU memory routing state or hang the device during MIG reconfiguration, disrupting workloads running on shared partitions.

Affected Products

NVIDIA Display Driver for Linux (MIG-capable GPUs)
Systems using Multi-Instance GPU partitioning (e.g., A100, H100 class data center GPUs)
Refer to the NVIDIA Support Response for the full list of affected branches

Discovery Timeline

2026-05-26 - CVE-2026-24197 published to NVD
2026-05-26 - Last updated in NVD database

Technical Details for CVE-2026-24197

Vulnerability Analysis

The NVIDIA Display Driver for Linux supports Multi-Instance GPU (MIG), which partitions a single physical GPU into isolated GPU instances. Each instance receives dedicated compute, memory, and memory subsystem routing resources. The driver is responsible for initializing these routing resources whenever a partition is created, destroyed, or reconfigured.

In the affected driver versions, the MIG partition management path uses an insecure default state for memory subsystem routing resources. When a local user with permission to interact with the GPU triggers partition reconfiguration, the driver consumes this default state without properly reinitializing it. The resulting routing configuration is inconsistent with the new partition topology.

This inconsistency can corrupt in-flight memory transactions or place the memory subsystem into a state that blocks further progress, hanging the GPU. The Scope:Changed CVSS metric reflects that the failure crosses the MIG isolation boundary, affecting workloads in other partitions on the same device.

Root Cause

The root cause is improper default initialization of memory subsystem routing resources during MIG partition reconfiguration [CWE-1188]. The driver assumes a safe initial state for routing structures that is not enforced before the reconfiguration sequence executes. Reconfiguration operations therefore inherit stale or unsafe defaults.

Attack Vector

Exploitation requires local access with low privileges on a system running the vulnerable Linux driver on a MIG-capable GPU. The attacker invokes MIG management operations through the NVIDIA driver interface, typically via tooling such as nvidia-smi mig or the NVML API, and requests partition reconfiguration. No user interaction is required. The result is data corruption or a GPU hang affecting all tenants on the device.

No public proof-of-concept exploit is available, and the issue is not listed in the CISA Known Exploited Vulnerabilities catalog. Verified exploitation code is not published, so technical specifics beyond the vendor advisory are not detailed here. See the NVIDIA Support Response for vendor guidance.

Detection Methods for CVE-2026-24197

Indicators of Compromise

GPU hangs or unresponsive CUDA contexts coinciding with MIG reconfiguration events
Kernel log entries from the nvidia module reporting Xid errors, MIG state transitions, or memory subsystem faults
Unexpected nvidia-smi mig -cgi, -dgi, or -cci invocations from non-administrative users
Workload failures across multiple MIG instances on the same physical GPU within a short window

Detection Strategies

Audit auditd or eBPF telemetry for invocations of nvidia-smi mig and NVML partition management calls by non-privileged accounts
Correlate dmesg output for NVRM, Xid, and MIG reconfiguration messages with workload failures
Alert on driver-reported memory subsystem errors observed through DCGM (Data Center GPU Manager) metrics

Monitoring Recommendations

Stream GPU telemetry from DCGM and nvidia-smi into a centralized data lake for cross-host correlation
Track MIG configuration changes as a high-value event class, with alerts on changes outside scheduled maintenance windows
Monitor for repeated GPU resets or driver reloads, which indicate hang recovery activity

How to Mitigate CVE-2026-24197

Immediate Actions Required

Apply the fixed NVIDIA Linux display driver version listed in the NVIDIA Support Response
Restrict access to MIG management operations to administrators only on multi-tenant GPU hosts
Inventory MIG-capable GPUs and verify driver versions across Kubernetes GPU nodes and bare-metal systems

Patch Information

NVIDIA has published fixed driver versions through its security bulletin. Consult the NVIDIA Support Response and the CVE.org Record for CVE-2026-24197 for the specific branch versions that address the issue. Update both the host driver and any container runtime images that ship NVIDIA user-mode components.

Workarounds

Disable MIG mode where partitioning is not required, using nvidia-smi -mig 0
Limit MIG reconfiguration to scheduled maintenance windows performed by administrators
Use Linux Discretionary Access Control and cgroup device controls to prevent untrusted users from issuing MIG management commands

bash

# Verify installed NVIDIA driver version
nvidia-smi --query-gpu=driver_version --format=csv,noheader

# List MIG-capable devices and current MIG mode
nvidia-smi -L
nvidia-smi --query-gpu=mig.mode.current --format=csv

# Disable MIG mode on GPU 0 as a temporary mitigation
sudo nvidia-smi -i 0 -mig 0