As of April 2026, NVIDIA has solidified its ecosystem, transitioning from the initial August 2025 launch of version 13.0 to the current deployment of
For Data Center Operators: MANDATORY if you use MIG. The stability fix outweighs the 3% performance hit you will take in HPC sims. cuda driver release news exclusive
Why this matters:
Previous drivers treated a kernel launch as a monolithic block. If a high-priority AI inference task arrived while a graphics or compute kernel was running, latency spiked. R570 introduces per-warp priority queues. Early benchmarks show a 40% reduction in tail latency for real-time LLM token generation when the GPU is also handling background compute. As of April 2026, NVIDIA has solidified its
The phased rollout is intentional. NVIDIA expects early bugs in the BME scheduler and UVM 2.5 prefetcher. They are letting AI labs and HPC centers test first before pushing to gamers. If a high-priority AI inference task arrived while