Cooperative Vectors in DirectX to use Blackwell Neural Shaders

Ability to integrate neural networks into shaders coming to DirectX

Nvidia recently talked about new features for GeForce graphics cards, primarily the RTX Remix modding platform leaving beta and the first games using Nvidia ACE. The company has another announcement: Neural Shaders, one of the architectural innovations in Blackwell GPUs, will be coming to DirectX. Microsoft is adding a feature called Cooperative Vectors to this API, and the GeForce RTX 5000 series will support it precisely through Neural Shaders.

Neural Shaders are one of the new features introduced by the Blackwell GPU architecture. They consist of closer integration between tensor cores (Nvidia’s AI accelerators in GPUs) and general-purpose compute units (“shaders”). On older Nvidia GPUs, code that used the general shader units and AI acceleration on tensor cores at the same time was difficult to write or performed worse. Blackwell makes this combination practical.

Nvidia refers to this capability as “Neural Shaders.” Their application allows integration of a simple neural network into a shader running on a GPU, which can replace certain conventional algorithms through AI inference. Nvidia proposes using such small integrated AI models for tasks like simulating complex materials (such as skin with subsurface scattering effects), calling this Neural Materials, or for illumination simulation (Neural Radiance Cache).


Cooperative Vectors

Similar techniques can be implemented in DirectX through a technology Microsoft has named Cooperative Vectors. However, it is not the same thing as Nvidia’s Neural Shaders. Cooperative Vectors is a software feature exposed by DirectX (and presumably there will be some equivalent in the Vulkan API as well), while Neural Shaders is the hardware feature that enables Blackwell GPUs to support it.

Neural Materials in the Nvidia Blackwell GPU architecture presentation

Cooperative Vectors are designed to allow the integration of AI components into traditional graphics pipelines (enabling “neural rendering techniques”), but with cross-platform support. This means these techniques should work across GPUs from different manufacturers, and Microsoft likely intends this to include gaming consoles (or more precisely Xbox, for which DirectX is relevant).

Cooperative Vectors enable the use of matrix compute operations with vectors of arbitrary length within shaders. Matrix operations are precisely what AI acceleration relies on. Through Cooperative Vectors, AI inference can be integrated directly into, for example, a pixel shader. There’s no need to switch the GPU into any special AI acceleration mode. The shader runs on the GPU as usual and can execute alongside other standard graphics operations. Access to AI functions in this form should be significantly simpler than the options available before it (though this approach will likely be used for smaller, simpler AI models rather than “LLMs”).
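To make the "matrix operations on vectors inside a shader" idea concrete, here is a minimal Python sketch of the arithmetic a small per-pixel network performs: each layer is a matrix-vector multiply plus a bias, followed by an activation. This is not the HLSL Cooperative Vectors API, which the article does not describe. The function names, the layer sizes, and the weights below are all hypothetical and chosen only for illustration.

```python
# Illustration only: the math of a tiny fully connected network evaluated
# once per pixel. All names and weights are hypothetical; a real shader
# would use trained weights and the GPU's matrix hardware.

def mat_vec_mul_add(weights, vec, bias):
    """Return weights * vec + bias for a matrix given as a list of rows."""
    return [
        sum(w * x for w, x in zip(row, vec)) + b
        for row, b in zip(weights, bias)
    ]

def relu(vec):
    """Clamp negative values to zero (a common activation function)."""
    return [max(0.0, v) for v in vec]

def tiny_mlp(inputs, layers):
    """Run the input vector through each layer; ReLU on all but the last."""
    activations = inputs
    for i, (weights, bias) in enumerate(layers):
        activations = mat_vec_mul_add(weights, activations, bias)
        if i < len(layers) - 1:
            activations = relu(activations)
    return activations

# Hypothetical 2 -> 3 -> 2 network (made-up weights, not a trained model).
LAYERS = [
    ([[1.0, -1.0], [0.5, 0.5], [-1.0, 1.0]], [0.0, 0.0, 0.0]),
    ([[1.0, 0.0, 1.0], [0.0, 2.0, 0.0]], [0.1, -0.1]),
]

def shade_pixel(u, v):
    """Stand-in for a pixel shader body: map two inputs to two outputs."""
    return tiny_mlp([u, v], LAYERS)
```

Calling `shade_pixel(u, v)` for every pixel mirrors how a shader with an embedded network would run: the same small sequence of matrix-vector products executes for each invocation, which is the kind of workload matrix hardware such as tensor cores is built to accelerate.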

Support expected from all GPU manufacturers

Cooperative Vectors are reportedly getting support not just on Nvidia GPUs, but also on AMD, Intel, and Qualcomm GPUs (the latter meaning the Windows on ARM platform). Current reports suggest Nvidia’s support begins with the GeForce RTX 5000 series featuring the Blackwell architecture, implying that older GeForce GPUs won’t support this feature, or at least not fully. According to Nvidia’s explanation, the company’s older GPUs require using CUDA or Compute Shader modes to utilize tensor cores, while Blackwell allows tensor cores to be accessed directly from pixel shaders.

It’s possible this isn’t strictly impossible on older GPUs. Rather, the integration between shader programs and AI inference on tensor cores might be inefficient or come with performance penalties (similar to how asynchronous shaders could run on older GPUs like Nvidia Maxwell but brought no benefit).

For competing GPUs, we don’t yet know which generations or architectures will offer Cooperative Vectors support. Intel’s GPUs feature specialized XMX units, though it hasn’t been clarified whether these allow integration of AI acceleration in shader code. However, as mentioned, some Arc GPUs will support it.

For AMD graphics (RDNA 3 and RDNA 4), AI acceleration is closely tied to shader units since both are handled by the same hardware (AI acceleration runs via WMMA instructions). These GPUs might therefore have certain prerequisites for supporting Cooperative Vectors, though official confirmation is still pending.

Support for Cooperative Vectors is scheduled to appear in the DirectX SDK this April, as a “preview” version. It will be part of HLSL, the language for writing shaders in DirectX. For now, this support is mainly relevant for game developers rather than end users. It will take some time before neural rendering techniques utilizing these GPU capabilities ship in generally available games.

Sources: Microsoft, Nvidia

English translation and edit by Jozef Dudáš

