Ray tracing on AMD RDNA 4 will be twice as fast thanks to BVH8

Leaks covering PS5 Pro reveal new features of Radeon RX 8000 GPUs

A few days ago, information leaked on Sony’s upcoming PlayStation 5 Pro console, which will feature upgraded hardware with a significantly stronger GPU and AI acceleration to enable AI-based PSSR upscaling. Other details have now emerged, however, with implications for the upcoming Radeon RX 8000 graphics, as they talk about the capabilities of RDNA 4 architecture’s GPUs in ray tracing. They are set to improve by quite a bit.

Previous reports have stated that Sony expects the PS5 Pro to have 2–3× better ray tracing performance (in some cases up to 4×), which will be a bigger increase than the console is expected to have in general. Insider Gaming has now added more information, suggesting that the increased ray tracing performance is clearly a feature of the RDNA 4 architecture. The PS5 Pro’s GPU is supposed to be a hybrid of RDNA 3 and RDNA 4, but the raytracing units are apparently pulled from RDNA 4.

BVH8

This APU (or rather its GPU part) is supposed to use special “BHV8 traversal” shaders for raytracing graphics, which should be brand new feature. RDNA 2 and RDNA 3 use BVH4 shaders. This should mean processing eight levels (boxes) within the BVH (bounding volume box hierarchy) at once in the ray accelerator of the RDNA 4 architecture. So the GPU should have twice as much performance as RDNA 2 or RDNA 3 when performing BVH analysis. Those architectures can only do four boxes at once.

It is not yet clear whether the performance of ray tracing will also be improved when performing the analysis of intersection of the rays with the object’s triangles, which is a subsequent part of the process. For RDNA 2 and 3, the ray accelerator can do four boxes and one triangle per cycle, so for RDNA 4 it could either be 8 boxes and 1 triangle, or also be doubled to 8 boxes and two triangles. In this way, RDNA 4 could approach level of per Ray Accelerator/CU ray tracing performance that is similar to the per RT core/SM block performance of Nvidia GPUs.

For comparison, Nvidia GPUs should also be four boxes and one triangle per cycle for RT cores in the first generation Turing architecture (RTX 2000), and four boxes and two triangles in the second generation Ampere architecture (RTX 3000). Ada Lovelace should be able to do four boxes and four triangles per cycle (so the ratio between box and triangle performance will vary from Radeon’s ratio).

AMD’s Navi 31 5nm/6nm chiplet GPU (source: AMD, via: ComputerBase)

AI upscaling also on Radeons?

According to Kepler_L2, by the way, the AI acceleration in the PS5 Pro console could also be using the RDNA 4 architecture GPU units, it should be using the WMMA operations (Wave Matrix Multiply-Accumulate). This could mean that standalone Radeon graphics cards with RDNA 4 architecture could theoretically have the same capability, and AMD could bring similar AI upscaling to gaming PCs with Radeon graphics cards – perhaps in combination with DirectSR technology so that games don’t have to separately support fall-back standard version of FSR, which works with all GPUs, in addition to the new upscaling using the AI units of newer cards.

Read More: DirectSR unifies DLSS, XeSS and FSR, end of incompatible games

Information that AMD is planning AI-assisted upscaling has surfaced recently, but it hasn’t been very specific. In theory, it could possibly work with RDNA 3 based graphics cards as well, which also already have some support for WMMA operations.

Read more: AMD CTO hints FSR4 will bring AI-based upscaling

The CPU part of the PlayStation 5 Pro won’t get much better, it has something similar to SmartShift

We’ve mainly focused on RDNA 4 information here, but Insider-Gaming also talked about the CPU performance of the PlayStation 5 Pro. The latter won’t undergo as much of an upgrade as the graphics. It’ll still rely on eight Zen 2 (or Zen 2 Lite) cores, and will only bump up the clock speed on them. The cores will run at up to 3.85 GHz, which wouldn’t be exactly great in a gaming PC anymore. However, in consoles, where games are more custom-optimised, it will have to do.

This higher clock speed will not be used by default. In base mode, the PS5 Pro should probably use the same 3.5 GHz clock speed as the regular PS5. However, a game with explicit support for the PS5 Pro will be able to request “High Frequency Mode”, which will mean clocking the Zen 2 cores to 3.85 GHz (+10%). However, this mode will increase the CPU core power consumption and the SoC will underclock the GPU a bit to accommodate this, so that the overall consumption doesn’t go too high. The GPU clock speed will be reduced by 1.5% during this mode, which should impair minimal performance (around 1%) hit, but the cost will be there, which is why the higher CPU clock speed is not used all the time. Games that would need more CPU performance, however, will have this option. This is basically similar to AMD’s SmartShift technology from laptops.

Otherwise, the clock speed of the ACV audio coprocessor is also expected to be higher, increasing the performance available for audio effects by 35%. Also confirmed is an increase in memory speed, which will be at 18.0GHz (effective clock), with a bandwidth of 576 GB/s (+28% vs. the regular PS5). Capacity will remain at 16GB, and the console’s SSD is expected to be 1TB, when the console comes out in the fall of 2024.

Sources: Insider Gaming, VideoCardz, Kepler_L2 (1, 2)

English translation and edit by Jozef Dudáš


  •  
  •  
  •  
Flattr this!

AMD to finally add more cores with Zen 6, up to 24 in desktop?

It feels like the competitive battle between AMD and Intel has somewhat extended into the “future dimension” of internet leaks talking about hardware that doesn’t yet exist. Recently we’ve seen news of the upcoming Nova Lake processors said to have up to 52 cores, and then about the Core Ultra 300 slated for this year. And as if on cue, “counter” leak about AMD Zen 6 processors that will face these future Intel CPUs has surfaced. Read more “AMD to finally add more cores with Zen 6, up to 24 in desktop?” »

  •  
  •  
  •  

Radeon RX 7900 XTX Gaming OC: Gigabyte’s Base Model

The Radeon RX 7900 XTX Gaming OC from Gigabyte is among the more affordable models which are rarely reviewed. Manufacturers lower production costs mainly by reducing the size of the heatsink. As a result, these cards don’t perform as well as premium high-end models in thermal and acoustic tests. However, their lower price makes them popular not only with individual buyers but also OEMs assembling prebuilt PCs. Read more “Radeon RX 7900 XTX Gaming OC: Gigabyte’s Base Model” »

  •  
  •  
  •  

RDNA 3 refresh? Affordable Radeon RX 7650 GRE launched in China

Instead of the much awaited RDNA4 GPUs, AMD has introduced a new graphics card belonging to the RX 7000 generation. However, this card is likely to remain relevant even after the new generation arrives, serving as a more affordable option below the RX 9000 series. For now, this model is Chinese market exclusive, but that might change – the RX 7900 GRE, which was initially a China-exclusive, eventually made its way to other markets as well. Read more “RDNA 3 refresh? Affordable Radeon RX 7650 GRE launched in China” »

  •  
  •  
  •  

Leave a Reply

Your email address will not be published. Required fields are marked *