* The AMD RDNA 4 Rumour Mill *

Grim5 · 16 Jul 2024 at 11:50

humbug said:
Why would it have 800 Watts power draw? The 7900 XTX is less than half of that. If the 5090 is 50% faster than the 4090 no one is going to say that's impossible it would have 700 watts power draw.

The rumour is it got cancelled because it would be too expensive.

Too expensive? Sounds like bs. People have been paying $2k for a 4090

Man-In-A-Teapot · 16 Jul 2024 at 12:27

"Unfortunately, though, we’re not expecting to see such a GPU or card ever appear, as it’s rumored that AMD hasn’t been able to make its new chiplet technology scale to this level."

So it's not being released because AMD tried to make it but haven't been able to get it to work yet; so price is not the reason (or at least not the sole one, if it could have been made to work by spending more on it than they would get back).

brostradamus · 16 Jul 2024 at 12:35

i reckon amd too will eventually embrace a superscalar architecture like nvidia, with all basic functional units bunched together. it's been a great success for nvidia: not only have they utterly destroyed amd in raw performance, they are currently 1.5 generations ahead of them in efficiency

humbug · 16 Jul 2024 at 12:53

brostradamus said:
i reckon amd too will be eventually embracing a superscalar architecture like nvidia, all basic functional units bunched together, its been a great success for nvidia, not only have they utterly destroyed amd in raw performance but are currently 1.5 generations ahead of them in efficiency

Superscalar simply means executing multiple instructions per clock cycle, or what we now call and measure as IPC. This was cracked in the 1970s along with the move away from RISC to programmable logic.
For decades everything has been superscalar.
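
As a rough illustration of the "multiple instructions per clock" point, here is a minimal Python sketch. It assumes an idealised machine where every instruction is independent and takes one cycle; `ideal_ipc` and `issue_width` are made-up names for this example, not anything from a real toolchain.

```python
import math


def ideal_ipc(num_instructions, issue_width):
    """IPC on an idealised pipeline (assumption: no dependencies, no stalls).

    issue_width is how many instructions can start per clock cycle:
    1 models a scalar pipeline, anything above 1 a superscalar one.
    """
    cycles = math.ceil(num_instructions / issue_width)
    return num_instructions / cycles


if __name__ == "__main__":
    for width in (1, 2, 4):
        print(f"issue width {width}: IPC = {ideal_ipc(8, width):.2f}")
```

Real chips land well below these figures because of dependencies, cache misses and branch mispredictions, so treat this as the upper bound only.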

Also, the 7900 XTX despite being on 5nm vs the 4090 4nm is 85% the performance at 85% the power consumption.

brostradamus · 16 Jul 2024 at 13:19

humbug said:
Superscalar simply means executing multiple instructions per clock cycle, or what we now call and measure as IPC. This was cracked in the 1970s along with the move away from RISC to programmable logic.
For decades everything has been superscalar.

amd has vector units, nvidia has scalar bunched up functional units called cuda/tensor/rt cores etc... amd has to package instructions using a more complicated scheduler to maintain high utilisation of their vector units, nvidia just has to segregate instructions by the type of operation and then push it in the pipeline..

a very simplistic way to describe both architectures could be in terms of, let's say, a visa application:
amd: runs a single window to process a single visa application. documents A, B, C of an application are all processed by a single window per applicant
nvidia: maintains separate dedicated windows for processing documents A, B, C. everyone submits their documents in a common queue; the documents are then separated and reorganized into smaller downstream queues specific to a given document type (separate queues for A, B, C and so on)
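
The visa analogy can be sketched in a few lines of Python. This only models the queues in the analogy, not real GPU hardware, and the function names `single_window` and `dedicated_windows` are hypothetical, chosen for this example.

```python
from collections import defaultdict, deque


def single_window(applicants):
    """One window handles every document of an applicant, one applicant at a time."""
    processed = []
    for name, docs in applicants:
        for doc in docs:
            processed.append((name, doc))
    return processed


def dedicated_windows(applicants):
    """Everyone joins a common queue; documents are then routed by type."""
    common = deque((name, doc) for name, docs in applicants for doc in docs)
    per_type = defaultdict(deque)  # one downstream queue per document type
    while common:
        name, doc = common.popleft()
        per_type[doc].append(name)
    return {doc: list(names) for doc, names in per_type.items()}


if __name__ == "__main__":
    applicants = [("ann", ["A", "B"]), ("bob", ["A", "C"])]
    print(single_window(applicants))
    print(dedicated_windows(applicants))
```

In the first version the work stays grouped by applicant; in the second it ends up grouped by document type, which is the whole difference the analogy is pointing at.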

humbug said:
Also, the 7900 XTX despite being on 5nm vs the 4090 4nm is 85% the performance at 85% the power consumption.

perf numbers below for timespy extreme
rtx 4090: 19261
7900 xtx: 14688
and rtx 4090 doesnt touch max rated TDPs

https://www.kitguru.net/wp-content/uploads/2022/12/3D-TSE-3.png

and the performance gulf will be even greater for RT apps

q974739 · 16 Jul 2024 at 13:27

Grim5 said:
Too expensive? Sounds like bs. People have been paying $2k for a 4090

People have been paying that for an Nvidia card. AMD might have thought they wouldn't pay that for their card. "Not worth it without the DLSS", etc.

humbug · 16 Jul 2024 at 13:29

brostradamus said:
amd has vector units, nvidia has scalar bunched up functional units called cuda/tensor/rt cores etc... amd has to package instructions using a more complicated scheduler to maintain high utilisation of their vector units, nvidia just has to segregate instructions by the type of operation and then push it in the pipeline..

a very simplistic way to describe both architectures could be in terms of lets say visa application:
amd: runs a single window to process a single visa application. to process an application documents A, B, C will be processed by a single window per applicant
nvidia: maintains separate dedicated windows for processing documents A, B, C.. everybody, submits their documents in a common queue, the documents are then separated and processed by dedicated windows which specialise in processing a specific document

perf numbers below for timespy extreme
rtx 4090: 19261
7900 xtx: 14688
and rtx 4090 doesnt touch max rated TDPs

https://www.kitguru.net/wp-content/uploads/2022/12/3D-TSE-3.png

and the performance gulf will be even greater for RT apps

None of this has anything to do with Superscaling.

FP8.

H100: 3.96 PetaFLOPS
MI300X: 5.23 PetaFLOPS

You're cherry picking a synthetic benchmark.

Multiple games, overall performance at 4K.

RX 7900 XTX: 100%
RTX 4090 : 122%

Power consumption in gaming.

7900 XTX: 356 watts
RTX 4090: 411 watts (115%)

They are near enough the same performance per watt, again despite the 4090 being on a better lithography node: TSMC 4nm vs TSMC 5nm.
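
For anyone who wants to check the performance-per-watt arithmetic, here is a minimal Python sketch using only the relative 4K performance and gaming power figures quoted in this post. The helper name `perf_per_watt` is made up for this example.

```python
def perf_per_watt(relative_perf, watts):
    """Relative performance points delivered per watt of gaming power draw."""
    return relative_perf / watts


# Figures as quoted in the post (relative 4K performance, gaming power draw).
xtx = perf_per_watt(100, 356)  # RX 7900 XTX
rtx = perf_per_watt(122, 411)  # RTX 4090

# Above 1.0 means the 4090 delivers more performance per watt.
efficiency_ratio = rtx / xtx

if __name__ == "__main__":
    print(f"7900 XTX: {xtx:.3f} points/W")
    print(f"RTX 4090: {rtx:.3f} points/W")
    print(f"4090 vs XTX efficiency ratio: {efficiency_ratio:.3f}")
```

On these numbers the ratio works out to roughly 1.06, so the 4090 comes out about 6% ahead in performance per watt.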

AMD Radeon RX 7900 XTX Review - Disrupting the GeForce RTX 4080

Navi 31 is here! The new $999 Radeon RX 7900 XTX in this review is AMD's new flagship card based on the wonderful chiplet technology that made the Ryzen Effect possible. In our testing we can confirm that the new RX 7900 XTX is indeed faster than the GeForce RTX 4080, but only with RT disabled.

www.techpowerup.com

brostradamus · 16 Jul 2024 at 14:00

humbug said:
None of this has anything to do with Superscaling.

that's how scalar-ness is interpreted: nvidia has a scalar architecture, amd uses a vector architecture, in a relative sense

humbug said:
H100: 3.96 PetaFLOPS
MI300X: 5.23 PetaFLOPS

this is misleading because:
a. these are theoretical max values, only works for ideal load
b. you havent mentioned the data type which makes those numbers meaningless (fp64/fp32/fp16/tf32/int8.. what?)
c. tdp's are missing: mi300x 750 watt, h100 350w

NVIDIA H100 PCIe 80 GB Specs

NVIDIA GH100, 1755 MHz, 14592 Cores, 456 TMUs, 24 ROPs, 81920 MB HBM2e, 1593 MHz, 5120 bit

www.techpowerup.com

AMD Radeon Instinct MI300X Specs

AMD Aqua Vanjaram, 2100 MHz, 19456 Cores, 1216 TMUs, 0 ROPs, 196608 MB HBM3, 2525 MHz, 8192 bit

www.techpowerup.com

oh yeah and they are not from the same generation either going by launch dates, maybe you are unaware but nvidia has already launched b200

humbug said:
You're cherry picking a synthetic benchmark.

it's the de facto dx12 benchmark
and it's hypocritical at the same time, because you have conveniently skipped RT benchmarks
we are talking about the full extent of capabilities of both chips, so it doesn't make much sense to skip RT performance

here's data on port royal:
7900 xtx: 15793
rtx 4090: 25692 (+63%)
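
a quick sanity check of the percentage leads, as a minimal python sketch. it uses only the time spy extreme and port royal scores quoted in this thread; `lead_percent` is a made-up helper name for this example.

```python
def lead_percent(score_a, score_b):
    """How far score_a is ahead of score_b, in percent."""
    return (score_a / score_b - 1.0) * 100.0


# Scores as quoted in the thread: (rtx 4090, 7900 xtx).
scores = {
    "time spy extreme": (19261, 14688),
    "port royal": (25692, 15793),
}

if __name__ == "__main__":
    for bench, (rtx_4090, rx_7900_xtx) in scores.items():
        lead = lead_percent(rtx_4090, rx_7900_xtx)
        print(f"{bench}: rtx 4090 ahead by {lead:.1f}%")
```

that gives roughly +31% in time spy extreme and +63% in port royal, so the gap about doubles once RT is in play.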

AMD Radeon RX 7900 XT & 7900 XTX Review - Page 8

AMD is finally ready to throw some fresh hardware into the endless battle between Nvidia and AMD for gaming dominance, sure Intel is part of the rumble these days too, but it’s fair to say they’ve got…

www.eteknix.com

humbug said:
TSMC 4nm vs TSMC 5nm.

and tsmc 4n is just a custom 5nm node for nvidia, as has been reported by popular press outlets

B@Th*nG · 16 Jul 2024 at 14:34

short of reading all this is there any actual solid specs yet?

humbug · 16 Jul 2024 at 14:42

B@Th*nG said:
short of reading all this is there any actual solid specs yet?

No.... wont be for a while yet.

q974739 · 16 Jul 2024 at 17:42

B@Th*nG said:
short of reading all this is there any actual solid specs yet?

I'd take a date. I keep popping in here, hoping for an update. A timeframe. Something.

Dicehunter · 16 Jul 2024 at 18:30

brostradamus said:
thats how scalar-ness is interpreted, nvidia has a scalar architecture, amd uses a vector architecture, in a relative sense

this is misleading because:
a. these are theoretical max values, only works for ideal load
b. you havent mentioned the data type which makes those numbers meaningless (fp64/fp32/fp16/tf32/int8.. what?)
c. tdp's are missing: mi300x 750 watt, h100 350w

NVIDIA H100 PCIe 80 GB Specs

NVIDIA GH100, 1755 MHz, 14592 Cores, 456 TMUs, 24 ROPs, 81920 MB HBM2e, 1593 MHz, 5120 bit

www.techpowerup.com

AMD Radeon Instinct MI300X Specs

AMD Aqua Vanjaram, 2100 MHz, 19456 Cores, 1216 TMUs, 0 ROPs, 196608 MB HBM3, 2525 MHz, 8192 bit

www.techpowerup.com

oh yeah and they are not from the same generation either going by launch dates, maybe you are unaware but nvidia has already launched b200

it's the de facto dx12 benchmark
and it's hypocritical at the same time, because you have conveniently skipped RT benchmarks
we are talking about the full extent of capabilities of both chips, so it doesn't make much sense to skip RT performance

here's data on port royal:
7900 xtx: 15793
rtx 4090: 25692 (+63%)

AMD Radeon RX 7900 XT & 7900 XTX Review - Page 8

AMD is finally ready to throw some fresh hardware into the endless battle between Nvidia and AMD for gaming dominance, sure Intel is part of the rumble these days too, but it’s fair to say they’ve got…

www.eteknix.com

and tsmc 4n is just a custom 5nm node for nvidia, as has been reported by popular press outlets

Yeah but listen... pineapple belongs on pizza.

brostradamus · 16 Jul 2024 at 20:30

Dicehunter said:
Yeah but listen... pineapple belongs on pizza.

AMD Hawaii GPU Specs

2816 Cores, 176 TMUs, 64 ROPs

www.techpowerup.com

humbug · 16 Jul 2024 at 21:21

Dicehunter said:
Yeah but listen... pineapple belongs on pizza.

Absolutely not!

Zarax · 17 Jul 2024 at 05:17

Dicehunter said:
Yeah but listen... pineapple belongs on pizza.

The Italian embassy just declared you persona non grata

nomadd · 17 Jul 2024 at 08:10

Dicehunter said:
pineapple belongs on pizza.

glue belongs on pizza.

Dicehunter · 17 Jul 2024 at 10:37

Zarax said:
The Italian embassy just declared you persona non grata

I consider that a mark of honour

Legion · 21 Jul 2024 at 16:23

AMD RDNA 4 GPUs To Feature Enhanced Ray Tracing Architecture With Double RT Intersect Engine, Coming To Radeon RX 8000 & Sony PS5 Pro

AMD RDNA 4 GPUs are expected to feature a brand new and enhanced Ray Tracing engine for Radeon RX 8000 & Sony PS5 Pro.

wccftech.com

Dicehunter · 21 Jul 2024 at 16:39

So IF the info so far is correct, we'll be looking at 7800 XT performance levels at most, at around £400-ish or potentially lower.

GhostDog1981 · 21 Jul 2024 at 17:09

Dicehunter said:
So IF the info so far is correct, we'll be looking at 7800 XT performance levels at most, at around £400-ish or potentially lower.

Why do you say Navi 48 would top out at 7800 XT levels?
