• Competitor rules

    Please remember that any mention of competitors, hinting at competitors or offering to provide details of competitors will result in an account suspension. The full rules can be found under the 'Terms and Rules' link in the bottom right corner of your screen. Just don't mention competitors in any way, shape or form and you'll be OK.

NVIDIA 4000 Series

whitepaper:

lol the way they start with a 2pg highlight of their trackrecord of innovation..

summary:
- BVH structures have been simplified, probably they have sacrificed BVH depth for a much faster data structure that sacrifices traversals for set-up times - the trade-off has been favorable (also some talk abt a new kind of primitive, perhaps a game dev can explain this bit better)
- RT cores now also include alpha testing capability which was previously done using shaders, this is going to be a big advantage for nvidia in RT - more games with more foliage, looks like nvidia's tessellation moment in RT
- SER: another big innovation in RT, and looks intuitive - nvidia has found a way to treat secondary rays as primary rays, other than the ability to control divergence in execution this is going to make RT more scalable in future.. its just looking like a hardware accelerated recursive structure - pretty neat
- DLSS 3 has been detailed in a separate whitepaper
 
Last edited:
whitepaper:

lol the way they start with a 2pg highlight of their trackrecord of innovation..
Ah interesting.

RTX 4080 12GB and 16GB was confused before the whitepaper published. Everybody thought RTX 4080 12GB was changed from RTX 4070 12GB at last min but the whitepaper finally made it clear that RTX 4080 12GB will replace RTX 3080 10/12GB, not RTX 3070 and RTX 4080 16GB is actually Ti class will replace RTX 3080 Ti, not RTX 3080 10/12GB. I thought RTX 4080 Ti will launch later in early 2023 to replace RTX 3080 Ti.

Well guess I will have to wait until after Navi 31 launch to read reviews to see RTX 4080 12GB and 16GB benchmarks before decide to pick MSI RTX 4080 Suprim X 12GB or Ti class 16GB.
 
Last edited:
Ah interesting.

RTX 4080 12GB and 16GB was confused before the whitepaper published. Everybody thought RTX 4080 12GB was changed from RTX 4070 12GB at last min but the whitepaper finally made it clear that RTX 4080 12GB will replace RTX 3080 10/12GB, not RTX 3070 and RTX 4080 16GB is actually Ti class will replace RTX 3080 Ti, not RTX 3080 10/12GB. I thought RTX 4080 Ti will launch later in early 2023 to replace RTX 3080 Ti.

Well guess I will have to wait until after Navi 31 launch to read reviews to see RTX 4080 12GB and 16GB benchmarks before decide to pick MSI RTX 4080 Suprim X 12GB or Ti class 16GB.

I don't think anyone made the 4070 accusation based on what Nvidia puts in columns next each other.
 
so in the US a load of AiB's are selling the low end models for the same as founders MSRP.......... whats the likleyhood of that happening here too?

This is correct but said models will be shipping in extremely small numbers so wait times will be weeks and potentially months.

The majority and volumes of stock will be on more expensive OC models as that is where the AIBs have better margins.
 
Last edited:
Ah interesting.

RTX 4080 12GB and 16GB was confused before the whitepaper published. Everybody thought RTX 4080 12GB was changed from RTX 4070 12GB at last min but the whitepaper finally made it clear that RTX 4080 12GB will replace RTX 3080 10/12GB, not RTX 3070 and RTX 4080 16GB is actually Ti class will replace RTX 3080 Ti, not RTX 3080 10/12GB. I thought RTX 4080 Ti will launch later in early 2023 to replace RTX 3080 Ti.

Well guess I will have to wait until after Navi 31 launch to read reviews to see RTX 4080 12GB and 16GB benchmarks before decide to pick MSI RTX 4080 Suprim X 12GB or Ti class 16GB.

The 4080 ti will be on a AD102 chip with 16GB or 20GB VRAM (but expecting it to be 16GB and not 20GB), the AD103 chip that the 16GB 4080 is on will be the 4080ti laptop chip as is the case with the 3080ti laptop chip is a GA103 with 16GB too.


Don't be fooled by the naming, the same was meant to happen on Ampere but AMD messed that up for Nvidia.

The real Ampere should have been :-

AD102 Full die Titan AP and A series cards (Quadros of the past).
AD102 Slightly cut down again a Titan A and A series cards.
AD102 3080 Ti More cut down than the Titan A and half the VRAM and A series cards.

AD103 3080 desktop and 3080ti laptop.

AD104 3070 and 3060's and 3080,3070,3060s laptop.


The 3090 was never meant to exist, all 90 class cards before were dual chip gpus on a single card.

So what the 3090 showed Nvidia is people are willing to buy this type of class card that was normally a Titan before without the Titan drivers, so now they have a new product range the single chip 90 class cards for the people that want the best without titan drivers, but problem is the 4090 is a neutered 3090 this time as no NVLINK the big selling point to pros is now gone that wanted dual gpu setups with 48GB VRAM and double the performance.

Nvidia learned a lot from the 30 class series and have made sure not to make the same mistakes, also by them doing certain things to the 4090 says to me AMD don't have anything as good as 6000 series this time compared to 30 series before, so Nvidia is taking a bet they can get away with the 40 series range that they can sell less for more with the usual up tick in performance.

4080 12GB is looking like a 3080 12GB to me with a new name and not much performance difference and DLSS 3 support, 16GB 4080 and 4090 are in a different league from their previous cards as they should be with a proper up tick in performance and new features but also remember some features removed from 4090 compared to 3090 too. So you win some and lose some with the 4090.
 
Last edited:
Embargoes are a little mad when you think of it. I get the point, but really there are a whole load of press releases that run along the lines of "We have a new product! It's brilliant! It'll revolutionise your experience! This is definitely a game changer! It is safe to upgrade, the more you buy the more you save!"

"Oh cool, can we see it perform?"

"....No."
 
So what the 3090 showed Nvidia is people are willing to buy this type of class card that was normally a Titan before without the Titan drivers, so now they have a new product range the single chip 90 class cards for the people that want the best without titan drivers, but problem is the 4090 is a neutered 3090 this time as no NVLINK the big selling point to pros is now gone that wanted dual gpu setups with 48GB VRAM and double the performance.
Nvlink has been removed from the A series Ada cards as well. It seems like Nvidia may try and do something through PCIE 5
 
Nvlink has been removed from the A series Ada cards as well. It seems like Nvidia may try and do something through PCIE 5

I doubt it as the previous NVLINK was faster and had more bandwidth than what they could do on PCIE 5, but we will see , so far what I understand is Nvidia has removed all the silicon on the Ada chips related to NVLINK and only on Hopper chips this gen. Guessing we will see it on a Hopper version A series soon enough so they can add another range of cards with different features and higher prices, their way to knock up the prices of a feature we took for granted and now becoming a luxury feature for top end cards from now. Nvidia silly games again from the gamer market to the pro market.
 
I doubt it as the previous NVLINK was faster and had more bandwidth than what they could do on PCIE 5, but we will see , so far what I understand is Nvidia has removed all the silicon on the Ada chips related to NVLINK and only on Hopper chips this gen. Guessing we will see it on a Hopper version A series soon enough so they can add another range of cards with different features and higher prices, their way to knock up the prices of a feature we took for granted and now becoming a luxury feature for top end cards from now. Nvidia silly games again from the gamer market to the pro market.
So the quote I saw on another forums were. Unfortunately the person who posted it didn't give a source.

Jensen stated that the reason behind removing the NVLink connector was because they needed the I/O for “something else,”

Jen-Hsun continued with “and also, because Ada is based on Gen 5, PCIe Gen 5, we now have the ability to do peer-to-peer cross-Gen 5 that’s sufficiently fast that it was a better tradeoff”
 
I don't think anyone made the 4070 accusation based on what Nvidia puts in columns next each other.
You obviously not read many posts on this forum and other forums, websites, reddit or watching youtube etc.

Yes people wrongfully made overpriced $899 4070 accusations.








 
Ah interesting.

RTX 4080 12GB and 16GB was confused before the whitepaper published. Everybody thought RTX 4080 12GB was changed from RTX 4070 12GB at last min but the whitepaper finally made it clear that RTX 4080 12GB will replace RTX 3080 10/12GB, not RTX 3070 and RTX 4080 16GB is actually Ti class will replace RTX 3080 Ti, not RTX 3080 10/12GB. I thought RTX 4080 Ti will launch later in early 2023 to replace RTX 3080 Ti.

Well guess I will have to wait until after Navi 31 launch to read reviews to see RTX 4080 12GB and 16GB benchmarks before decide to pick MSI RTX 4080 Suprim X 12GB or Ti class 16GB.
Just look at the die sizes, neither are 80 class, ones a 60ti the others a 70. If you plan to buy something then get the 4090 as it'll be 40% faster than even the more expensive 4080 16gb which in turn will be around 25% faster than your 3080ti, the 4080 12gb version will be no faster than the 3080ti.
 
Just look at the die sizes, neither are 80 class, ones a 60ti the others a 70. If you plan to buy something then get the 4090 as it'll be 40% faster than even the more expensive 4080 16gb which in turn will be around 25% faster than your 3080ti, the 4080 12gb version will be no faster than the 3080ti.

This was the scary part. If your high end from last gen your not even sidegrading. A 4080 should be faster than a 3080Ti.
 
Jen-Hsun continued with “and also, because Ada is based on Gen 5, PCIe Gen 5, we now have the ability to do peer-to-peer cross-Gen 5 that’s sufficiently fast that it was a better tradeoff”


We know Ada is only using PCIe 4 on the slot so that doesn't make sense as you know then. Ada is PCIe 4 16 lanes only and PCIe 5 power connectors only. I think people are getting confused by all these PCIe numbers flying around the specs for Ada are already up and even states no SLI/NVLINK and PCIe Gen 4 16x. Nvidia removed it to make it a pro thing only for their top of the top cards and server gear. Even the A series has been neutered in the same way, only Hopper has NVLINK and they are not doing it with GEN 5 PCIe either but seems it's in the spec but not used for NVLINK just normal data over PCIE 5 and NVLINK 4.0 to link gpus. They are even using Nvlink switches now to do the linking.


NVLink 3.050 Gbit/s~6.25 GB/sAmpere,
NVSwitch for Ampere
NVLink 4.050 Gbit/s~6.25 GB/sHopper,
Nvidia Grace Datacenter/Server CPU
NVSwitch for Hopper

PCIe 5.032 GT/s[7]128b/130b~4 GB/sHopper

Performance​

The following table shows a basic metrics comparison based upon standard specifications:



InterconnectTransfer
rate
Line codeEffective payload rate
per lane
per direction
Max total
lane length
(PCIe: incl. 5" for PCBs)
Realized in design
PCIe 1.x2.5 GT/s8b/10b~0.25 GB/s20" = ~51 cm
PCIe 2.x5 GT/s8b/10b~0.5 GB/s20" = ~51 cm
PCIe 3.x8 GT/s128b/130b~1 GB/s20" = ~51 cm[6]Pascal,
Volta,
Turing
PCIe 4.016 GT/s128b/130b~2 GB/s8−12" = ~20−30 cm[6]Volta on Xavier
(8x, 4x, 1x),
Ampere,
Power 9
PCIe 5.032 GT/s[7]128b/130b~4 GB/sHopper
PCIe 6.064 GT/s128b/130b~8 GB/s
NVLink 1.020 Gbit/s~2.5 GB/sPascal,
Power 8+
NVLink 2.025 Gbit/s~3.125 GB/sVolta,
NVSwitch for Volta
Power 9
NVLink 3.050 Gbit/s~6.25 GB/sAmpere,
NVSwitch for Ampere
NVLink 4.050 Gbit/s~6.25 GB/sHopper,
Nvidia Grace Datacenter/Server CPU
NVSwitch for Hopper

The following table shows a comparison of relevant bus parameters for real world semiconductors that all offer NVLink as one of their options:


SemiconductorBoard/bus
delivery variant
InterconnectTransmission
technology
rate (per lane)
Lanes per
sub-link
(out + in)
Sub-link data rate
(per data direction)
Sub-link
or unit
count
Total data rate
(out + in)
Total
lanes
(out + in)
Total
data rate
(out + in)
Nvidia GP100P100 SXM,[8]
P100 PCI-E[9]
PCIe 3.08 GT/s16 + 16 128 Gbit/s = 16 GByte/s116 + 16 GByte/s[10]32 32 GByte/s
Nvidia GV100V100 SXM2,[11]
V100 PCI-E[12]
PCIe 3.08 GT/s16 + 16 128 Gbit/s = 16 GByte/s116 + 16 GByte/s32 32 GByte/s
Nvidia TU104GeForce RTX 2080,
Quadro RTX 5000
PCIe 3.08 GT/s16 + 16 128 Gbit/s = 16 GByte/s116 + 16 GByte/s32 32 GByte/s
Nvidia TU102GeForce RTX 2080 Ti,
Quadro RTX 6000/8000
PCIe 3.08 GT/s16 + 16 128 Gbit/s = 16 GByte/s116 + 16 GByte/s32 32 GByte/s
Nvidia Xavier[13](generic)PCIe 4.0 Ⓓ
2 units: x8 (dual)
1 unit: x4 (dual)
3 units: x1[14][15]
16 GT/s
8 + 8
4 + 4
1 + 1

128 Gbit/s = 16 GByte/s
64 Gbit/s = 8 GByte/s
16 Gbit/s = 2 GByte/s

2
1
3

32 + 32 GByte/s
8 + 8 GByte/s
6 + 6 GByte/s
40 80 GByte/s
IBM Power9[16](generic)PCIe 4.016 GT/s16 + 16 256 Gbit/s = 32 GByte/s396 + 96 GByte/s96192 GByte/s
Nvidia GA100[17][18]
Nvidia GA102[19]
Ampere A100
(SXM4 & PCIe[20])
PCIe 4.016 GT/s16 + 16 256 Gbit/s = 32 GByte/s132 + 32 GByte/s32 64 GByte/s
Nvidia GP100P100 SXM,
(not available with P100 PCI-E)[21]
NVLink 1.020 GT/s8 + 8 160 Gbit/s = 20 GByte/s480 + 80 GByte/s64160 GByte/s
Nvidia Xavier(generic)NVLink 1.0[13]20 GT/s[13]8 + 8 160 Gbit/s = 20 GByte/s[22]
IBM Power8+(generic)NVLink 1.020 GT/s8 + 8 160 Gbit/s = 20 GByte/s480 + 80 GByte/s64160 GByte/s
Nvidia GV100V100 SXM2[23]
(not available with V100 PCI-E)
NVLink 2.025 GT/s8 + 8 200 Gbit/s = 25 GByte/s6[24]150 + 150 GByte/s96300 GByte/s
IBM Power9[25](generic)NVLink 2.0
(BlueLink ports)
25 GT/s8 + 8 200 Gbit/s = 25 GByte/s6150 + 150 GByte/s96300 GByte/s
NVSwitch
for Volta[26]
(generic)
(fully connected 18x18 switch)
NVLink 2.025 GT/s8 + 8 200 Gbit/s = 25 GByte/s2 * 8 + 2
= 18
450 + 450 GByte/s288900 GByte/s
Nvidia TU104GeForce RTX 2080,
Quadro RTX 5000[27]
NVLink 2.025 GT/s8 + 8 200 Gbit/s = 25 GByte/s125 + 25 GByte/s1650 GByte/s
Nvidia TU102GeForce RTX 2080 Ti,
Quadro RTX 6000/8000[27]
NVLink 2.025 GT/s8 + 8 200 Gbit/s = 25 GByte/s250 + 50 GByte/s32100 GByte/s
Nvidia GA100[17][18]Ampere A100
(SXM4 & PCIe[20])
NVLink 3.050 GT/s4 + 4 200 Gbit/s = 25 GByte/s12[28]300 + 300 GByte/s96600 GByte/s
Nvidia GA102[19]GeForce RTX 3090
Quadro RTX A6000
NVLink 3.028.125 GT/s4 + 4 112.5 Gbit/s = 14.0625 GByte/s456.25 + 56.25 GByte/s16112.5 GByte/s
NVSwitch
for Ampere[29]
(generic)
(fully connected 18x18 switch)
NVLink 3.050 GT/s8 + 8 400 Gbit/s = 50 GByte/s2 * 8 + 2
= 18
900 + 900 GByte/s2881800 GByte/s
 
Last edited:
Back
Top Bottom