• Competitor rules

    Please remember that any mention of competitors, hinting at competitors or offering to provide details of competitors will result in an account suspension. The full rules can be found under the 'Terms and Rules' link in the bottom right corner of your screen. Just don't mention competitors in any way, shape or form and you'll be OK.

NVIDIA GeForce GTX 870 “Maxwell” Specifications Analysis – 13 SMM Units With 2nd Generation 1644 CUD

Caporegime
Joined
24 Sep 2008
Posts
38,284
Location
Essex innit!
The specifications of NVIDIA’s upcoming Maxwell based GeForce GTX 870 have been leaked Coolaler and we already covered the basic details of the card in another article which can be seen here. The NVIDIA Maxwell based GeForce GTX 870 graphics card is one of the three SKUs arriving to consumers in the upcoming months (September and October).GeForce GTX 870 / GeForce GTX 880 MakeUp

NVIDIA GeForce GTX 870 “Maxwell” Specifications Analysis

Last time, I did the analysis of the leaked GeForce GTX 880 PCB which was in engineering state and found out some intriguing details such as the 256-Bit memory bus, the GM204 chip being larger than GK104 while being based on the current 28nm process node. We didn’t get to see any specifications of the card itself aside from a few details we were able to gather from the pictures. Today, we have a different sort of leak in the form of a screenshot which shows the GPU-z program running parallel to a 3DMark 11 score of an unknown graphics card.
The leaker from the main source has mentioned that the graphics card in question is NVIDIA’s GeForce GTX 870 which is the skimmed down variant of the GeForce GTX 880 graphics which will be available to consumers in the upcoming months. The GeForce GTX 870 boasts the second generation Maxwell architecture and will feature the GM204 core architecture which will become the foundation of the flagship GeForce GTX 880 too. The GeForce GTX 860 is also expected to launch in Q4 2014 but it isn’t expected whether the card will feature GM204 core or the GM206 core but let’s forget that for a moment and focus on the details we have in hands.
The GeForce GTX 870 has the Device Id “NVIDIA D17U-20″ and the GPU Id for the graphics core is 13C2. We have our own speculation regarding that but before we get into that, let’s denote the entire specifications listed on the GPU-z panel. The GM204 SKU featured on the GeForce GTx 870 fuses 1664 CUDA Cores on the die along with 32 ROPs and 138 TMUs. From the first generation Maxwell core architecture, we learned that a Maxwell SMM (Streaming Multiprocessor Maxwell) unit has 128 cores compared to 192 on the current generation Kepler SMX units. Along with that, we have a 4 GB GDDR5 memory running across a 256-Bit memory interface clocked at 1753 MHz (7.00 GHz Effective) which pumps out 224.4 GB/s bandwidth. The core clock is maintained at 1051 MHz and 1178 MHz boost clock something which I was expecting if the cards were to be able to take on the GK110 core based graphics cards. Lastly, we have the fill rate numbers which amount to 33.6 GPixels/s Pixel and 145.0 GTexels/s Texture fill rates.
Now as you may remember, the first generation Maxwell core architecture which is featured on the GM107 GPU has 2 times the performance per watt and the CUDA Cores, even though lower amount to more performance that is rated at 135% of the past generation. The key word here is first generation, since NVIDIA just previewed what their Maxwell architecture is capable of and having ample time for optimizing and enhancing the architecture, I expect to see better numbers being rated at 145-150% and we can see more power efficiency on the new cards. The GeForce GTX 750 Ti which is NVIDIA’s entry level Maxwell offering is rated at 64W and comes with splendid performance for its wattage. Considering the updated design, I think there’s no doubt that NVIDIA GeForce GTX 870 will be able to trump the GeForce GTX 780 in performance (although not by a big margin).

The GPU name is also interesting since 13C2 could probably mean 13 SMM units which are featured on the GPU which makes us think about the GeForce GTX 880. With a SMM or two in place, we can see the GeForce GTX 880 include 1792 or 1920 cores which although lower than the GK110 based cards is reasonably higher than what was featured on the GK104 chips and with the performance per core leverage, we can see the GeForce GTX 880 right next to the GeForce GTX 780 Ti in performance numbers. The GM200 is still far away from launch and is expected next year but that just gives a glimpse of what to expect from the daddy core which can easily topple the flagship chips.

The core count and configuration for the GeForce GTX 880 looks around right since looking at the trend in the table provided below, you can note that the skimmed down *70 graphics card has generally one or two SM units disabled. The GK110 is the only chip in the line which has more chips disabled but that is due to the extended lineup due to which we had several cards in this line aka GeForce GTX 780, GeForce GTX Titan, GeForce GTX 780 Ti, GeForce GTX Titan Black (all of which are configured around the same core). In case there will be a higher SM units SKU, then NVIDIA will most definitely launch it as a Ti part as was rumored by a few sources a while back. If there are more disabled SMMs, then NVIDIA will launch them as a different card rather than releasing a 4 SMM disabled part
The 4 GB GDDR5 route seems to be the reference route for NVIDIA this time around moving onwards from a 2 GB GDDR5 VRAM configuration on several Kepler cards. The chips will still be configured around a 256-Bit bus but this just seems to be the route for graphics makers these days considering AMD is going this route with their next performance chip, codenamed Tonga (Radeon R9 285) which is considered faster than the Tahiti but has a 256-Bit memory bus compared to 384-Bit on Tahiti based Radeon R9 280 series.

NVIDIA GeForce GTX 870 Leaked Performance Numbers:

The performance numbers of the GeForce GTX 870 are interesting considering they are really close to the GeForce GTX 780. The card was tested with a Intel Core i7-4820K CPU and is currently a test sample so final performance updates and drivers are not in place considering the GPU-z couldn’t even recognize the card properly until the database is updated. In the 3DMark 11 Performance and Extreme modes, the GeForce GTX 870 was able to score P11919 points and X4625 marks respectively. A GeForce GTX 780 on the other hand on average scores around X4500 points and P12000 points. With the GeForce GTX 870 still optimized and without any sort of official drivers, I say these numbers are good considering this card will retail at $349 US and get you performance on par or better than a card based on a flagship chip from current generation.

NVIDIA Maxwell GM107 Architecture

The NVIDIA Maxwell GM107 architecture is most clearly built from scratch yet looks like an hybrid of both Kepler and Fermi. The SMM or Streaming Multiprocessor of Maxwell will replace the SMX of Kepler and each of the smm are assembled into four blocks yet are defined as part of a single SMM which means the core architecture has got quite dense with Maxwell. Each of these blocks hold 32 CUDA cores so a single SMM with four of these operation units results in 128 CUDA cores. The SMM has 128 CUDA cores compared to 192 CUDA cores on the SMX.
The GM107 GPC ‘Graphics Processing Cluster’ consists of five of these streaming multiprocessors which are connected to a Raster Engine. Each SMM consists of Polymorph Engine 2.0 which includes the Vertex Fetch, Tessellator, Viewport transform, attribute setup and stream output. Each SMM has 8 texture mapping units which equates to 40 on the whole chip and 16 ROPs while connected to two 64-bit memory controllers.
There’s also a handful of cache on Maxwell which is 2 MB in total compared to 256 KB on Kepler GK107 which reduces GPU queuing. IPCC has been increased on the GPU core and balancing improvement to workload has been done. NVIDIA has improved their H.264 encoding and decoding with NVENC on Maxwell so that would result in better performance in ShadowPlay technology and improved sleep states have been implemented to reduce power input while the GPU is running idle. All of this is packed in a die which measures 148mm2 which means transistor density has been upped by 15% on the same 28nm process design.

NVIDIA GM200.GM210 (Maxwell Architecture, High-Performance for Telsa/Quadro Arrives later for Cosnumers, Successor of GK110)
NVIDIA GM204 (Maxwell Architecture, High-End Consumer, Successor of GK104, First GeForce 800 Series Products likely to feature)
NVIDIA GM206 (Maxwell Architecture, Performance Minded, Successor of GK206, Mid-Range GeForce 800 Series products to feature)
NVIDIA GM107/207 (Maxwell Architecture, Entry Level, Successor of GK107, Entry Level GeForce 800/700 Series To feature, Already introduced on GTX 750 Ti / GTX 750)


Read more: http://wccftech.com/nvidia-geforce-...-cores-performance-benchmarked/#ixzz3A6K1QQvs

2f9593d9dfe9a50190c76ec2bb067644.jpg


3d047fcbbcbb56261957edfbc85baf4f.jpg


11b6033de414b24ef0f784e9cb175130.jpg


So $ to £ and 20% tax equates to around £250. It should be roughly the same speed at 1080P as a 780 or even slightly faster, so not too bad at all. I will be interested to see how the 256bit bus holds up at 4K resolutions.
 
Last edited:
If those spec for the GTX870 and GTX880 are true, then the performance (for 1080p) of the GTX870 would most probably sit between the GTX770 and the GTX780, where as the GTX880 (the 1920 CUDA cores one) would more or less matches the GTX780, but around 40W lower in power consumption.
 
Last edited:
If it comes in at sub £300 then I will be impressed but I don't think it will. People are going to get really hung up on that memory bandwidth to.
 
Yeah if those specs are true seems like an utter waste of time to me and the 870 would trail the 780 in most high res gaming scenarios.
 
i don't see those specs holding up against a gtx780. The texture fill rate is lower for a start and the memory bandwidth is also way down.
Yea...while 0xAA to 2xAA the performance gap of the GTX780 being ahead may not be too wide, but 4xAA or 8xAA, the GTX780 will pull ahead and widen the gap considerably...

Be prepared for Nvidia claiming these GTX870/GTX880 "trade blows" with GTX780 performance with "amazing efficiency and low temp", and quoting lots and lots of game benchmark results without AA applied :p
 
Last edited:
Yeah if those specs are true seems like an utter waste of time to me and the 870 would trail the 780 in most high res gaming scenarios.

I read the 870 specs so deleted my comment. The gtx880 might get closer but some of the specs are missing. I doubt there is any need for gtx780 owners to get excited about though.
 
According to the leaked slides (and remember they could be false), it scores slightly higher than a stock 780. I guess it comes down to what you game at. 1080P and I can see this card being a winner if it does hit the £250 mark but going higher on res and the gains will diminish.
 
I picked up a 780 Classy to mess with while I wait for the 880 to be released.

Will either step up to a 780ti or an 880. Time will tell :D
 
So I guess these will all be Samsung ICs, with a 256bit bus you need to crank the RAM. I don't think Elpida even make chips that are rated at this b/w.
 
3DMark11 GPU score is 13334, correct me if i'm wrong but thats slower than a stock 780?

The 3DMark11 Benchmark thread does not have the scores for lower 780's. but judging by some of the overclocked ones at 17000+ it looks a little low to me.

My 290 gets 17500 @ 1200Mhz.
 
Last edited:
Why does that chart list directx 12 support for everything?

Technically, all those cards are DX12 capable. That doesn't mean they are capable of the full feature set of DX12 but I would be interested to know if Maxwell is 'fully' DX12 ready.
 
http://www.3dmark.com/3dm11/8604885

I ran 3DMark11 on a Titan clocked at 1032Mhz (locked clocks) for comparison. 1k points more than the leaked slides. It would be cool of someone with a 780 would run it on stock clocks.

Give me a few minutes mate and ill do a quick blast, don't think I've ever ran 11 on this card. Need to update 3dm.
 
Last edited:
Is that right?

Thats actually a lot slower than the 5'th slowest 780

Even my old 7870 managed near 10K

That is correct. Setter will show what stock gives. Don't forget that the clocks are dropped off and not representative of real world clocks.
 
3DMark11 GPU score is 13334, correct me if i'm wrong but thats slower than a stock 780?

The 3DMark11 Benchmark thread does not have the scores for lower 780's. but judging by some of the overclocked ones at 17000+ it looks a little low to me.

My 290 gets 17500 @ 1200Mhz.

It is definitely slower than a stock 290X, I have just run it and even with me browsing the internet in the background it was not close.

I also think that the slides are fake as they have covered parts of GPUZ that does not need it.
 
Back
Top Bottom