• Competitor rules

    Please remember that any mention of competitors, hinting at competitors or offering to provide details of competitors will result in an account suspension. The full rules can be found under the 'Terms and Rules' link in the bottom right corner of your screen. Just don't mention competitors in any way, shape or form and you'll be OK.

**THE NVIDIA DRIVERS THREAD**

Upcoming driver has huge performance improvements

DX11 could rival Mantle


The big story at GDC last week was Microsoft’s reveal of DirectX 12 and the future of the dominant API for PC gaming. There was plenty of build up to the announcement with Microsoft’s DirectX team posting teasers and starting up a Twitter account of the occasion. I hosted a live blog from the event which included pictures of the slides. It was our most successful of these types of events with literally thousands of people joining in the conversation. Along with the debates over the similarities of AMD’s Mantle API and the timeline for DX12 release, there are plenty of stories to be told.

After the initial session, I wanted to setup meetings with both AMD and NVIDIA to discuss what had been shown and get some feedback on the planned direction for the GPU giants’ implementations. NVIDIA presented us with a very interesting set of data that both focused on the future with DX12, but also on the now of DirectX 11.


View Full Size


The reason for the topic is easy to decipher – AMD has built up the image of Mantle as the future of PC gaming and, with a full 18 months before Microsoft’s DirectX 12 being released, how developers and gamers respond will make an important impact on the market. NVIDIA doesn’t like to talk about Mantle directly, but it’s obvious that it feels the need to address the questions in a roundabout fashion. During our time with NVIDIA’s Tony Tamasi at GDC, the discussion centered as much on OpenGL and DirectX 11 as anything else.

What are APIs and why do you care?

For those that might not really understand what DirectX and OpenGL are, a bit of background first. APIs (application programming interface) are responsible for providing an abstraction layer between hardware and software applications. An API can deliver consistent programming models (though the language can vary) and do so across various hardware vendors products and even between hardware generations. They can provide access to feature sets of hardware that have a wide range in complexity, but allow users access to hardware without necessarily knowing great detail about it.

Over the years, APIs have developed and evolved but still retain backwards compatibility. Companies like NVIDIA and AMD can improve DirectX implementations to increase performance or efficiency without adversely (usually at least) affecting other games or applications. And because the games use that same API for programming, changes to how NVIDIA/AMD handle the API integration don’t require game developer intervention.

With the release of AMD Mantle, the idea of a “low level” API has been placed in the minds of gamers and developers. The term “low level” can mean many things, but in general it is associated with an API that is more direct, has a thinner set of abstraction layers, and uses less translation from code to hardware. The goal is to reduce the amount of overhead (performance hit) that APIs naturally impair for these translations. With additional performance available, the CPU cycles can be used by the program (game) or be slept to improve battery life. In certain cases, GPU throughput can increase where the API overhead is impeding the video card's progress.

Passing additional control to the game developers, away from the API or GPU driver developers, gives those coders additional power and improves the ability for some vendors to differentiate. Interestingly, not all developers want this kind of control as it requires more time, more development work, and small teams that depend on that abstraction to make coding easier will only see limited performance advantages.

The reasons for this transition to a lower level API is being driven the by widening gap of performance between CPU and GPUs. NVIDIA provided the images below.


View Full Size


On the left we see performance scaling in terms of GFLOPS and on the right the metric is memory bandwidth. Clearly the performance of NVIDIA's graphics chips has far outpaced (as have AMD’s) what the best Intel desktop processor have been able and that gap means that the industry needs to innovate to find ways to close it.

Continue reading NVIDIA Talks DX12, DX11 Efficiency Improvements!!!


View Full Size


Even with that huge disparity, there aren't really that many cases which are ripe for performance improvement with CPU efficiency increases. NVIDIA showed us this graphic above with performance changes when scaling a modern Intel Core i7 processor from 2.5 GHz to 3.3 GHz. 3DMark and AvP benchmarks don't scale at all, Battlefield 3 scales up to 3% with the GTX Titan, Bioshock Infinite scales across the board up to 5% and Metro: Last Light is the stand out with an odd 10%+ change on the HD 7970 GHz Edition.

NVIDIA doesn’t deny that a lower level API is beneficial or needed for PC gaming. It does, however, think that the methodology of AMD’s Mantle is the wrong way to go. Fragmenting the market into additional segments with a proprietary API does not maintain the benefits of hardware abstractions or “cross vendor support”. I realize that many readers will see some irony in this statement considering many in the industry would point to CUDA, PhysX, 3D Vision and others as NVIDIA’s own proprietary feature sets.

NVIDIA’s API Strategy

Obviously NVIDIA is going to support DirectX 12 and continues to support the latest updates to OpenGL. You’ll find DX12 support on Fermi, Kepler and Maxwell parts (in addition to whatever is coming next) and NVIDIA says they have been working with Microsoft since the beginning on the new API. The exact timeline of that and what constitutes “working with” is also up for debate, but that is mostly irrelevant for our discussion.

What NVIDIA did want to focus on with us was the significant improvements that have been made on the efficiency and performance of DirectX 11. When NVIDIA is questioned as to why they didn’t create their Mantle-like API if Microsoft was dragging its feet, they point to the vast improvements possible and made with existing APIs like DX11 and OpenGL. The idea is that rather than spend resources on creating a completely new API that needs to be integrated in a totally unique engine port (see Frostbite, CryEngine, etc.) NVIDIA has instead improved the performance, scaling, and predictability of DirectX 11.


View Full Size


This graphic, provided by NVIDIA of course, shows 9 specific Direct3D 11 functions. The metric of efficiency in this case is rated by the speed increase between the AMD R9 290X in red and the three different progressive driver versions in green on a GTX 780 Ti. The Draw, SetIndexBuffer and SetVertexBuffers functions have gone through several hundred percent performance improvements since just the R331 driver stack to an as-yet-unreleased driver due out in the next couple of weeks.

These were obviously hand selected by NVIDIA so there may be others that show dramatically worse results, but it is clear that NVIDIA has been working to improve the efficiency of DX11. NVIDIA claims that these fixes are not game specific and will improve performance and efficiency for a lot of GeForce users. Even if that is the case, we will only really see these improvements surface in titles that have addressable CPU limits or very low end hardware, similar to how Mantle works today.


View Full Size


NVIDIA shows this another way by also including AMD Mantle. Using the StarSwarm demo, built specifically for Mantle evaluation, NVIDIA’s GTX 780 Ti with progressive driver releases sees a significant shift in relation to AMD. Let’s focus just on D3D11 results – the first AMD R9 290X score and then the successive NVIDIA results. Out the gate, the GTX 780 Ti is faster than the 290X even using the R331 driver. If you move forward to the R334 and the unreleased driver you see improvements of 57% pushing NVIDIA’s card much higher than the R9 290X using DX11.

If you include Mantle in the picture, it improves performance on the R9 290X by 87% - a HUGE amount! That result was able to push the StarSwarm performance past that of the GTX 780 Ti with the R331 and R334 drivers but isn’t enough to stay in front of the upcoming release.

Thief, the latest Mantle-based game release, shows a similar story; an advantage for AMD (using driver version 14.2) over the GTX 780 Ti with R331 and R334, but NVIDIA’s card taking the lead (albeit by a small percentage) with the upcoming driver.

If you followed the panels at GDC at all, you might have seen one about OpenGL speed improvements as well. This talk was hosted by NVIDIA, AMD and Intel and all involved openly bragging about the extension-based changes to the API that have increased efficiency in a similar way to what NVIDIA has done with DX11. Even though OpenGL often gets a bad reputation for being outdated and bulky, the changes have added support for bindless textures, texture arrays, shader storage buffer objects, and commonly discussed DirectX features like tessellation, compute shaders, etc.


View Full Size


Add to that the extreme portability of OpenGL across mobile devices, Windows, Linux, Mac, and even SteamOS, and NVIDIA says their commitment to the open-source API is stronger than ever.

The Effect of DirectX 12

As we discussed in our live blog, the benefits of the upcoming DX12 implementation will come in two distinct parts: performance improvements for existing hardware and feature additions for upcoming hardware. Microsoft isn’t talking much about the new features that it will offer and instead are focused on the efficiency improvements. These include reductions in submission overhead, improved scalability on multi-core systems, and its ability to mimic a console-style execution environment. All of this gives more power to the developer to handle and manipulate the hardware directly.


View Full Size


NVIDIA claims that work on DX12 with Microsoft “began more than four years ago with discussions about reducing resource overhead. For the past year, NVIDIA has been working closely with the DirectX team to deliver a working design and implementation of DX12 at GDC.” This would indicate that while general ideas about what would be in the next version of DX, the specific timeline to build and prepare it started last spring.

NVIDIA is currently the only GPU vendor to have a DX12 capable driver in the hands of developers and the demo that Microsoft showed at GDC was running on a GeForce GTX TITAN BLACK card.

Will NVIDIA feel heat from Mantle?

Though it doesn’t want to admit it, NVIDIA is clearly feeling some pressure from gamers and media due to AMD’s homemade Mantle API. The company’s stance is to wait for DirectX 12 to be released and level the playing field with an industry standard rather than proceed down the pathway of another custom option. In the next 18 months, though, there will be quite a few games released with Mantle support, using the Frostbite engine or CryEngine. How well those games are built, and how much of an advantage the Mantle code path offers, will determine if gamers will respond positively to Radeon cards. NVIDIA, on the other hand, will be focusing its reticule on improving the efficiency and performance of DirectX 11 in its own driver stack in an attempt to maximize CPU efficiency (and thus overall performance) levels to rival Mantle.

During a handful of conversations with NVIDIA on DirectX and Mantle, there was a tone from some that leaned towards anger, but hints at annoyance. It’s possible, according to NVIDIA’s performance improvements in DX11 efficiency shown here, that AMD could have accomplished the same thing without pushing a new API ahead of DirectX 12. Questions about the division of internal resources on the AMD software team between Mantle and DirectX development are often murmured as is the motives of the developers continuing to adopt Mantle today. Finding the answers to such questions is a fruitless endeavor though and to speculate seems useless – for now.

AMD has done good with Mantle. Whether or not the company intended for the new API to become a new standard, or merely force Microsoft's hand with DirectX 12, it is thanks to AMD that we are even talking about efficiency with such passion. Obviously AMD hopes they can get some financial benefits from the time and money spent on the project, with improved marketshare and better mindshare with gamers on the PC. The number and quality of games that are released in 2014 (and some of 2015) will be the determining factor for that.

Over the next year and half, NVIDIA will need to prove its case that DirectX 11 can be just as efficient as what AMD has done with Mantle. Or at the very least, the performance deltas between the two options are small enough to not base purchasing decisions on. I do believe that upon the release of DX12, the playing field will level once again and development on Mantle will come to a close; that is, at least if Microsoft keeps its promises.
http://www.pcper.com/reviews/Graphics-Cards/NVIDIA-Talks-DX12-DX11-Efficiency-Improvements

TL: DR, nVidia drivers for DX11 will rock.

Nvidia guys are in for a treat :D Bring it on baby!!
 
So are they saying that some low level features already existed in DirectX 11 and that NVidia will release a driver exposing that performance?
 
I compared my 290 results to their (Nvidia's) 290X results. In starswarm with my cpu at 4.8ghz vs their 6 core cpu i scored 39.94 fps and in Thief on dx11.1 i scored 61 fps. Both considerably higher than their 290X scored in those benches so perhaps a grain of salt required.

Or if those benches are correct i suspect some doctoring has occured. Perhaps in thermally constrained environment and ensuring the 290X was running on quiet mode. That's not been done before, has it? :D


D75G6vf.jpg

It’s not fair to criticize the 290X’s Quiet mode, or to benchmark the GPU solely in that mode. That’s not what the card is designed to do. But the R9 290 does have a problem — AMD, in a fit of “Eat your cake and have it, too” shipped a driver that sets a fan profile speed that still allows for throttling. Why? Because it wanted to keep the noise down — it didn’t want a slew of reviews saying the R9 290 is hot and noisy.

Source
http://www.extremetech.com/gaming/1...blem-but-nvidias-smear-attack-is-heavy-handed


Shots fired though and i welcome competition. Can DX11 be better than Mantle? Bring it on i say. Lets see if its true. :)
 
Last edited:
Cant compare unequal setup LT :)
Dont see any mention of there the CPU being overclocked and star swarm never seems to have the same result unless you edit one of the files
 
I embrace any new improvements and I can tell this has you worried Matt or you wouldn't have bothered running tests to try to disprove PCper's findings. Good stuff :D

I asked Joel how they came up with their results and he told me how they did it. ;)

I welcome the competition Greg. Nothing to worry about makes things interesting i say. I love a bit of shots fired between sides. Makes things exciting around here. :D

Cant compare unequal setup LT :)
Dont see any mention of there the CPU being overclocked and star swarm never seems to have the same result unless you edit one of the files

I know i have a slower setup, cpu and gpu.
 
Last edited:
f8e405263109b1de6dd2f4db7233c93b.jpg


Check out the improvements with the new drivers. Can't wait for these to hit :D
 
I don't see how Mantle will be able to compete if NVidia and Microsoft have a magic switch to enable Mantle-like performance improvements on a firmly entrenched DirectX 11, it sounds too good to be true tbh!

If it's the case it would be easier for AMD to just fall in line and release their own DX11 low level driver rather than push for Mantle and fragment the PC gaming industry.

How soon is the NVidia driver expected?
 
I don't see how Mantle will be able to compete if NVidia and Microsoft have a magic switch to enable Mantle-like performance improvements on a firmly entrenched DirectX 11, it sounds too good to be true tbh!

If it's the case it would be easier for AMD to just fall in line and release their own DX11 low level driver rather than push for Mantle and fragment the PC gaming industry.

How soon is the NVidia driver expected?

Not sure on time scales but I imagine nVidia had this working nicely at GDC, so a release time will not be too far away. Maybe just needs a few bugs ironed out.
 
I will give this a try on my GTX660 and see what kind of performance boost I get.

Looks good,but TBH,look at the CPU used - a Core i7 3930K??

TH tests Thief with Mantle:

http://www.tomshardware.com/reviews/thief-mantle-benchmarks,3773.html

An FX4170 just about matches a Core i7 4770K!

X343g7k.png

ujEdKhK.png

I want to see tests with lower end CPUs and GPUs.

I couldn't care less about high end Core i7 CPUs and £300+ GPUs.

If Nvidia can improve performance on lower end and midrange cards,like the GTX750TI and GTX660 with sub £100 CPUs,I will be impressed.

This is where we need the performance improvements - they did already quite well with their MT drivers,so I want to see the improvements here the most.




I don't see how Mantle will be able to compete if NVidia and Microsoft have a magic switch to enable Mantle-like performance improvements on a firmly entrenched DirectX 11, it sounds too good to be true tbh!

If it's the case it would be easier for AMD to just fall in line and release their own DX11 low level driver rather than push for Mantle and fragment the PC gaming industry.

How soon is the NVidia driver expected?

from the article said:
NVIDIA doesn’t deny that a lower level API is beneficial or needed for PC gaming. It does, however, think that the methodology of AMD’s Mantle is the wrong way to go. Fragmenting the market into additional segments with a proprietary API does not maintain the benefits of hardware abstractions or “cross vendor support”. I realize that many readers will see some irony in this statement considering many in the industry would point to CUDA, PhysX, 3D Vision and others as NVIDIA’s own proprietary feature sets.

BTW,Nvidia used NV API for BF2 and BF3 too which pushed them ahead of AMD in those games. Try not to forget about that either and that was proprietary. However,the line about AMD trying to fragment the gaming industry - Nvidia has been pushing proprietary stuff FAR longer and even the article mocked the crocodile tears. PCPER also has close ties to Nvidia,so if they said that they must have seen the irony in the statement.
 
Last edited:
I think it is clear to see Cat that proprietary is something that nVidia do and I don't care in truth.

TBF,your correct in this regard,and AMD are doing a bit of it now too. I suppose its more about trying to sell that your product has features the other one does not have,etc. Its not the best way forward,but I kind of understand why it happens - even DX11 is proprietary to Windows and not on Linux,etc.

However,I do hope sites test the GTX750TI/GTX660 with a Core i3,FX6300,etc to see the performance increase. This is the part of the market which needs any performance boost the most TBH.

The Nvidia DX11 drivers thread better in general,so outside of Mantle enabled games,the FX6300 fares better with Nvidia cards and the AMD cards fare better with Intel CPUs.

The irony!! :p
 
Last edited:
Needs to be benched on lower specked cpu.

Having said that yet again everyone should be sending a thank you note to AMD for giving the pc industry a kick up the backside.

Happy days for all of us no matter what gpu you have.



Now here follows several pages of trolling from each 'side' >>>>
 
TBF,your correct in this regard,and AMD are doing a bit of it now too. I suppose its more about trying to sell that your product has features the other one does not have,etc. Its not the best way forward,but I kind of understand why it happens - even DX11 is proprietary to Windows and not on Linux,etc.

However,I do hope sites test the GTX750TI/GTX660 with a Core i3,FX6300,etc to see the performance increase. This is the part of the market which needs any performance boost the most TBH.

The Nvidia DX11 drivers thread better in general,so outside of Mantle enabled games,the FX6300 fares better with Nvidia cards and the AMD cards fare better with Intel CPUs.

The irony!! :p

For sure and that made me proper lol that nVidia performs better on AMD CPU's over Intel CPU's :D

I am sure there is some marketing hype in this but I hope all nVidia users get a nice performance boost with less driver overheads.
 
Back
Top Bottom