• Competitor rules

    Please remember that any mention of competitors, hinting at competitors or offering to provide details of competitors will result in an account suspension. The full rules can be found under the 'Terms and Rules' link in the bottom right corner of your screen. Just don't mention competitors in any way, shape or form and you'll be OK.

Possible huge ATI tessellation performance boost with 10.5 drivers

Soldato
Joined
6 Oct 2007
Posts
23,115
Location
North West
Could be something of nothing but it is a very interesting read.

ATI uses 1 hardware tessellation unit. Nvidia uses 16 tessellation units that are emulated within the CUDA cores. Currently, ATI's tessellation is not aided by the Stream Processors, whereas Nvidia's SP's are what does it. This is one of the major reasons the tessellation performance difference in Unigine Heaven bench is so huge; NV's cores are constantly being used for emulating Tessellation, while ATI's are only partially used for everything else and only the 1 hardware tessellation unit is being used for tessellation. Once there are moving players/monsters and fighting going on and physics being processed, those shader cores need to process a lot more so there is less room for processing tessellation. This is why the performance gap is smaller in actual games.

Currently, ATI is working on changing this within the drivers so that the Stream Processors will aid the Tessellation Unit with processing tessellation. Supposedly this change will be released with the 10.5 drivers and will be the first large tessellation-based improvement.

They're also working to further improve tessellation by making the cache work better between the tessellation unit and the SP's, which is going to be in the 11.0 Catalyst drivers.

As for improving your scores, stick with 10.3a drivers. The CrossfireX profiles are larger and support more software, and the drivers handle 2D better. There are still problems with multiple monitors going to sleep. This can be resolved by installing the Catalyst 10.2 RC2 drivers, then upgrading to the Catalyst 10.3a drivers without doing an uninstall. This workaround works for now.

If you want to boost performance, focus on overclocking the core to boost the tessellation unit processing clocks. Right now, ATI are working on a way to expand tessellation onto the Stream Processors. When they complete that, nVidia is dead. If they get the SP units to work hand in hand as cache and pre/post processors for the tessellation units, then the tessellation performance will SKYROCKET! It technically could be called cheating, but if you want to get the most out of a benchmark with a dedicated tessellation processor, the only way to get more tessellation processor is to assist it. nVidia has no tessellation processor, as tessellation is emulated on the CUDA cores. When the new drivers are finished, likely to be released in the 10.5 beta catalyst drivers, things will change in the benchmark world.

Catalyst 11.0 is already in the works, but it is focusing on using the Stream Processors to aid in render on benchmarks and graphics systems, using the tessellation units as a math base cache systems and booster. When these are complete, the HD 5000 series will eat the GTX 480 as a midnight snack, and it will only be the first bite out of the proverbial cake.

ATI is re-working the driver models as we speak. In the end, ATI learned from how nVidia handled tessellation, and uses the multiple SIMD I/O system to destroy the graphics gap. Right now, we are using an older model from 9.1 catalyst where tessellation is only run on the tessellation units. The new model will be a complete re-working of the way the card is run.

The beauty of ATI's dedicated cores is the card can be software adapted to do so many things, and with the ECC and tessellation units in place, the sky is the limit.

Also, it's likely they're increasing the amount of Hardware Tessellation units in the 5xxx refresh. Very likely even further improved in the 6xxx series.

At this point it's very hard to tell if Nvidia is only 'temporarily' better in tessellation, or if they'll hold on to this for a while. Based on the improvements ATI will be making, I'm not doubting at all that the difference in the near future between both will be much smaller than today, for tessellation performance.
 
Last edited:
Well it's true about Nvidia tessellation performance dropping in games compared to synthetic benchmarks they have optimized the cores for, check Dirt2 out when in the menu area where the tessellated crowd is, the 480 is pushing the same frames as the 5870.
 
I've heard about this as well....though at the moment tessellation is still just a marketing thing. Who cares about it and DX11 really.
 
To be honest all ATI is going to acheive is better Heaven benchmark results. As they have stated themselves, once the gtx 480 is using its shaders for the graphics in the game heavily, it's tessellation performance drops. The same thing will happen with the 5 series.

Still nice to have and perhaps will still make a difference to some games and their performance with tessellation.
 
new driver model could indicate a lot more improvements than just tessellation. though 11.01 will be for the brave I'd guess...
 
To be honest all ATI is going to acheive is better Heaven benchmark results. As they have stated themselves, once the gtx 480 is using its shaders for the graphics in the game heavily, it's tessellation performance drops. The same thing will happen with the 5 series.

Still nice to have and perhaps will still make a difference to some games and their performance with tessellation.

Image an HT like situation for the 5xx0 series of cards.

Most games wont fill all 5 units of each shader, imagine if you could use the remaining ones for tessalation. :D
 
This keeps getting better and better, If it gets done before the 46* releases I'm going to lol oh so hard. :rolleyes:


- Ordokai
 
Last edited:
is there any news on the refresh/southern isles? - i read it taped out a few weeks back but nothing since... i suppose it would make sense to wait until this driver change to launch new cards
 
is there any news on the refresh/southern isles? - i read it taped out a few weeks back but nothing since... i suppose it would make sense to wait until this driver change to launch new cards

Taping out a core, even if it doesn't need a respin(its INCREDIBLY unlikely for ANY core to tape out and go into full scale production on one spin, its also unlikely and rare for 3 full respins to be needed). Basically tape out first date means 2-3 months minimum till retail availability, another respin add's 4-6 weeks.

As for benchmark performance, rather pointless waste of time, if they can use excess shader performance to do tesselation that would otherwise not be used, then fine, but the way the architecture works thats very unlikely, possible, but unlikely.


They really need to be pushing VERY HARD to get game developers to code better for the architecture. Its somewhat PS3/360 problems at the moment in that programming to get the full 1600shaders working most of the time is incredibly difficult and doesn't happen often. Getting 3 of the cluster of 5 shaders working is fairly easy, but 4, or 5 is very rare. At any one time its rarely more than 60-70% loaded which leaves them huge headroom. Though really thats the side effect of making a core with a huge number of shaders without the massive size.

Its very likely that when Nvidia make a similar core(going as small as possible and thereby having to go for efficient and complex shaders rather than simple and large ones) that all of a sudden all game dev's will be breaking their necks to optimise code for the AMD style architecture.
 
Lets get two thing straight... nVidia's tessellation processing is NOT aided by the stream processors (as in traditional compute - but is implemented as dedicated inline geometry unit)... and...

Using the stream processors to increase tessellation performance is going to be VERY hit and miss, your doing additional work that the nVidia polymorph engine doesn't do, deeper into the pipeline, as the ATI cards have masses of shaders its likely that in specific instances it can provide a decent speed up, but in others its going to have overhead penalties that may even cancel out the gains, so its going to make it more complex to program for. Outside of predictable targets like static benchmarks I think the writebacks/local memory issues will likely murder performance.
 
Last edited:
Is that right..care to explain how Nvidia does tessellation on fermi cards, no dedicated tessellation hardware, how does it do it...magic?
 
Last edited:
Back
Top Bottom