Could be something of nothing but it is a very interesting read.
ATI uses 1 hardware tessellation unit. Nvidia uses 16 tessellation units that are emulated within the CUDA cores. Currently, ATI's tessellation is not aided by the Stream Processors, whereas Nvidia's SP's are what does it. This is one of the major reasons the tessellation performance difference in Unigine Heaven bench is so huge; NV's cores are constantly being used for emulating Tessellation, while ATI's are only partially used for everything else and only the 1 hardware tessellation unit is being used for tessellation. Once there are moving players/monsters and fighting going on and physics being processed, those shader cores need to process a lot more so there is less room for processing tessellation. This is why the performance gap is smaller in actual games.
Currently, ATI is working on changing this within the drivers so that the Stream Processors will aid the Tessellation Unit with processing tessellation. Supposedly this change will be released with the 10.5 drivers and will be the first large tessellation-based improvement.
They're also working to further improve tessellation by making the cache work better between the tessellation unit and the SP's, which is going to be in the 11.0 Catalyst drivers.
As for improving your scores, stick with 10.3a drivers. The CrossfireX profiles are larger and support more software, and the drivers handle 2D better. There are still problems with multiple monitors going to sleep. This can be resolved by installing the Catalyst 10.2 RC2 drivers, then upgrading to the Catalyst 10.3a drivers without doing an uninstall. This workaround works for now.
If you want to boost performance, focus on overclocking the core to boost the tessellation unit processing clocks. Right now, ATI are working on a way to expand tessellation onto the Stream Processors. When they complete that, nVidia is dead. If they get the SP units to work hand in hand as cache and pre/post processors for the tessellation units, then the tessellation performance will SKYROCKET! It technically could be called cheating, but if you want to get the most out of a benchmark with a dedicated tessellation processor, the only way to get more tessellation processor is to assist it. nVidia has no tessellation processor, as tessellation is emulated on the CUDA cores. When the new drivers are finished, likely to be released in the 10.5 beta catalyst drivers, things will change in the benchmark world.
Catalyst 11.0 is already in the works, but it is focusing on using the Stream Processors to aid in render on benchmarks and graphics systems, using the tessellation units as a math base cache systems and booster. When these are complete, the HD 5000 series will eat the GTX 480 as a midnight snack, and it will only be the first bite out of the proverbial cake.
ATI is re-working the driver models as we speak. In the end, ATI learned from how nVidia handled tessellation, and uses the multiple SIMD I/O system to destroy the graphics gap. Right now, we are using an older model from 9.1 catalyst where tessellation is only run on the tessellation units. The new model will be a complete re-working of the way the card is run.
The beauty of ATI's dedicated cores is the card can be software adapted to do so many things, and with the ECC and tessellation units in place, the sky is the limit.
Also, it's likely they're increasing the amount of Hardware Tessellation units in the 5xxx refresh. Very likely even further improved in the 6xxx series.
At this point it's very hard to tell if Nvidia is only 'temporarily' better in tessellation, or if they'll hold on to this for a while. Based on the improvements ATI will be making, I'm not doubting at all that the difference in the near future between both will be much smaller than today, for tessellation performance.
Last edited: