Looking at the module layout diagrams, it looks to me like a single thread running alone on a module will have both of the floating-point units and double the L2 cache available to play with. So it would make sense that performance on the first core takes a slight hit when running two threads per module versus one. It would also be a fairly sensible design move by AMD, because very few games utilize all of the cores: if they can give cores 1, 3, 5 & 7 priority over 2, 4, 6 & 8, it would result in better performance in practically anything that uses 4 threads or fewer.
Am I right?
No, not really. The only performance hit comes when a single thread requires the full 256 bits of the FPU. At that moment the FPU can only execute for 1 thread.
A module has 2 integer schedulers, and the FPU can perform 2x 128-bit operations at the same time. That's enough for the module to execute 2 threads in parallel.
If most operations are only 128-bit, which they probably are, then BD should have strong processing throughput.
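
To put rough numbers on that, here is a toy Python model of the shared FPU. It is only a sketch of the argument above, not of the real hardware: the function name `fpu_cycles`, the "one op per thread per cycle" rule and the alternating priority are all my own assumptions, and it ignores latency, the integer side and everything else in the pipeline.

```python
from collections import deque

# Assumption: the module's shared FPU offers two 128-bit pipes per cycle.
PIPES_PER_CYCLE = 2


def fpu_cycles(thread_a, thread_b):
    """Count cycles needed to issue all FP ops from two threads.

    Each thread is a list of op widths in bits (128 or 256).
    A 128-bit op takes one pipe, a 256-bit op takes both pipes.
    Assumption: each thread issues at most one op per cycle, and
    first pick alternates between the threads every cycle.
    """
    for width in list(thread_a) + list(thread_b):
        if width not in (128, 256):
            raise ValueError("op width must be 128 or 256 bits")

    queues = [deque(thread_a), deque(thread_b)]
    cycles = 0
    first = 0  # index of the thread that picks first this cycle
    while queues[0] or queues[1]:
        free = PIPES_PER_CYCLE
        for i in (first, 1 - first):
            q = queues[i]
            if q:
                need = q[0] // 128  # pipes this op occupies
                if need <= free:
                    q.popleft()
                    free -= need
        cycles += 1
        first = 1 - first
    return cycles


if __name__ == "__main__":
    # Two threads of 128-bit ops share the FPU without waiting.
    print(fpu_cycles([128] * 4, [128] * 4))
    # Two threads of 256-bit ops have to take turns.
    print(fpu_cycles([256] * 4, [256] * 4))
```

In this model two threads doing only 128-bit ops finish in the same number of cycles as one thread would alone, while two threads doing only 256-bit ops take twice as long, which is the one case where the second thread costs you anything.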


