This means that thermal throttling is occurring with the CPU. Performance should be constant regardless of the TIM used. If it isn't then the TIM is not doing it's job and failing.
Basically when you have more or less similar GFlops values in IBT/LinX for both TIMs, you are ensuring that cpu is processing/executing
Guassian Elimination equations at similar rate.
(Any one who has done maths/engineering etc will know what Guassian elimination equations look like which are algebraic simultaneous equations solved by making use of matrices)
Lower GFlops values mean that cpu is processing/executing at a slower rate so the coretemps will also be lower. Likewise higher GFlops values mean cpu is processing/executing at faster rate hence higher coretemps. You can even ask fellow ocuker
C64 and he will confirm it. TIMs won't make much difference whether they are good or not because you are getting different GFlops values in different tests and your TIM results will be inconclusive as the dominant baseline factor is the cpu execution rate which will overshadow the effectiveness of TIMs.
So with similar GFlops in all tests you are ensuring cpu is performing at similar rate. Since this was the major variable which has been now controlled, then you can proceed to make note of temps. Then you can see which TIM is helping to reduce temps.
For example
Invalid Test
MX-3 or any other TIM: 33-35 GFlops in IBT/LinX
Average Coretemp = 58C
IC Diamond: 40-42GFlops in IBT/LinX
Average Coretemp = 66C
Valid Test
MX-3 or any other TIM: 40-42 GFlops in IBT/LinX
Average Coretemp = 68C
IC Diamond: 40-42GFlops in IBT/LinX
Average Coretemp = 66C
As you can see that in the
invalid test your GFlops were different for both TIMs and hence there was also a large difference between coretemp values. So any person not familiar with the LinPack (IBT/LinX) tests would have assumed that MX-3 gave a reduction of 8C temp on average compared to IC Diamond totally ignoring the GFlops values which actually played the major role.
However in the
valid test with similar range of GFlops values the results were much closer for both TIMs and hence
consistency was achieved. Not saying that IC Diamond is better than MX-3 or vice versa as this is what we trying to prove anyway on this thread
but you do get an idea.
It's sort of like with prime95 if you use IC Diamond for blend test and MX-3 etc for small FFTs test and then compare the results, they will be invalid since you are using two different test for two different TIMs. So if you use small FFts for both TIMs then you can compare temps results which will be valid for comparison purposes.
The only difference is that major variable factor in prime95 is the nature of the test whereas in Intel Burn Test/LinX it is the GFlops values along with amount of RAM used which should also be same for all test to ensure the cpu is executing same amount of equations and hence the loading time is similar
.