But as I was saying before... measuring CPU usage for a process is not an exact science. Technically whichever thread is using the CPU at a point in time already has 100% of the CPU's attention. Obviously over longer periods of time time slicing may impact that figure slightly because no single process can genuinely have 100% CPU time...
The only time a process shows up on Task Manager as using 99% is if it has been working the CPU almost flat-out for 1 second (or whatever the refresh rate of TM is set to)...
I'm not sure how else to explain it...
The more clock cycles a CPU can perform in a single second, then generally, the faster it is. (ignoring the whole P4 NetBurst vs Athlon 64 issue)
So let's say a CPU can do 1000 clock cycles in 1 second (this would be a incredibly slow CPU by the way
)...
If a task requires 500 of those clock cycles to perform its task. Then that means 0.5 seconds of CPU time is required to complete that task.
Now let's say another CPU can do 2000 clock cycles in 1 second (still an ultra slow CPU
)...
Now if we execute that same task (500 clock cycles) then this better CPU would complete the task in 0.25 seconds.
Surely this clearly shows that even though neither CPU was maxxed out that they still perform tasks quicker?
I'm really not sure how else this can be explained!
PS: If you have a copy of the Intel x86 manual it is actually possible to calculate how many clock cycles an set of assembly instructions needs to execute. Most instructions are 1 clock cycle... but some more complex instructions are 2, 4 or sometimes even more clock cycles. Many SSE instructions for instance can sometimes need about 30 clock cycles... A lot of the tweaks to the Intel Core 2 Duo were in optimising the number of clock cycles needed to perform these more complex instructions. As a result C2D chips can do 64-bit algebra in 1 clock cycle, 128-bit algebra in 2 clock cycles etc...
The only time a process shows up on Task Manager as using 99% is if it has been working the CPU almost flat-out for 1 second (or whatever the refresh rate of TM is set to)...
I'm not sure how else to explain it...
The more clock cycles a CPU can perform in a single second, then generally, the faster it is. (ignoring the whole P4 NetBurst vs Athlon 64 issue)
So let's say a CPU can do 1000 clock cycles in 1 second (this would be a incredibly slow CPU by the way

If a task requires 500 of those clock cycles to perform its task. Then that means 0.5 seconds of CPU time is required to complete that task.
Now let's say another CPU can do 2000 clock cycles in 1 second (still an ultra slow CPU

Now if we execute that same task (500 clock cycles) then this better CPU would complete the task in 0.25 seconds.
Surely this clearly shows that even though neither CPU was maxxed out that they still perform tasks quicker?
I'm really not sure how else this can be explained!
PS: If you have a copy of the Intel x86 manual it is actually possible to calculate how many clock cycles an set of assembly instructions needs to execute. Most instructions are 1 clock cycle... but some more complex instructions are 2, 4 or sometimes even more clock cycles. Many SSE instructions for instance can sometimes need about 30 clock cycles... A lot of the tweaks to the Intel Core 2 Duo were in optimising the number of clock cycles needed to perform these more complex instructions. As a result C2D chips can do 64-bit algebra in 1 clock cycle, 128-bit algebra in 2 clock cycles etc...