Stability testing

Soldato
Joined
25 Nov 2005
Posts
12,711
I've spent hours upon hours while running tests on my OC looking for definitive answers on what counts as a valid stability test and there seems to be no consensus.

There's people who swear by 24hour runs of Prime

Others will say Prime is to be avoided (especially for Haswells) and that tests like RealBench and X264 are better for showing stability

There's folk who say run Prime for an hour doing custom at 1344 FFT, others will say 10 runs in IBT on max will usually root out a dodgy OC

There's people who have said they could run Prime for 24 hours but crash running x264 and also the opposite.

So what is valid stability ?

I can run Prime on custom at 1344FFT, done 40 runs of IBT, ran x264 overnight without issue BUT I tried running the CPU:OCCT and it crashed after 10 minutes, however CPU:Linpack with AVX works flawlessly. Likewise Prime on custom with min 8FFT to max 4096FFT also yields a crash.

I'm on Haswell and I'm running close to my limit on temps, IBT, Prime, OCCT all push the limit to near 90 degrees so I'm wondering if it's more the temp causing instability than the actual voltage (even though Prime @ 1344 and IBT work fine) all the other tests don't really heat up my core and I hover anywhere from 60 to 70 depending on the test (currently running Asus Realbench bouncing from 68 to 70)

I know it probably comes down to personal opinion but if it can't run everything then surely it's not stable ? Or are we to not include the absolute extremes we find in Prime/IBT/OCCT simply because they are absolute extremes and nowhere close to realistic usage (unless you happen to enjoy using your pc to calculate extreme prime numbers) ?
 
You'll do well not to open a can of worms with that question.

Depends on the workload and the CPU architecture. Personally for CPU stability on newer platforms RealBench is sufficient assuming you're not entering arbitrary values hoping things work. IBT is a fairly weak test of stability by today's standards. The trouble is, and this is where arguments arise, not many people fully understand the implications of pushing OCCT and AVX versions of Prime for hours on modern CPUs. On the HEDT SKUs, you're talking 300 to 400w of current in some cases. It's a free country, but if you want to subject your CPU to that then there will be some degradation.

Memory stress testing comes down to two applications currently which are Google Stress App test, and HCI Memtest. Both good in their own right with the latter being better at catching errors with cache instability. AIDA64 is also good at finding cache instability.

Short answer, there isn't a magic bullet here, a pragmatic approach is needed depending on the platform - and simply use the machine beyond initial stressing.

This is the thing that gets me though I can run Prime95 on the majority settings without any problem, been running blend for the past 3 hours with no issue, yet if I do custom on 8k FFT to 4096k FFT for some reason it causes a crash at the second test, while Blend which basically does the same thing as that custom setting runs without problems.

So where do you draw the line on stability ? How does say @8pack do his stability tests for his super CPU's OCUK offers for sale there really should be some universal baseline amongst the community for what stable really means.

It's difficult for me to say my system is stable because that one single custom mode in Prime95 but if I ignore that my system IS stable, I guess it's like saying your house is water proof until the ground around it floods ? That one test being the flood while all the others are just like rain.

Thanks for mentioning Google Stress App too, not heard of it before so I'll be giving it a run in
 
The custom run is likely hitting the cache slightly harder. Like I say, there isn't a magic bullet here or a definitive answer. Anywhere where there is AVX involved, you have to acknowledge that if you want to overclock - concessions may need to be made. That's why Intel introduced the AVX offset.

Do you think it could be voltage for the vring then ? I'm running uncore at 35 and Vring at 1.1 (although my UD4H automatically boosts uncore to 40 to catch up with the 44 core mulitplier)

Same with CPU:OCCT it's not stable which could be same issue as that one Prime custom issue, while CPU:Linpack in OCCT with AVX works fine, I haven't been able to actually find any information on what the difference between the 2 tests in OCCT is though so I can't really troubleshoot as I don't understand what is being pushed more in the 2 tests.

Ideally I want to get the core stable before I attempt going 1:1 on uncore (doubt it will happen) but if that's what's causing the issue then I'll have to tweak and technically if I ignore that one Prime custom and OCCT it is stable and has cool temps (bar prime/ibt temps)
 
Possibly vring, but pushing this rail is what will degrade your CPU the quickest.... I've seen a handful of users do this with no more than 1.2v in Prime, sometimes less if it's run for long enough.

Just use the machine for what you need to use it for, or be another user that's pushed theirs till the gates are the width of a motorway.

I just wish I knew what the issue was that was causing the crash instead of being stuck with multiple choice lol

If it's temps I'm screwed because I can't push voltage higher for Prime
If it's voltage I'm screwed because I can't push the temps higher for Prime
If it's the Vring I "could" go 1.2v but like you say degradation becomes an issue so is it really worth it ? (I have got a good 4 years out of the chip already though)

It's interesting to see all the answers though as everybody has a different benchmark for stability

I do none avx prime with 3d running at the same time to stress all controllers, Xtu with the same and real bench for a gaming pc.

Hci for memory.

For a workstation I will tailer testing to the company I am supplying them to. So if they are running avx etc.

The best way in all instances is everyday use.

How long do you run your tests for ? 24 hours or less/more ?
 
When it comes to GPU stability what would you guys recommend ?

I know of Heaven, Valley and 3DMark but are there any other good bench/stress testing tools focusing around GPU or the full works of RAM,CPU,GPU ?
 
Back
Top Bottom