These days I generally do 10-20 runs of IBT/LinX, then a 1-2 hour Prime blend, then move on to serious testing.
What I class as serious testing is 24-48 hours of folding. If it can do that without crashing the folding client or throwing a BSOD, then it's stable.
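For anyone who likes to keep track of where they are in that routine, here is a rough sketch of it as a checklist in Python. The `Stage` class, the `PLAN` list and the `next_stage` function are all names made up for illustration, and nothing here launches IBT, Prime95 or the folding client. It only records the numbers above and tells you which stage is still outstanding.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Stage:
    name: str
    minimum: float  # lower bound quoted in the post
    maximum: float  # upper bound quoted in the post
    unit: str       # "runs" or "hours"


# Stages in the order described above (hypothetical structure, not a real tool).
PLAN = [
    Stage("IBT/LinX", 10, 20, "runs"),
    Stage("Prime blend", 1, 2, "hours"),
    Stage("Folding", 24, 48, "hours"),
]


def next_stage(completed: Dict[str, float]) -> Optional[Stage]:
    """Return the first stage that has not reached its minimum, or None if all have."""
    for stage in PLAN:
        if completed.get(stage.name, 0) < stage.minimum:
            return stage
    return None


if __name__ == "__main__":
    # Example: IBT/LinX and the Prime blend are done, folding has not started.
    progress = {"IBT/LinX": 20, "Prime blend": 1.5}
    stage = next_stage(progress)
    print(stage.name if stage else "passed every stage")
```

You would fill in `progress` by hand after each test passes without errors. A stage only counts once it reaches the lower bound, so the upper bounds are there for reference only.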
The reason I do that is that my overclocks were generally always Prime stable, but were sometimes flaky when it came to folding. So I now use folding as my main stability test, as that is what my computer is used for, well, that and gaming.
But as others have said, stress testing is only an indication. You can never truly call any PC 100% stable: we use too many different bits of software these days, and even if the hardware is stable, a software fault can still cause a crash.
As for the original question, I would say 20 runs on high/max is the first step towards stability. If it passes that, it's still always a good idea to do a long Prime blend run as well. But it really all depends on how much time you want to take.