Hi All
I'm usually OK with this sort of thing and I think I know the problem - just a sanity check/other ideas I may have missed before I go throwing money at the issue.
Old X99 system running Server 2019 has started locking up recently. There have been some changes, which were made at the same time I upgraded (clean install) from Server 2016:
'New' NVME drive
Additional Memory
However, I've temporary changed everything back to how it was before (still had the old SSD with Server 2016 on) and the problem persists.
The biggest issue is the server is headless, and I've not been able to get any errors, stop codes etc out of it, so there's an element of guesswork. Event viewer doesn't record anything at all, so it's some sort of hardware fault I think.
HWInfo has revealed a SMART error, which I'm assuming is the cause of the problem. I'm not great with HDDs so really just wanting to check that this is likely to be what's locking up the server. The error says:
[05] Reallocated Sector Count: 100/10, Worst: 100 (Data = 16,0)
I dont really understand what that means, but as it's the only thing I can find that's wrong, I just want to check it won't be wasting my time replacing the drive (12TB so not cheap).
Cheers!
*edit* forgot one possible key fact - I also swapped out the old 5820k for an engineering sample intel chip (12 core, 24 thread, picked up by CPU-Z as a 'Xeon-2000'). However this was done a few weeks before the OS upgrade and was fine on server 2016, so I don't think that's the issue.
Also just to rule out the other obvious stuff, temperatures are fine, it rarely goes above 35ºC
I'm usually OK with this sort of thing and I think I know the problem - just a sanity check/other ideas I may have missed before I go throwing money at the issue.
Old X99 system running Server 2019 has started locking up recently. There have been some changes, which were made at the same time I upgraded (clean install) from Server 2016:
'New' NVME drive
Additional Memory
However, I've temporary changed everything back to how it was before (still had the old SSD with Server 2016 on) and the problem persists.
The biggest issue is the server is headless, and I've not been able to get any errors, stop codes etc out of it, so there's an element of guesswork. Event viewer doesn't record anything at all, so it's some sort of hardware fault I think.
HWInfo has revealed a SMART error, which I'm assuming is the cause of the problem. I'm not great with HDDs so really just wanting to check that this is likely to be what's locking up the server. The error says:
[05] Reallocated Sector Count: 100/10, Worst: 100 (Data = 16,0)
I dont really understand what that means, but as it's the only thing I can find that's wrong, I just want to check it won't be wasting my time replacing the drive (12TB so not cheap).
Cheers!
*edit* forgot one possible key fact - I also swapped out the old 5820k for an engineering sample intel chip (12 core, 24 thread, picked up by CPU-Z as a 'Xeon-2000'). However this was done a few weeks before the OS upgrade and was fine on server 2016, so I don't think that's the issue.
Also just to rule out the other obvious stuff, temperatures are fine, it rarely goes above 35ºC
Last edited: