Help with diagnosing crash problems

Associate
Joined
30 May 2018
Posts
3
Hello

I have an intermittent issue with 3 computers all HP z260, and I have been trying to solve it for months.
I am running the machines as headless render nodes for 3D software (Cinema 4D) - more or less at 100% load.


Computer Specs

All computers HP Z260, Windows 10 Pro 64 bit, 32 GB Ram, 1TB HDD, entry level Nvidia GPU,

CPUs have changed

Original setup
Computer 1: dual Xeon E5 2660 V1
Computer 2: dual Xeon E5 2680 V1
Computer 3: dual Xeon E5 2690 V2

New setup -
All CPU's changed to dual E5 2690 V2


Problem


There are two main symptoms I would like to diagnose,

1) two of the computers have shown signs of intermittently getting tired after a couple of hours of working, by this I mean that the time of completing roughly identical tasks (rendering very similar frames) exponentially gets slower. In the most extreme example the render time went from 4 mins to 24 mins.

The fix for this is simply to reboot the software. However according to the software manufacturer and it's associated forum, this is very unusual behaviour and should not happen.

2) all 3 of the computers suffer from intermittent software crashes, in which the entire system seizes to a halt and then the software stops responding.

I am using dual Xeon machines, which were bought refurbished from a reputable UK sales outlet. The hardware team tell me that these processors should be able to run constantly without the need to reboot.

The Cinema 4D forum suggests that there has been some issues with CPU's reaching max temp and then causing the application to crash. The suggested fix for this is to lower the clock speed to lower the temperature.

However I have been closely monitoring temperatures using intel power gadget and note that none of the temperatures on the machines go anywhere near the max temp. Usually around 65degrees per unit.


Questions

so, the hardware people say the software is the problem and the software people say the hardware is the problem.

It should be noted that I also have an iMac with exactly the same build of software installed and it has never crashed even when running for several days. It uses an i7 6700

1. I have a hunch that there could be an issue with the HDD's in the machines during the read/write process?
2. Could be the RAM? although this never seems to exceed 60% according to task manager
3. Do I need to reboot the machines regularly or should they be able to run for longer? The iMac is usually on constantly.

Any help would be amazing, thank you
 
Also - assuming parts are functioning properly. Should these machines be able to run at 100% load constantly or am I expecting too much. I bought them on the basis they are server chips and supposedly can take heavy performance, but as I say are being outstripped by a 4 ghz quad i7 for stability.
 
Back
Top Bottom