1 |
On Saturday 23 May 2015 23:53:32 Joseph wrote: |
2 |
> On 05/23/15 18:08, Zhu Sha Zang wrote: |
3 |
> >On 05/23/2015 05:24 PM, Joseph wrote: |
4 |
> >> I have a box in a remote location (8-core CPU) and it turn itself off |
5 |
> >> during compiling |
6 |
> >> |
7 |
> >> The box it connected to UPS. Is it power supply? |
8 |
> > |
9 |
> >Maybe. I have a problem like that when using high processing simulation |
10 |
> >with nvidia-cuda and the power supply protection was unable to keep a |
11 |
> >safe energy level then the system goes off. |
12 |
> > |
13 |
> >But, if the failure happens during compilation time can be a heat |
14 |
> >problem. Install lm_sensors and use something like that: "watch -n 1 |
15 |
> >sensors". |
16 |
> > |
17 |
> >If not, if the temperature stay at safe levels, maybe you have a RAM |
18 |
> >corruption. In this case, you'll need to use memtest86++ to check. |
19 |
> > |
20 |
> >Good Luck |
21 |
> |
22 |
> I tried to read the lm-sensors again and the compupter turn crash with the |
23 |
> readings: |
24 |
> |
25 |
> fan1: 0 RPM (min = 10 RPM) ALARM |
26 |
> fan2: 0 RPM (min = 0 RPM) |
27 |
> fan3: 0 RPM (min = 0 RPM) |
28 |
> fan5: 0 RPM (min = 0 RPM) |
29 |
> temp1: +47.0°C (low = +127.0°C, high = +127.0°C) sensor = |
30 |
> thermistor temp2: +106.0°C (low = +127.0°C, high = +70.0°C) |
31 |
> sensor = thermal diode temp3: +106.0°C (low = +127.0°C, high = |
32 |
> +127.0°C) sensor = thermistor cpu0_vid: +1.250 V |
33 |
> |
34 |
> I'm suspecting it is power supply. |
35 |
|
36 |
I wouldn't trust these numbers. You probably need a different/correct driver |
37 |
in your kernel to measure your CPU and MoBo chipset readings and/or a later |
38 |
BIOS firmware. |
39 |
|
40 |
Whenever I had such problems they were down to bad memory (some PCs are rather |
41 |
particular in only accepting matching memory modules) and also down to a sick |
42 |
power supply. |
43 |
|
44 |
Memetest86+ should tell you after some hours if something is amiss. |
45 |
|
46 |
The power supply problem will require opening it up and checking for domed |
47 |
capacitors. A few cents later and with a soldering iron in hand you should be |
48 |
able to fix any cheap capacitor induced failure. |
49 |
|
50 |
Of course if the machine is hundreds of miles away, attending to it is more of |
51 |
a problem. |
52 |
-- |
53 |
Regards, |
54 |
Mick |