On 05/23/2015 06:53 PM, Joseph wrote:
> On 05/23/15 18:08, Zhu Sha Zang wrote:
>> On 05/23/2015 05:24 PM, Joseph wrote:
>>> I have a box in a remote location (8-core CPU) and it turns itself off
>>> during compiling.
>>>
>>> The box is connected to a UPS. Is it the power supply?
>>>
>>
>> Maybe. I had a problem like that when running heavy simulations
>> with nvidia-cuda: the power supply protection was unable to keep a
>> safe energy level, and the system went off.
>>
>> But if the failure happens during compilation, it can be a heat
>> problem. Install lm_sensors and use something like this: "watch -n 1
>> sensors".
>>
>> If not, if the temperature stays at safe levels, maybe you have RAM
>> corruption. In that case, you'll need to use memtest86+ to check.
>>
>> Good Luck
>
> I tried reading lm-sensors again and the computer crashed with
> the readings:
>
> fan1: 0 RPM (min = 10 RPM) ALARM
> fan2: 0 RPM (min = 0 RPM)
> fan3: 0 RPM (min = 0 RPM)
> fan5: 0 RPM (min = 0 RPM)
> temp1: +47.0°C (low = +127.0°C, high = +127.0°C) sensor = thermistor
> temp2: +106.0°C (low = +127.0°C, high = +70.0°C) sensor = thermal diode
> temp3: +106.0°C (low = +127.0°C, high = +127.0°C) sensor = thermistor
> cpu0_vid: +1.250 V
>
> I suspect it is the power supply.
>
|
Hey, did you run "sensors-detect" and "/etc/init.d/lm_sensors start" as root
before using "sensors"?

As was said, maybe you're using the wrong kernel modules.
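
To check whether the readings themselves look trustworthy, here is a rough
sketch of a filter for the "sensors" output. The function name check_sensors
and the MAX_TEMP threshold (100 by default) are my own assumptions, not part
of lm_sensors. It flags lines marked ALARM, temperature lines whose low limit
is above the high limit (like temp2 in your output), and readings at or above
the threshold. Limits like that can be a hint that the chip is unconfigured
or the wrong module is loaded, but that is only a guess.

```shell
#!/bin/sh
# check_sensors: read `sensors` output on stdin and flag suspicious lines.
# Hypothetical helper; MAX_TEMP is an assumed plausibility threshold in °C.
check_sensors() {
    awk -v max="${MAX_TEMP:-100}" '
    {
        # Split "name: rest" at the first colon; skip lines without one.
        colon = index($0, ":")
        if (colon == 0) next
        name = substr($0, 1, colon - 1)
        rest = substr($0, colon + 1)
    }
    # The sensors tool itself marks out-of-range values with ALARM.
    rest ~ /ALARM/ { print name ": ALARM flag set" }
    # Temperature lines that list a low limit followed by a high limit.
    name ~ /^temp/ && rest ~ /low.*high/ {
        nums = rest
        # Keep only numeric tokens: reading, low limit, high limit.
        gsub(/[^0-9.+-]+/, " ", nums)
        split(nums, v, " ")
        cur = v[1] + 0; low = v[2] + 0; high = v[3] + 0
        if (low > high)
            print name ": low limit " low " is above high limit " high
        if (cur >= max + 0)
            print name ": reading " cur " is at or above " max
    }'
}
```

Once lm_sensors is running, something like "sensors | check_sensors" should
print one line per suspicious reading and nothing if all looks sane.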
|
Regards