1 |
Hrm, it doesn't look to be a heat issue. I opened the case and took a |
2 |
look inside when i saw the last message,and it looks perfectly cool and |
3 |
happy. |
4 |
|
5 |
I do notice however, that it's only happening when i hammer the raid |
6 |
array, which is on a pci promise controler. |
7 |
|
8 |
On Tue, 6 Dec 2005, Deedra Waters wrote: |
9 |
|
10 |
> Date: Tue, 6 Dec 2005 22:04:50 -0600 (CST) |
11 |
> From: Deedra Waters <dmwaters@g.o> |
12 |
> Reply-To: gentoo-amd64@l.g.o |
13 |
> To: gentoo-amd64@l.g.o |
14 |
> Subject: Re: [gentoo-amd64] mce log errors |
15 |
> |
16 |
> Is there a way to test that fact? I've tried to work with lm_sensors, |
17 |
> but the readings for that are way way off. So, considering lm_sensors |
18 |
> isuseless is there another way to tell if overheating is the problem? |
19 |
> |
20 |
> The case itself has a lot of fans, but it's also got 5 harddrives in it. |
21 |
> On Tue, 6 Dec 2005, Daniel Gryniewicz wrote: |
22 |
> |
23 |
> > Date: Tue, 06 Dec 2005 18:39:48 -0500 |
24 |
> > From: Daniel Gryniewicz <dang@g.o> |
25 |
> > Reply-To: gentoo-amd64@l.g.o |
26 |
> > To: gentoo-amd64@l.g.o |
27 |
> > Subject: Re: [gentoo-amd64] mce log errors |
28 |
> > |
29 |
> > On Tue, 2005-12-06 at 14:56 -0600, Deedra Waters wrote: |
30 |
> > > All, |
31 |
> > > |
32 |
> > > I'm getting a lot of these, but it only seems to happen when i put the |
33 |
> > > machine under a lot of stress, and even then it's not always happening. |
34 |
> > > This machine is a duel opteron 242, the board is an asus k8, and with |
35 |
> > > the latest bios update, the machine has no real problems at all. |
36 |
> > > |
37 |
> > > MCE 1 |
38 |
> > > CPU 0 4 northbridge TSC 8f1a7b270b6f |
39 |
> > > ADDR 75c3320 |
40 |
> > > Northbridge ECC error |
41 |
> > > ECC syndrome = 62 |
42 |
> > > bit32 = err cpu0 |
43 |
> > > bit46 = corrected ecc error |
44 |
> > > bus error 'local node origin, request didn't time out |
45 |
> > > generic read mem transaction |
46 |
> > > memory access, level generic' |
47 |
> > > STATUS 9431400100000813 MCGSTATUS 0 |
48 |
> > > MCE 2 |
49 |
> > > CPU 0 2 bus unit TSC 8f8ad2325db7 |
50 |
> > > L2 cache ECC error |
51 |
> > > Bus or cache array error |
52 |
> > > bit46 = corrected ecc error |
53 |
> > > bit62 = error overflow (multiple errors) |
54 |
> > > bus error 'local node origin, request didn't time out |
55 |
> > > prefetch mem transaction |
56 |
> > > memory access, level generic' |
57 |
> > > STATUS d000400000000863 MCGSTATUS 0 |
58 |
> > |
59 |
> > CPU cache is getting ECC errors. Smells like overheating. |
60 |
> > |
61 |
> > Daniel |
62 |
> > |
63 |
> |
64 |
> |
65 |
|
66 |
-- |
67 |
Deedra Waters - Gentoo developer relations, accessibility and infrastructure - |
68 |
dmwaters@g.o |
69 |
Gentoo linux: http://www.gentoo.org |
70 |
|
71 |
-- |
72 |
gentoo-amd64@g.o mailing list |