Gentoo Archives: gentoo-amd64

From: Deedra Waters <dmwaters@g.o>
To: gentoo-amd64@l.g.o
Subject: Re: [gentoo-amd64] mce log errors
Date: Wed, 07 Dec 2005 05:46:56
Message-Id: Pine.LNX.4.64.0512062343090.6137@monster
In Reply to: Re: [gentoo-amd64] mce log errors by Deedra Waters
1 Hrm, it doesn't look like a heat issue. I opened the case and took a
2 look inside when I saw the last message, and it looks perfectly cool and
3 happy.
4
5 I do notice, however, that it only happens when I hammer the RAID
6 array, which is on a PCI Promise controller.
7
8 On Tue, 6 Dec 2005, Deedra Waters wrote:
9
10 > Date: Tue, 6 Dec 2005 22:04:50 -0600 (CST)
11 > From: Deedra Waters <dmwaters@g.o>
12 > Reply-To: gentoo-amd64@l.g.o
13 > To: gentoo-amd64@l.g.o
14 > Subject: Re: [gentoo-amd64] mce log errors
15 >
16 > Is there a way to test that? I've tried to work with lm_sensors,
17 > but its readings are way, way off. So, considering lm_sensors
18 > is useless, is there another way to tell if overheating is the problem?
19 >
20 > The case itself has a lot of fans, but it's also got 5 hard drives in it.
21 > On Tue, 6 Dec 2005, Daniel Gryniewicz wrote:
22 >
23 > > Date: Tue, 06 Dec 2005 18:39:48 -0500
24 > > From: Daniel Gryniewicz <dang@g.o>
25 > > Reply-To: gentoo-amd64@l.g.o
26 > > To: gentoo-amd64@l.g.o
27 > > Subject: Re: [gentoo-amd64] mce log errors
28 > >
29 > > On Tue, 2005-12-06 at 14:56 -0600, Deedra Waters wrote:
30 > > > All,
31 > > >
32 > > > I'm getting a lot of these, but it only seems to happen when I put the
33 > > > machine under a lot of stress, and even then it's not always happening.
34 > > > This machine is a dual Opteron 242, the board is an Asus K8, and with
35 > > > the latest BIOS update, the machine has no real problems at all.
36 > > >
37 > > > MCE 1
38 > > > CPU 0 4 northbridge TSC 8f1a7b270b6f
39 > > > ADDR 75c3320
40 > > > Northbridge ECC error
41 > > > ECC syndrome = 62
42 > > > bit32 = err cpu0
43 > > > bit46 = corrected ecc error
44 > > > bus error 'local node origin, request didn't time out
45 > > > generic read mem transaction
46 > > > memory access, level generic'
47 > > > STATUS 9431400100000813 MCGSTATUS 0
48 > > > MCE 2
49 > > > CPU 0 2 bus unit TSC 8f8ad2325db7
50 > > > L2 cache ECC error
51 > > > Bus or cache array error
52 > > > bit46 = corrected ecc error
53 > > > bit62 = error overflow (multiple errors)
54 > > > bus error 'local node origin, request didn't time out
55 > > > prefetch mem transaction
56 > > > memory access, level generic'
57 > > > STATUS d000400000000863 MCGSTATUS 0
58 > >
59 > > CPU cache is getting ECC errors. Smells like overheating.
60 > >
61 > > Daniel
62 > >
63 >
64 >
65
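For anyone who wants to cross-check the quoted log by hand, the STATUS words can be split into their bit fields with a few shifts and masks. The following is only a rough sketch: the bit positions (valid = 63, overflow = 62, uncorrected = 61, syndrome = 54:47, corrected ECC = 46, bus error code = 0000 1PPT RRRR IILL in the low 16 bits) are my reading of the K8 machine-check status layout, checked only against the two entries quoted above. The function name decode_status and the lookup tables are made up for illustration and are not part of mcelog.

```python
# Sketch: decode an AMD K8 machine-check STATUS word by hand.
# Bit positions are assumptions, verified only against the two
# STATUS values from the log quoted in this thread.

PARTICIPATION = ["local node origin", "local node response",
                 "local node observed", "generic"]
MEM_TRANSACTION = ["generic error", "generic read", "generic write",
                   "data read", "data write", "instruction fetch",
                   "prefetch", "evict", "snoop"]
MEM_OR_IO = ["memory access", "reserved", "i/o access", "generic"]
CACHE_LEVEL = ["level 0", "level 1", "level 2", "level generic"]


def decode_status(status):
    """Return a dict with the fields this sketch knows how to extract."""
    code = status & 0xFFFF  # low 16 bits hold the error code
    fields = {
        "valid": bool((status >> 63) & 1),
        "overflow": bool((status >> 62) & 1),       # multiple errors
        "uncorrected": bool((status >> 61) & 1),
        "corrected_ecc": bool((status >> 46) & 1),
        # Syndrome is only meaningful for the northbridge ECC entry.
        "syndrome": (status >> 47) & 0xFF,
        "error_code": code,
    }
    # Bus errors follow the pattern 0000 1PPT RRRR IILL.
    if (code & 0xF800) == 0x0800:
        rrrr = (code >> 4) & 0xF
        fields["participation"] = PARTICIPATION[(code >> 9) & 0x3]
        fields["timed_out"] = bool((code >> 8) & 1)
        fields["transaction"] = (MEM_TRANSACTION[rrrr]
                                 if rrrr < len(MEM_TRANSACTION)
                                 else "reserved")
        fields["access"] = MEM_OR_IO[(code >> 2) & 0x3]
        fields["level"] = CACHE_LEVEL[code & 0x3]
    return fields


if __name__ == "__main__":
    # The two STATUS values from the quoted log.
    for status in (0x9431400100000813, 0xD000400000000863):
        print(hex(status))
        for name, value in decode_status(status).items():
            print("   ", name, "=", value)
```

Both entries decode with the corrected-ECC bit set and the uncorrected bit clear, which agrees with the "corrected ecc error" lines in the log. Only the second entry has the overflow bit set.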
66 --
67 Deedra Waters - Gentoo developer relations, accessibility and infrastructure -
68 dmwaters@g.o
69 Gentoo linux: http://www.gentoo.org
70
71 --
72 gentoo-amd64@g.o mailing list
