1 |
Hi there! |
2 |
|
3 |
This week I've tried to setup a home-server, but the system is highly |
4 |
instable. The first symptoms were lots of page allocation errors, which |
5 |
disappeared after setting the internal memory allocator from SLUB to SLAB |
6 |
and increasing the min_free_kbytes in /proc/sys/vm from 8MB to 20MB. |
7 |
|
8 |
The machine is a AMD Athlon64X2 5050e on a asus M3A78-Pro board with 2x2GB |
9 |
RAM. I'm using kernel 2.6.29.4 (vanilla, but the result is the same as |
10 |
using 2.6.29-gentoo-r5), and I also upgraded the board's BIOS to the latest |
11 |
version (which is 0902) |
12 |
|
13 |
But still the system freezes after some hours. It just freezes. Console is |
14 |
dead, no entry in the logs, no network connectivity, even sysrq doesn't |
15 |
seem to do anything. The worst thing is I don't even have an idea what the |
16 |
error could be, and in the rare situations when it crashed and the console |
17 |
was not blanked, I only see the end of a stack trace, and the intresting |
18 |
parts are scrolled out (and I can't scroll back as the console is |
19 |
absolutely dead :-( ) The only button that is still working is the reset |
20 |
button, and after rebooting the log does't tell anything (just ends without |
21 |
any message) |
22 |
|
23 |
I inspected my dmesg-output right after booting more precisely, and I've |
24 |
found some strange entries which could indicate a problem. What do you |
25 |
think about them? |
26 |
|
27 |
[ 0.000000] ACPI Warning (tbfadt-0568): 32/64X length mismatch in |
28 |
Gpe0Block: 64/32 [20081204] |
29 |
[ 0.000000] FADT: X_PM1a_EVT_BLK.bit_width (16) does not match |
30 |
PM1_EVT_LEN (4) |
31 |
... |
32 |
[ 0.000000] 4 Processors exceeds NR_CPUS limit of 2 |
33 |
[ 0.000000] SMP: Allowing 2 CPUs, 0 hotplug CPUs |
34 |
... |
35 |
[ 0.000999] Aperture pointing to e820 RAM. Ignoring. |
36 |
[ 0.000999] Your BIOS doesn't leave a aperture memory hole |
37 |
[ 0.000999] Please enable the IOMMU option in the BIOS setup |
38 |
[ 0.000999] This costs you 64 MB of RAM |
39 |
[ 0.000999] Mapping aperture over 65536 KB of RAM @ 20000000 |
40 |
[ 0.000999] PM: Registered nosave memory: 0000000020000000 - |
41 |
0000000024000000 |
42 |
... |
43 |
[ 0.099055] mtrr: your CPUs had inconsistent fixed MTRR settings |
44 |
[ 0.099059] mtrr: probably your BIOS does not setup all CPUs. |
45 |
[ 0.099116] mtrr: corrected configuration. |
46 |
... |
47 |
[ 0.151260] PCI-DMA: Disabling AGP. |
48 |
[ 0.151260] PCI-DMA: aperture base @ 20000000 size 65536 KB |
49 |
[ 0.151260] PCI-DMA: using GART IOMMU. |
50 |
[ 0.151260] PCI-DMA: Reserving 64MB of IOMMU area in the AGP aperture |
51 |
... |
52 |
[ 0.163241] system 00:09: iomem range 0xfec00000-0xfec00fff has been |
53 |
reserved |
54 |
[ 0.163305] system 00:09: iomem range 0xfee00000-0xfee00fff has been |
55 |
reserved |
56 |
[ 0.163365] system 00:0a: ioport range 0x4d0-0x4d1 has been reserved |
57 |
[ 0.163422] system 00:0a: ioport range 0x40b-0x40b has been reserved |
58 |
[ 0.163480] system 00:0a: ioport range 0x4d6-0x4d6 has been reserved |
59 |
[ 0.163537] system 00:0a: ioport range 0xc00-0xc01 has been reserved |
60 |
[ 0.163595] system 00:0a: ioport range 0xc14-0xc14 has been reserved |
61 |
[ 0.163653] system 00:0a: ioport range 0xc50-0xc51 has been reserved |
62 |
[ 0.163711] system 00:0a: ioport range 0xc52-0xc52 has been reserved |
63 |
[ 0.163769] system 00:0a: ioport range 0xc6c-0xc6c has been reserved |
64 |
[ 0.163827] system 00:0a: ioport range 0xc6f-0xc6f has been reserved |
65 |
[ 0.163885] system 00:0a: ioport range 0xcd0-0xcd1 has been reserved |
66 |
[ 0.163942] system 00:0a: ioport range 0xcd2-0xcd3 has been reserved |
67 |
[ 0.163999] system 00:0a: ioport range 0xcd4-0xcd5 has been reserved |
68 |
[ 0.164070] system 00:0a: ioport range 0xcd6-0xcd7 has been reserved |
69 |
[ 0.164127] system 00:0a: ioport range 0xcd8-0xcdf has been reserved |
70 |
[ 0.164184] system 00:0a: ioport range 0x800-0x89f has been reserved |
71 |
[ 0.164241] system 00:0a: ioport range 0xb00-0xb3f has been reserved |
72 |
[ 0.164305] system 00:0a: ioport range 0x900-0x90f has been reserved |
73 |
[ 0.164363] system 00:0a: ioport range 0x910-0x91f has been reserved |
74 |
[ 0.164421] system 00:0a: ioport range 0xfe00-0xfefe has been reserved |
75 |
[ 0.164480] system 00:0a: iomem range 0xffb80000-0xffbfffff has been |
76 |
reserved |
77 |
[ 0.164538] system 00:0a: iomem range 0xfec10000-0xfec1001f has been |
78 |
reserved |
79 |
[ 0.164598] system 00:0c: ioport range 0xe00-0xe0f has been reserved |
80 |
[ 0.164656] system 00:0c: ioport range 0xe80-0xe8f has been reserved |
81 |
[ 0.164713] system 00:0c: ioport range 0xf40-0xf4f has been reserved |
82 |
[ 0.164771] system 00:0c: ioport range 0xa30-0xa3f has been reserved |
83 |
[ 0.164830] system 00:0d: iomem range 0xe0000000-0xefffffff has been |
84 |
reserved |
85 |
[ 0.164890] system 00:0e: iomem range 0x0-0x9ffff could not be reserved |
86 |
[ 0.164947] system 00:0e: iomem range 0xc0000-0xcffff has been reserved |
87 |
[ 0.165018] system 00:0e: iomem range 0xe0000-0xfffff could not be |
88 |
reserved |
89 |
[ 0.165076] system 00:0e: iomem range 0x100000-0xdfffffff could not be |
90 |
reserved |
91 |
[ 0.165158] system 00:0e: iomem range 0xfec00000-0xffffffff could not be |
92 |
reserved |
93 |
... |
94 |
[ 21.298450] ACPI: I/O resource piix4_smbus [0xb00-0xb07] conflicts with |
95 |
ACPI region SOR1 [0xb00-0xb0f] |
96 |
[ 21.298454] ACPI: Device needs an ACPI driver |
97 |
[ 21.298461] piix4_smbus 0000:00:14.0: SMBus Host Controller at 0xb00, |
98 |
revision 0 |
99 |
... |
100 |
[ 73.861479] ACPI: I/O resource it87 [0xe85-0xe86] conflicts with ACPI |
101 |
region HWRE [0xe85-0xe86] |
102 |
[ 73.861483] ACPI: Device needs an ACPI driver |
103 |
|
104 |
Whats does this message "4 Processors exceeds NR_CPUS" say? the system is a |
105 |
Dual-Core AMD Athlon64 5050e, AFAIK it has two cores and nothing more. The |
106 |
mttr-Message later also indicate that there could be more than 2 CPUs |
107 |
available. wondering... |
108 |
|
109 |
The next thing which seems somewhat strange to me is the AGP aperture and |
110 |
the IOMMU. The Mainboard does not have an AGP port, nor does the bios have |
111 |
any option to enable. The only thing I can set is the size of the memory |
112 |
reservered for the onboad video card, which I set to the smallest value of |
113 |
32MB as the machine will usually not even have a display. |
114 |
|
115 |
The iomem-range reservation errors at the end? Harmful or not? |
116 |
|
117 |
The last messages come after loading the hw-sensors modules it87.ko and |
118 |
i2c_piix4. |
119 |
|
120 |
Thanks in advance for suggestions |
121 |
Alex |