Gentoo Archives: gentoo-server

From: welz@×××××××××.de
To: gentoo-server@l.g.o
Subject: Re: [gentoo-server] SMART attributes: dead hard disk ?
Date: Mon, 30 Oct 2006 22:22:02
Message-Id: 54061.192.168.2.198.1162246660.squirrel@192.168.3.211
In Reply to: Re: [gentoo-server] SMART attributes: dead hard disk ? by Ben Munat
1 Hi.
2
3 Probably you should reduce the watched attributes.
4 You must tweak a bit, but yes, the man page is overwhelming...
5
6 My smartd.conf looks as follows:
7
8 /dev/hdb -m sysadmin@×××××××××.de,test@×××××××××.de \
9 -M exec /root/bin/Scripts/smartmon_mail -M test -M daily -o on -S on -H \
10 -s (S/../.././18|L/../01/./03)
11
12 /dev/twe0 -d 3ware,1 -m sysadmin@×××××××××.de,test@×××××××××.de \
13 -M exec /root/bin/Scripts/smartmon_mail -M test -M daily -o on -S on -H \
14 -s (S/../.././19|L/../01/./05) -p -R 194
15
16 /dev/twe0 -d 3ware,2 -m sysadmin@×××××××××.de,test@×××××××××.de \
17 -M exec /root/bin/Scripts/smartmon_mail -M test -M daily -o on -S on -H \
18 -s (S/../.././20|L/../01/./03) -p -R 194
19
20 and smartmon_mail is:
21
22 #! /bin/bash
23 my_hostname=`/usr/bin/hostname`
24
25 /usr/bin/echo -e "Subject: $2\n\n$SMARTD_FULLMESSAGE\n\nFirst reported:
26 $SMARTD_TFIRST " | /usr/sbin
27 /sendmail -F "smartmon@$my_hostname" sysadmin@×××××××××.de
28
29 /usr/bin/echo -e "Subject: $2\n\n$SMARTD_FULLMESSAGE\n\nFirst reported:
30 $SMARTD_TFIRST " | /usr/sbin
31 /sendmail -F "smartmon@$my_hostname" test@×××××××××.de
32
33
34 > Yeah, I think you're right Peter.
35 >
36 > I've been meaning to look up what the numbers mean in a smartd log
37 > message. I generally see stuff like this:
38 >
39 > Usage: Seek_Time_Performance (8) changed to 91
40 > Usage: Temperature_Celsius (194) changed to 154, 163, 169
41 >
42 > but I certainly hope my drive isn't 169 degrees Celsius!
43 >
44 > It's probably in the man page somewhere, but I've been too lazy to look
45 > it up. I just hope that the smartd devs were nice enough to make the
46 > message of an actual imminent failure quite alarming.
47 >
48 > b
49 >
50 > Peter Abrahamsen wrote:
51 >> I speak on no particular authority, but I think these are normal
52 >> messages, and nothing to be too concerned about.
53 >>
54 >> P
55 >>
56 >> On 10/25/06, Nicolas MASSÉ <nicolas27.masse@×××××××.net> wrote:
57 >>> Hello,
58 >>>
59 >>> I have one RAID 1 array with two ATA hard disks. I noticed a lots of
60 >>> messages
61 >>> from smartd in my logs.
62 >>>
63 >>> First, the CRC error count of hdc "goes up" and then "down". What do
64 >>> you think
65 >>> of this ?
66 >>>
67 >>> SMART Usage Attribute: 199 UDMA_CRC_Error_Count changed from 195 to 199
68 >>> SMART Usage Attribute: 199 UDMA_CRC_Error_Count changed from 199 to 198
69 >>>
70 >>> Then, I have some messages regarding my first disk (hda) :
71 >>>
72 >>> SMART Prefailure Attribute: 8 Seek_Time_Performance changed from 242
73 >>> to 245
74 >>>
75 >>> This message is repeated about ten times a day.
76 >>>
77 >>> Do you think my disks are going to die in the next days or is smartd
78 >>> confused ?
79 >>>
80 >>> Regards,
81 >>>
82 >>> --
83 >>> Nicolas MASSÉ
84 >>> Pour récupérer ma clef GPG:
85 >>> gpg --keyserver wwwkeys.eu.pgp.net --recv-keys 0x2A18C433
86 >>> Key fingerprint: 6621 FC23 5DC7 54BA B952 316A 50B1 BC3F 2A18 C433
87 >>>
88 >>>
89 >>>
90 >>
91 > --
92 > gentoo-server@g.o mailing list
93 >
94 >
95 >
96
97
98 --
99 gentoo-server@g.o mailing list