Gentoo Archives: gentoo-user

From: "Fatih Tümen" <fthtmn+gentoo@×××××.com>
To: gentoo-user@l.g.o
Subject: [gentoo-user] USB Disk failure - Buffer I/O error on device sda2, logical block 1289 lost page write due to I/O error on sda2
Date: Sat, 09 Oct 2010 19:26:18
Message-Id: AANLkTikqvSABi3Qd0r0=iPKhqnts2ughAZ_3dWd9Q9uA@mail.gmail.com
1 Hi,
2
3 When I ran eix hdf command, all of a sudden my usb drive started
4 making weird noises. I only have ccache, distfiles and packaes
5 directories on sda2, the usb disk partition. I don't know why eix
6 waked up the disk. Eix hung there non-responding for a moment as the
7 disk kept making noises, so I interrupted the task. Immediately
8 checked the logs[0] and umounted the device as a reflex.
9
10 The time 20:38 in [0] corresponds to the time I issues eix. There was
11 nothing in the logs for 8 minutes. The last operation I had on the
12 disk was fetching kde updates which finished at about 19:00.
13
14 I wanted to fsck the device and got:
15 # fsck.ext3 -pvf /dev/sda2
16 fsck.ext3: Attempt to read block from filesystem resulted in short
17 read while trying to open /dev/sda2
18 Could this be a zero-length partition?
19
20 fdisk -l was not showing the device anymore. I could not mounted it back.
21
22 Googling the subject brought some unsolved threads so far. There were
23 some similar issues, one with ext4, one with ipod. But in all those
24 cases they still had their device alive whereas I seem to completely
25 lost it. Here is what smartmontools says:
26
27 # smartctl -d /dev/sda
28 smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
29 Home page is http://smartmontools.sourceforge.net/
30
31 =======> INVALID ARGUMENT TO -d: /dev/sda
32 =======> VALID ARGUMENTS ARE: ata, scsi, marvell, sat, 3ware,N,
33 hpt,L/M/N cciss,N <=======
34
35 Use smartctl -h to get a usage summary
36
37 21:08:23 | log # smartctl -a /dev/sda
38 smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
39 Home page is http://smartmontools.sourceforge.net/
40
41 Device: WD Version: 1.02
42 >> Terminate command early due to bad response to IEC mode page
43 A mandatory SMART command failed: exiting. To continue, add one or
44 more '-T permissive' options.
45 21:08:31 | log # fdisk /dev/sda
46
47 Unable to read /dev/sda
48 21:14:33 | log # ll /dev/sda
49 brw-rw---- 1 root disk 8, 0 Oct 9 21:14 /dev/sda
50
51 I tried to plug it in a ubuntu box on a different machine, the result was same.
52 And this is the messages I get upon plugging[1]. For comparison here
53 is usual message I got last time mounted[2]
54
55 I am not sure what may have caused the issue but there are two things
56 I suspect that may have caused this.
57 1. I usually forget the device is mounted and sda1 partition swappedon
58 while suspending/sleeping the system. I run fsck.ext3 on the
59 complaining dirty partitions.
60 2. I accidentally ran grub-install /dev/sda instead of /dev/hda last
61 week and interrupted immediately. There is no sign of a problem in the
62 logs for a week. I had install grub on sda couple of years ago but
63 hadnt used it for a long time.
64
65 I thought this may be related because interrupting the grub-install
66 left in complete mbr. So I ran grub-install on sda again but it could
67 not find the disk[3].
68
69 I cannot think of anything else to try. I hope you can suggest me a
70 way to recover the device or to extract data from it.
71
72
73 [0]
74 Oct 9 20:30:01 elsewhere cron[3670]: (root) CMD (test -x
75 /usr/sbin/run-crons && /usr/sbin/run-crons )
76 Oct 9 20:38:00 elsewhere kernel: [265616.964879] sd 8:0:0:0: [sda]
77 Result: hostbyte=0x00 driverbyte=0x08
78 Oct 9 20:38:00 elsewhere kernel: [265616.964893] sd 8:0:0:0: [sda]
79 Sense Key : 0x5 [current]
80 Oct 9 20:38:00 elsewhere kernel: [265616.964904] Info fld=0x0
81 Oct 9 20:38:00 elsewhere kernel: [265616.964909] sd 8:0:0:0: [sda]
82 ASC=0x21 ASCQ=0x0
83 Oct 9 20:38:00 elsewhere kernel: [265616.964918] sd 8:0:0:0: [sda]
84 CDB: cdb[0]=0x28: 28 00 00 19 b6 78 00 00 08 00
85 Oct 9 20:38:00 elsewhere kernel: [265616.964938] end_request: I/O
86 error, dev sda, sector 1685112
87 Oct 9 20:38:00 elsewhere kernel: [265616.964972] EXT3-fs error
88 (device sda2): ext3_find_entry: reading directory #8238 offset 0
89 Oct 9 20:38:20 elsewhere kernel: [265637.556176] sd 8:0:0:0: [sda]
90 Result: hostbyte=0x00 driverbyte=0x08
91 Oct 9 20:38:20 elsewhere kernel: [265637.556190] sd 8:0:0:0: [sda]
92 Sense Key : 0x5 [current]
93 Oct 9 20:38:20 elsewhere kernel: [265637.556200] Info fld=0x0
94 Oct 9 20:38:20 elsewhere kernel: [265637.556206] sd 8:0:0:0: [sda]
95 ASC=0x21 ASCQ=0x0
96 Oct 9 20:38:20 elsewhere kernel: [265637.556214] sd 8:0:0:0: [sda]
97 CDB: cdb[0]=0x28: 28 00 00 1d c6 60 00 00 08 00
98 Oct 9 20:38:20 elsewhere kernel: [265637.556234] end_request: I/O
99 error, dev sda, sector 1951328
100 Oct 9 20:38:20 elsewhere kernel: [265637.556281] EXT3-fs error
101 (device sda2): ext3_get_inode_loc: unable to read inode block -
102 inode=24579, block=99331
103 Oct 9 20:38:20 elsewhere kernel: [265637.558297] sd 8:0:0:0: [sda]
104 Result: hostbyte=0x00 driverbyte=0x08
105 Oct 9 20:38:20 elsewhere kernel: [265637.558310] sd 8:0:0:0: [sda]
106 Sense Key : 0x5 [current]
107 Oct 9 20:38:20 elsewhere kernel: [265637.558320] Info fld=0x0
108 Oct 9 20:38:20 elsewhere kernel: [265637.558325] sd 8:0:0:0: [sda]
109 ASC=0x21 ASCQ=0x0
110 Oct 9 20:38:20 elsewhere kernel: [265637.558333] sd 8:0:0:0: [sda]
111 CDB: cdb[0]=0x2a: 2a 00 00 11 a6 48 00 00 08 00
112 Oct 9 20:38:20 elsewhere kernel: [265637.558352] end_request: I/O
113 error, dev sda, sector 1156680
114 Oct 9 20:38:20 elsewhere kernel: [265637.558363] Buffer I/O error on
115 device sda2, logical block 0
116 Oct 9 20:38:20 elsewhere kernel: [265637.558370] lost page write due
117 to I/O error on sda2
118 Oct 9 20:39:25 elsewhere kernel: [265702.325405] sd 8:0:0:0: [sda]
119 Result: hostbyte=0x00 driverbyte=0x08
120 Oct 9 20:39:25 elsewhere kernel: [265702.325419] sd 8:0:0:0: [sda]
121 Sense Key : 0x5 [current]
122 Oct 9 20:39:25 elsewhere kernel: [265702.325430] Info fld=0x0
123 Oct 9 20:39:25 elsewhere kernel: [265702.325435] sd 8:0:0:0: [sda]
124 ASC=0x21 ASCQ=0x0
125 Oct 9 20:39:25 elsewhere kernel: [265702.325444] sd 8:0:0:0: [sda]
126 CDB: cdb[0]=0x2a: 2a 00 00 11 ce 90 00 00 08 00
127 Oct 9 20:39:25 elsewhere kernel: [265702.325464] end_request: I/O
128 error, dev sda, sector 1166992
129 Oct 9 20:39:25 elsewhere kernel: [265702.325475] Buffer I/O error on
130 device sda2, logical block 1289
131 Oct 9 20:39:25 elsewhere kernel: [265702.325482] lost page write due
132 to I/O error on sda2
133 Oct 9 20:39:25 elsewhere kernel: [265702.325533] ------------[ cut
134 here ]------------
135 Oct 9 20:39:25 elsewhere kernel: [265702.325547] WARNING: at
136 fs/buffer.c:1151 mark_buffer_dirty+0x23/0x6b()
137 Oct 9 20:39:25 elsewhere kernel: [265702.325554] Hardware name: Amilo A1640
138 Oct 9 20:39:25 elsewhere kernel: [265702.325563] Pid: 3711, comm:
139 umount Not tainted 2.6.35-gentoo-r4 #1
140 Oct 9 20:39:25 elsewhere kernel: [265702.325569] Call Trace:
141 Oct 9 20:39:25 elsewhere kernel: [265702.325584] [<c1031c61>]
142 warn_slowpath_common+0x60/0x75
143 Oct 9 20:39:25 elsewhere kernel: [265702.325593] [<c10b3012>] ?
144 mark_buffer_dirty+0x23/0x6b
145 Oct 9 20:39:25 elsewhere kernel: [265702.325604] [<c1031c85>]
146 warn_slowpath_null+0xf/0x13
147 Oct 9 20:39:25 elsewhere kernel: [265702.325612] [<c10b3012>]
148 mark_buffer_dirty+0x23/0x6b
149 Oct 9 20:39:25 elsewhere kernel: [265702.325624] [<c1133e7b>]
150 journal_update_superblock+0x62/0xab
151 Oct 9 20:39:25 elsewhere kernel: [265702.325634] [<c1132a38>]
152 cleanup_journal_tail+0xd2/0xdb
153 Oct 9 20:39:25 elsewhere kernel: [265702.325643] [<c1132efa>]
154 log_do_checkpoint+0x41d/0x431
155 Oct 9 20:39:25 elsewhere kernel: [265702.325653] [<c102c6e9>] ?
156 update_curr+0x161/0x169
157 Oct 9 20:39:25 elsewhere kernel: [265702.325663] [<c102c9ed>] ?
158 dequeue_entity+0xb4/0x226
159 Oct 9 20:39:25 elsewhere kernel: [265702.325672] [<c1025c3d>] ?
160 __dequeue_entity+0x23/0x27
161 Oct 9 20:39:25 elsewhere kernel: [265702.325684] [<c100832b>] ?
162 __switch_to_xtra+0xe4/0x101
163 Oct 9 20:39:25 elsewhere kernel: [265702.325696] [<c13f112f>] ?
164 _raw_spin_unlock_irq+0x15/0x20
165 Oct 9 20:39:25 elsewhere kernel: [265702.325706] [<c10297e8>] ?
166 finish_task_switch+0x34/0x52
167 Oct 9 20:39:25 elsewhere kernel: [265702.325717] [<c1024b04>] ?
168 need_resched+0x14/0x1e
169 Oct 9 20:39:25 elsewhere kernel: [265702.325726] [<c13efd4d>] ?
170 schedule+0x4f9/0x509
171 Oct 9 20:39:25 elsewhere kernel: [265702.325737] [<c13f110f>] ?
172 _raw_spin_unlock_irqrestore+0x16/0x21
173 Oct 9 20:39:25 elsewhere kernel: [265702.325747] [<c10424af>] ?
174 spin_unlock_irqrestore+0x8/0xa
175 Oct 9 20:39:25 elsewhere kernel: [265702.325756] [<c1134166>]
176 journal_destroy+0xe3/0x18f
177 Oct 9 20:39:25 elsewhere kernel: [265702.325765] [<c1042441>] ?
178 autoremove_wake_function+0x0/0x2f
179 Oct 9 20:39:25 elsewhere kernel: [265702.325777] [<c1101918>]
180 ext3_put_super+0x3a/0x1e5
181 Oct 9 20:39:25 elsewhere kernel: [265702.325786] [<c109a7c9>]
182 generic_shutdown_super+0x42/0xb3
183 Oct 9 20:39:25 elsewhere kernel: [265702.325795] [<c109a857>]
184 kill_block_super+0x1d/0x31
185 Oct 9 20:39:25 elsewhere kernel: [265702.325803] [<c1099ee7>]
186 deactivate_locked_super+0x1a/0x36
187 Oct 9 20:39:25 elsewhere kernel: [265702.325811] [<c109a29b>]
188 deactivate_super+0x32/0x36
189 Oct 9 20:39:25 elsewhere kernel: [265702.325820] [<c10aa185>]
190 mntput_no_expire+0x85/0xa8
191 Oct 9 20:39:25 elsewhere kernel: [265702.325828] [<c10aa609>]
192 sys_umount+0x26e/0x293
193 Oct 9 20:39:25 elsewhere kernel: [265702.325837] [<c10aa63b>]
194 sys_oldumount+0xd/0xf
195 Oct 9 20:39:25 elsewhere kernel: [265702.325845] [<c100270c>]
196 sysenter_do_call+0x12/0x22
197 Oct 9 20:39:25 elsewhere kernel: [265702.325852] ---[ end trace
198 113c73c40a67cbb5 ]---
199 Oct 9 20:39:25 elsewhere kernel: [265702.327279] sd 8:0:0:0: [sda]
200 Result: hostbyte=0x00 driverbyte=0x08
201 Oct 9 20:39:25 elsewhere kernel: [265702.327292] sd 8:0:0:0: [sda]
202 Sense Key : 0x5 [current]
203 Oct 9 20:39:25 elsewhere kernel: [265702.327302] Info fld=0x0
204 Oct 9 20:39:25 elsewhere kernel: [265702.327307] sd 8:0:0:0: [sda]
205 ASC=0x21 ASCQ=0x0
206 Oct 9 20:39:25 elsewhere kernel: [265702.327316] sd 8:0:0:0: [sda]
207 CDB: cdb[0]=0x2a: 2a 00 00 11 ce 90 00 00 08 00
208 Oct 9 20:39:25 elsewhere kernel: [265702.327335] end_request: I/O
209 error, dev sda, sector 1166992
210 Oct 9 20:39:25 elsewhere kernel: [265702.327346] Buffer I/O error on
211 device sda2, logical block 1289
212 Oct 9 20:39:25 elsewhere kernel: [265702.327352] lost page write due
213 to I/O error on sda2
214 Oct 9 20:39:25 elsewhere kernel: [265702.327408] Aborting journal on
215 device sda2.
216 Oct 9 20:39:25 elsewhere kernel: [265702.328790] sd 8:0:0:0: [sda]
217 Result: hostbyte=0x00 driverbyte=0x08
218 Oct 9 20:39:25 elsewhere kernel: [265702.328804] sd 8:0:0:0: [sda]
219 Sense Key : 0x5 [current]
220 Oct 9 20:39:25 elsewhere kernel: [265702.328814] Info fld=0x0
221 Oct 9 20:39:25 elsewhere kernel: [265702.328819] sd 8:0:0:0: [sda]
222 ASC=0x21 ASCQ=0x0
223 Oct 9 20:39:25 elsewhere kernel: [265702.328827] sd 8:0:0:0: [sda]
224 CDB: cdb[0]=0x2a: 2a 00 00 11 ce 90 00 00 08 00
225 Oct 9 20:39:25 elsewhere kernel: [265702.328847] end_request: I/O
226 error, dev sda, sector 1166992
227 Oct 9 20:39:25 elsewhere kernel: [265702.328857] Buffer I/O error on
228 device sda2, logical block 1289
229 Oct 9 20:39:25 elsewhere kernel: [265702.328864] lost page write due
230 to I/O error on sda2
231 Oct 9 20:39:25 elsewhere kernel: [265702.328929] EXT3-fs (sda2):
232 error: ext3_put_super: Couldn't clean up the journal
233 Oct 9 20:39:25 elsewhere kernel: [265702.328938] EXT3-fs (sda2):
234 error: remounting filesystem read-only
235 Oct 9 20:39:25 elsewhere kernel: [265702.367778] sd 8:0:0:0: [sda]
236 Result: hostbyte=0x00 driverbyte=0x08
237 Oct 9 20:39:25 elsewhere kernel: [265702.367792] sd 8:0:0:0: [sda]
238 Sense Key : 0x5 [current]
239 Oct 9 20:39:25 elsewhere kernel: [265702.367802] Info fld=0x0
240 .... [goes like this for hundreds of lines]....
241
242 [1]
243 Oct 9 20:56:10 elsewhere kernel: [266707.674053] usb 1-2: new high
244 speed USB device using ehci_hcd and address 33
245 Oct 9 20:56:11 elsewhere kernel: [266707.789928] usb 1-2: New USB
246 device found, idVendor=1058, idProduct=0702
247 Oct 9 20:56:11 elsewhere kernel: [266707.789940] usb 1-2: New USB
248 device strings: Mfr=1, Product=2, SerialNumber=3
249 Oct 9 20:56:11 elsewhere kernel: [266707.789948] usb 1-2: Product:
250 External HDD
251 Oct 9 20:56:11 elsewhere kernel: [266707.789954] usb 1-2:
252 Manufacturer: Western Digital
253 Oct 9 20:56:11 elsewhere kernel: [266707.789961] usb 1-2:
254 SerialNumber: <U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF>
255 <U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF><U+FFFF>
256 Oct 9 20:56:11 elsewhere kernel: [266707.795512] scsi10 : usb-storage 1-2:1.0
257 Oct 9 20:56:12 elsewhere kernel: [266708.825857] scsi 10:0:0:0:
258 Direct-Access WD 1.02 PQ: 0 ANSI: 0
259 Oct 9 20:56:12 elsewhere kernel: [266708.826235] sd 10:0:0:0:
260 Attached scsi generic sg0 type 0
261 Oct 9 20:56:12 elsewhere kernel: [266708.860619] sd 10:0:0:0: [sda]
262 Too big for this kernel. Use a kernel compiled with support for large
263 block devices.
264 Oct 9 20:56:12 elsewhere kernel: [266708.860634] sd 10:0:0:0: [sda]
265 Unsupported sector size -75.
266 Oct 9 20:56:12 elsewhere kernel: [266708.860647] sd 10:0:0:0: [sda] 0
267 512-byte logical blocks: (0 B/0 B)
268 Oct 9 20:56:12 elsewhere kernel: [266708.860655] sd 10:0:0:0: [sda]
269 0-byte physical blocks
270 Oct 9 20:56:12 elsewhere kernel: [266708.862230] sd 10:0:0:0: [sda]
271 Write Protect is off
272 Oct 9 20:56:12 elsewhere kernel: [266708.862241] sd 10:0:0:0: [sda]
273 Mode Sense: 00 00 00 00
274 Oct 9 20:56:12 elsewhere kernel: [266708.862249] sd 10:0:0:0: [sda]
275 Assuming drive cache: write through
276 Oct 9 20:56:12 elsewhere kernel: [266708.865587] sd 10:0:0:0: [sda]
277 Too big for this kernel. Use a kernel compiled with support for large
278 block devices.
279 Oct 9 20:56:12 elsewhere kernel: [266708.865601] sd 10:0:0:0: [sda]
280 Unsupported sector size -75.
281 Oct 9 20:56:12 elsewhere kernel: [266708.867487] sd 10:0:0:0: [sda]
282 Assuming drive cache: write through
283 Oct 9 20:56:12 elsewhere kernel: [266708.867499] sd 10:0:0:0: [sda]
284 Attached SCSI disk
285
286 [2]
287 Oct 9 06:55:30 elsewhere kernel: [216267.317041] usb 1-2: new high
288 speed USB device using ehci_hcd and address 31
289 Oct 9 06:55:30 elsewhere kernel: [216267.432928] usb 1-2: New USB
290 device found, idVendor=1058, idProduct=0702
291 Oct 9 06:55:30 elsewhere kernel: [216267.432939] usb 1-2: New USB
292 device strings: Mfr=1, Product=2, SerialNumber=3
293 Oct 9 06:55:30 elsewhere kernel: [216267.432947] usb 1-2: Product: External HDD
294 Oct 9 06:55:30 elsewhere kernel: [216267.432953] usb 1-2:
295 Manufacturer: Western Digital
296 Oct 9 06:55:30 elsewhere kernel: [216267.432959] usb 1-2:
297 SerialNumber: 575845343037323832333437
298 Oct 9 06:55:30 elsewhere kernel: [216267.437545] scsi8 : usb-storage 1-2:1.0
299 Oct 9 06:55:31 elsewhere kernel: [216268.447876] scsi 8:0:0:0:
300 Direct-Access WD 1600BEVExternal 1.02 PQ: 0 ANSI: 0
301 Oct 9 06:55:31 elsewhere kernel: [216268.448238] sd 8:0:0:0: Attached
302 scsi generic sg0 type 0
303 Oct 9 06:55:31 elsewhere kernel: [216268.481955] sd 8:0:0:0: [sda]
304 312581808 512-byte logical blocks: (160 GB/149 GiB)
305 Oct 9 06:55:31 elsewhere kernel: [216268.483225] sd 8:0:0:0: [sda]
306 Write Protect is off
307 Oct 9 06:55:31 elsewhere kernel: [216268.483238] sd 8:0:0:0: [sda]
308 Mode Sense: 00 00 00 00
309 Oct 9 06:55:31 elsewhere kernel: [216268.483246] sd 8:0:0:0: [sda]
310 Assuming drive cache: write through
311 Oct 9 06:55:31 elsewhere kernel: [216268.484846] sd 8:0:0:0: [sda]
312 Assuming drive cache: write through
313 Oct 9 06:55:31 elsewhere kernel: [216268.484864] sda: sda1 sda2 sda3
314 sda4 < sda5 >
315 Oct 9 06:55:31 elsewhere kernel: [216268.558235] sd 8:0:0:0: [sda]
316 Assuming drive cache: write through
317 Oct 9 06:55:31 elsewhere kernel: [216268.558248] sd 8:0:0:0: [sda]
318 Attached SCSI disk
319 Oct 9 06:55:33 elsewhere kernel: [216270.550685] EXT3-fs: barriers not enabled
320 Oct 9 06:55:33 elsewhere kernel: [216270.566936] kjournald starting.
321 Commit interval 5 seconds
322 Oct 9 06:55:33 elsewhere kernel: [216270.567529] EXT3-fs (sda2):
323 using internal journal
324 Oct 9 06:55:33 elsewhere kernel: [216270.567545] EXT3-fs (sda2):
325 mounted filesystem with writeback data mode
326 Oct 9 06:55:34 elsewhere kernel: [216270.898980] EXT3-fs: barriers not enabled
327 Oct 9 06:55:34 elsewhere kernel: [216270.910455] kjournald starting.
328 Commit interval 5 seconds
329 Oct 9 06:55:34 elsewhere kernel: [216270.911421] EXT3-fs (sda5):
330 using internal journal
331 Oct 9 06:55:34 elsewhere kernel: [216270.911437] EXT3-fs (sda5):
332 mounted filesystem with writeback data mode
333
334 [3]
335 # grub-install --no-floppy /dev/sda
336
337
338 GNU GRUB version 0.97 (640K lower / 5120K upper memory)
339
340 [ Minimal BASH-like line editing is supported. For the first word, TAB
341 lists possible command completions. Anywhere else TAB lists the possible
342 completions of a device/filename. ]
343 grub> root (hd0,0)
344 Filesystem type is ext2fs, partition type 0x83
345 grub> setup --stage2=/boot/grub/stage2 --prefix=/grub (hd1)
346 Checking if "/grub/stage1" exists... yes
347 Checking if "/grub/stage2" exists... yes
348 Checking if "/grub/e2fs_stage1_5" exists... yes
349 Running "embed /grub/e2fs_stage1_5 (hd1)"... failed (this is not fatal)
350 Running "embed /grub/e2fs_stage1_5 (hd0,0)"... failed (this is not fatal)
351 Running "install --stage2=/boot/grub/stage2 /grub/stage1 d (hd1)
352 /grub/stage2 p /grub/menu.lst "... failed
353
354 Error 21: Selected disk does not exist
355 grub> quit
356
357
358 --
359 Fatih

Replies