Gentoo Archives: gentoo-user

From: Volker Armin Hemmann <volkerarmin@××××××××××.com>
To: gentoo-user@l.g.o
Subject: Re: [gentoo-user] USB Problems
Date: Sun, 12 Jun 2011 23:37:22
Message-Id: 2225175.dLhMVbeAlM@localhost
In Reply to: Re: [gentoo-user] USB Problems by john
1 On Sunday 12 June 2011 22:30:30 john wrote:
2 > On Sun, 12 Jun 2011 21:20:48 +0200
3 >
4 > meino.cramer@×××.de wrote:
5 > > john <jdm@×××××××××××××××××××.uk> [11-06-12 20:52]:
6 > > > On Sun, 12 Jun 2011 16:45:38 +0200
7 > > >
8 > > > meino.cramer@×××.de wrote:
9 > > > > john <jdm@×××××××××××××××××××.uk> [11-06-12 16:36]:
10 > > > > > Gents
11 > > > > >
12 > > > > > Whenever I insert external USB device my machine locks and I
13 > > > > > have to do a hard reset (sys req does not even work). I have
14 > > > > > tried this in a graphical environment and without. Machine
15 > > > > > boots up fine with USB keyboard and mouse but will not boot
16 > > > > > up
17 > > > > > with usb storage (memory stick
18 > > > > > - tried several)
19 > > > > >
20 > > > > > I have recently upgrade kernel to 2.6.38-r6 when this error
21 > > > > > started (approx). Hav also tried 2.6.38-r7 (which according
22 > > > > > to
23 > > > > > bugzilla should fix an issue similar to this) the error
24 > > > > > still
25 > > > > > happens.
26 > > > > >
27 > > > > > Any suggestions on kernel config options, debugging would be
28 > > > > > appreciated. And how to trap full error message.
29 > > > > >
30 > > > > > Error message alongs these lines
31 > > > > >
32 > > > > > Kernel panic not syncing - fatal exception in iterrupt
33 > > > > > ksoftirqd/1 Not tainted
34 > > > >
35 > > > > Hi John,
36 > > > >
37 > > > > may be this could help to track th eerror more closely:
38 > > > > Recompile the kernel with CONFIG_USB_DEBUG set.
39 > > > > If not already done, enable CONFIG_MAGIC_SYSRQ and
40 > > > > read about the available commands in the according
41 > > > > docs under /usr/src/linux/Documentation/.
42 > > > > This is a overall life saver in many other cases too.
43 > > > > But it may be a security hole also -- you can disable
44 > > > > it in /proc/ later.
45 > > > > If possible change /etc/fstab to mount the partition
46 > > > > to which logging will go with the "sync" option so
47 > > > > every log entry will go directly onto the bare metal
48 > > > > instead of a RAM buffer.
49 > > > >
50 > > > > Reboot into console mode (no X).
51 > > > > cd to the logging directory and
52 > > > > do a tail -f onto the logfile in question
53 > > > >
54 > > > > Insert the "USB stick of death" ;) and
55 > > > > see, whether you can see anything in the
56 > > > > tailed log.
57 > > > >
58 > > > > If not do the magic sysrq dance to sync hds,
59 > > > > remount readonly and finally reboot.
60 > > > >
61 > > > > Take a closer look into the resulting logs.
62 > > > >
63 > > > > Good luck!
64 > > > >
65 > > > > Nevertheless have a nice weekend!
66 > > > > Best regards,
67 > > > > mcc
68 > > >
69 > > > Thanks mmc,
70 > > > CONFIG_MAGIC_SYSRQ and CONFIG_USB_DEBUG set to y
71 > > >
72 > > > When I plug in USB device SYSREQ does not work. I have tested when
73 > > > system is normal and all ok with SYSREQ so USB device is locking up
74 > > > system completely.
75 > > >
76 > > > I have put sync in mount table but when device is inserted error log
77 > > > is not being produced (in /var/log/messages, etc). I may end up
78 > > > having to take a photo of error log(lol). Or there maybe some other
79 > > > way of catching error which Gentoo gurus may advise.
80 > > >
81 > > > I have tried 3 memory sticks and they all lock system up. These work
82 > > > on another PC using Arch and work on this PC dual booting with
83 > > > Windows. I have a usb hard drive which works ok so it is a memory
84 > > > stick only issue!!!!????
85 > > >
86 > > > Hmmm.
87 > >
88 > > Hi John,
89 > >
90 > > if sysreq does not work after inserting the usbstick then the kernel
91 > > stops working completly (the sysreq-code of the kernel is not
92 > > entered), which seems to be that something happening on very low
93 > > level (the problem not the sysreq-code of course... ;)
94 > >
95 > > Just another shot into the dark:
96 > > Do you have modularized all usb-related stuff? This also may help to
97 > > let the usb-sticks work correctly.
98 > >
99 > > Ah! Another thing: Insert the related modules into
100 > > /etc/conf.d/modules so that they get loaded in a very early
101 > > phase of the bootprocess.
102 > >
103 > > With modules loaded and with no usb-sticks as root inspect the
104 > > according modules with modinfo -p <modulename> to see, whether
105 > > there a additional parameters you may pass to the modules to get
106 > > an extra of informations or - better - to fix the issue.
107 > >
108 > > Modify under "Kernel hacking" in the kernel config the
109 > > "Default message log level" to see more.
110 > >
111 > > Furthermore there is a switch "Kernel debugging" with sub-options.
112 > > May be turning on these may also help to get closer to all evil ;)
113 > >
114 > > Last thing:
115 > > If there is nothing valueable stored on the usbsticks:
116 > > Try to reformat the usbsticks. Instead of putting a FAT32-fs on the
117 > > bare device, create a regular partition and create a Linux-fs
118 > > (reiserfs, ext2/3/4 or like that) on that partition.
119 > >
120 > > This may not be good idea in general, but - if it works - this would
121 > > give a hint, where to search next.
122 > >
123 > > Good luck!
124 > >
125 > > Best regards,
126 > > mcc
127 >
128 > Thanks Meino,
129 > Have recompliled USB as modules (ehci-hcd, ohci-hcd and usb-storage and
130 > load with /etc/conf.d/modules but still locks ups. Hope these are the
131 > right ones. I'll try scsi next.
132 >
133 > Also tried fdisk Linux file system and format ext3. But all to no avail.
134 > Still locking up.
135 >
136 > Have configure kernel hacking options as suggested.
137 >
138 > Error message as follows:-
139 >
140 > --------------------------------------------------------
141 > Bug: unable to handle kernel null pointer dereference at
142 > 0000000000000048
143 >
144 > IP [<ffffffff811ddb8e>]elv_queue_empty+0x12/0x27
145 >
146 > Oops 000 [#1] SMP
147 >
148 > last sys file: /sys/devices/virtual/bdi/8:32/uevent
149 >
150 > Pid: 1-, comm: ksoftirqd/1 Tainted: P 2.6.38-gentoo-r7 #4 MICRO-STAR
151 > MS-7388
152 >
153 > RIP 0010[<ffffffff811ddb8e7>] elv-queue_empty+0x12/0x27
154 > --------------------------------------------------------
155 >
156 >
157 > From my very limited knowledge, I would suspect kernel problem (ksoft)
158 > and irq (software interrupt perhaps). Please put me right if I'm out of
159 > order here. I'm Linux noobie
160 >
161 >
162 > Think I'll drop back to an older kernel to see if I'm still getting the
163 > issue as this may elliminate hardware issue.
164 >
165 > Unsure how to pass options to modules when loading. is this just done
166 > through /etc/conf.d/modules
167 >
168 > I like the error messages though Oops, RIP (someone has a sense of
169 > humour)
170 >
171 >
172 > Thanks for you help
173 >
174 > At least you learn while debgging
175
176 on lkml was just a reminder email of a bugzilla entry: panic on usb hdd.
177
178 So - it is a kernel bug. You will probably not find the culprit just building
179 models.
180
181 Use vanilla sources, do a git bisect, report findings on lkml.
182 --
183 #163933