Gentoo Archives: gentoo-user

From: Kerin Millar <kerframil@×××××××××××.uk>
To: gentoo-user@l.g.o
Subject: Re: [gentoo-user] Network card failing under load
Date: Wed, 17 Oct 2012 19:29:19
Message-Id: 507F0699.8030104@fastmail.co.uk
In Reply to: [gentoo-user] Network card failing under load by Datty
Datty wrote:
> Hi all,
>
> I have an issue with a remote server that keeps losing access to the
> network. I've managed to replicate the issue by pulling a large file
> from the server at full speed (100mbit). It will cut out during the file
> copy and the log will show the following:
>
> Oct 17 17:47:26 eric kernel: WARNING: at net/sched/sch_generic.c:256 dev_watchdog+0x251/0x260()
> Oct 17 17:47:26 eric kernel: Hardware name: KS
> Oct 17 17:47:26 eric kernel: NETDEV WATCHDOG: eth0 (sis900): transmit queue 0 timed out
> Oct 17 17:47:26 eric kernel: Modules linked in: tun
> Oct 17 17:47:26 eric kernel: Pid: 0, comm: swapper/0 Not tainted 3.4.9-gentoo-gnu-dat #7
> Oct 17 17:47:26 eric kernel: Call Trace:
> Oct 17 17:47:26 eric kernel: <IRQ> [<ffffffff810370db>] ? warn_slowpath_common+0x7b/0xc0
> Oct 17 17:47:26 eric kernel: [<ffffffff810371d5>] ? warn_slowpath_fmt+0x45/0x50
> Oct 17 17:47:26 eric kernel: [<ffffffff81217f81>] ? timerqueue_del+0x31/0x90
> Oct 17 17:47:26 eric kernel: [<ffffffff8131a791>] ? dev_watchdog+0x251/0x260
> Oct 17 17:47:26 eric kernel: [<ffffffff81042936>] ? run_timer_softirq+0x126/0x250
> Oct 17 17:47:26 eric kernel: [<ffffffff8131a540>] ? qdisc_reset+0x40/0x40
> Oct 17 17:47:26 eric kernel: [<ffffffff810573ad>] ? hrtimer_run_queues+0x5d/0x150
> Oct 17 17:47:26 eric kernel: [<ffffffff8103d178>] ? __do_softirq+0x98/0x120
> Oct 17 17:47:26 eric kernel: [<ffffffff8141e3fc>] ? call_softirq+0x1c/0x30
> Oct 17 17:47:26 eric kernel: [<ffffffff8100d065>] ? do_softirq+0x65/0xa0
> Oct 17 17:47:26 eric kernel: [<ffffffff8103d47e>] ? irq_exit+0x8e/0xb0
> Oct 17 17:47:26 eric kernel: [<ffffffff81023ce8>] ? smp_apic_timer_interrupt+0x68/0xa0
> Oct 17 17:47:26 eric kernel: [<ffffffff8141dc8a>] ? apic_timer_interrupt+0x6a/0x70
> Oct 17 17:47:26 eric kernel: <EOI> [<ffffffff81060435>] ? sched_clock_local+0x15/0x80
> Oct 17 17:47:26 eric kernel: [<ffffffff81012c78>] ? mwait_idle+0x58/0x80
> Oct 17 17:47:26 eric kernel: [<ffffffff810136b6>] ? cpu_idle+0x56/0xa0
> Oct 17 17:47:26 eric kernel: [<ffffffff815b4ae9>] ? start_kernel+0x2c6/0x2d1
> Oct 17 17:47:26 eric kernel: [<ffffffff815b459a>] ? repair_env_string+0x5c/0x5c
> Oct 17 17:47:26 eric kernel: [<ffffffff815b441e>] ? x86_64_start_kernel+0x104/0x113
> Oct 17 17:47:26 eric kernel: ---[ end trace 50e815f551bee4ac ]---
> Oct 17 17:47:26 eric kernel: eth0: Transmit timeout, status 00000005 00000000
> Oct 17 17:47:34 eric kernel: eth0: Transmit timeout, status 00000005 00000000
> Oct 17 17:47:42 eric kernel: eth0: Transmit timeout, status 00000005 00000000
> Oct 17 17:47:50 eric kernel: eth0: Transmit timeout, status 00000005 00000000
> Oct 17 17:47:58 eric kernel: eth0: Transmit timeout, status 00000005 00000000
>
> As it is remote I have to hard reboot it at this point to get anything
> more out of it, something that I'm not too keen on doing too often.
>
> From that I can work out there seems to be an issue with the sis900
> card but where to go from here I'm not sure on.
>
> Any help would be greatly appreciated

I would suggest that you upgrade to 3.4.14. If the problem still occurs,
re-open the following bug (which appears to be related):

https://bugzilla.kernel.org/show_bug.cgi?id=20092
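
To check whether the timeouts still occur after the upgrade, something
like the following could be used to tally the relevant kernel messages.
It is only a sketch: the function name count_tx_problems is made up for
illustration, the two patterns are taken from the log excerpt quoted
above, and where the kernel messages end up (/var/log/messages is
assumed in the note below) depends on the syslog configuration.

```python
import re
from collections import Counter

# Patterns modelled on the log excerpt quoted in this thread.
WATCHDOG = re.compile(
    r"NETDEV WATCHDOG: (\S+) \((\S+)\): transmit queue \d+ timed out"
)
TX_TIMEOUT = re.compile(r"(\S+): Transmit timeout, status [0-9a-f]+")


def count_tx_problems(lines):
    """Count watchdog firings and driver transmit timeouts per interface.

    `lines` is any iterable of log lines (an open file works).
    Returns a Counter keyed by (interface, kind), where kind is
    "watchdog" or "tx_timeout".
    """
    counts = Counter()
    for line in lines:
        match = WATCHDOG.search(line)
        if match:
            counts[(match.group(1), "watchdog")] += 1
            continue
        match = TX_TIMEOUT.search(line)
        if match:
            counts[(match.group(1), "tx_timeout")] += 1
    return counts
```

Assuming the log lives at /var/log/messages, calling
count_tx_problems(open("/var/log/messages", errors="replace")) before
and after a test transfer shows whether new timeouts were logged.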
70
71 Elsewhere, this was reported as a regression between Ubuntu 9.10 and
72 10.04 which might help to pinpoint where things went wrong in the
73 mainline kernel. Not that Ubuntu's lame bug resolution process has shed
74 any further light on the matter. See also:
75
76 http://www.alsvartr.de/?p=1104
77
78 --Kerin