Re: 2.6.24-rc6-mm1

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Herbert Xu <herbert@...>
Cc: Andrew Morton <akpm@...>, <linux-kernel@...>, Neil Brown <neilb@...>, J. Bruce Fields <bfields@...>, <netdev@...>, Tom Tucker <tom@...>
Date: Tuesday, January 1, 2008 - 2:29 pm

On Jan 1, 2008 1:59 PM, Torsten Kaiser <just.for.lkml@googlemail.com> wrote:

I can't explain, why this seems to fix 2.6.24-rc3-mm2 for me, but at
least in 2.6.24-rc6-mm1 it does not seem to be involved.


During normal work I did not see the frag counters increase.
I used ping -s 10000 to create some frags, worked perfectly.
I used netio -b 63k -u [target] to create around half a million frags,
worked too.

And what really is strange is that I changed skb_morph into this:
struct sk_buff *skb_morph(struct sk_buff *dst, struct sk_buff *src)
{
        printk(KERN_ERR "morph %p:%p",dst,src);
        WARN_ON(1);
        skb_release_all(dst);
        return __skb_clone(dst, src);
}
... that warning was not triggered once.


The problem bisecting this, is that I can't seem to trigger this on
demand. Today I was just about giving up on triggering it in -rc6-mm1
with doing package complies when did happen again. But that was after
more then 4 hours...


As noted above, my WARN_ON(1) in skb_morph did not trigger once before
the system died with this OOPS:
[18663.909931] Unable to handle kernel NULL pointer dereference at
0000000000000000 RIP:
[18663.915489]  [<ffffffff8055f2e8>] tcp_read_sock+0x58/0x1b0
[18663.918652] PGD 73442067 PUD 7480e067 PMD 0
[18663.918652] Oops: 0000 [1] SMP
[18663.918652] last sysfs file:
/sys/devices/system/cpu/cpu3/cache/index2/shared_cpu_map
[18663.918652] CPU 1
[18663.918652] Modules linked in: radeon drm nfsd exportfs w83792d
ipv6 tuner tea5767 tda8290 tuner_xc2028 tda9887 tuner_simple mt20xx
tea5761 tvaudio msp3400 bttv ir_common compat_ioctl32 videobuf_dma_sg
videobuf_core btcx_risc tveeprom usbhid videodev v4l2_common
v4l1_compat hid sg pata_amd i2c_nforce2
[18663.918652] Pid: 0, comm: swapper Not tainted 2.6.24-rc6-mm1 #13
[18663.918652] RIP: 0010:[<ffffffff8055f2e8>]  [<ffffffff8055f2e8>]
tcp_read_sock+0x58/0x1b0
[18663.918652] RSP: 0018:ffff81007ff4fb60  EFLAGS: 00010286
[18663.918652] RAX: 0000000000000038 RBX: 0000000000000000 RCX: 0000000000000000
[18663.918652] RDX: ffff8100141a40b0 RSI: ffff81007ff4fbc0 RDI: 0000000000000000
[18663.918652] RBP: ffff81007ff4fbb0 R08: 0000000000000002 R09: 0000000000000000
[18663.918652] R10: ffffffff805b2afb R11: 000000000520cde8 R12: 00000000c05a019a
[18663.918652] R13: 000000000f26378b R14: ffff810066469d38 R15: ffff81004b4e4000
[18663.918652] FS:  00007f58ac9a0700(0000) GS:ffff81007ff12580(0000)
knlGS:0000000000000000
[18663.918652] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
[18663.918652] CR2: 0000000000000000 CR3: 0000000073441000 CR4: 00000000000006e0
[18663.918652] DR0: 00007fffe1e55cbc DR1: 0000000000000000 DR2: 0000000000000000
[18663.918652] DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
[18663.918652] Process swapper (pid: 0, threadinfo ffff81011ff2c000,
task ffff81007ff4a000)
[18663.918652] Stack:  ffff810066469d38 ffff81004b4e4148
ffffffff805b1ab0 ffff81007ff4fbc0
[18663.918652] Stack:  ffff810066469d38 ffff81004b4e4148
ffffffff805b1ab0 ffff81007ff4fbc0
[18663.918652]  00000000805b2afb ffff81004b4e4000 ffff81004b4e4298
ffff810066469d00
[18663.918652]  ffff810066469d38 0000000000000000 ffff81007ff4fbf0
ffffffff805b2b41
[18663.918652] Call Trace:
[18663.918652]  <IRQ>  [<ffffffff805b1ab0>] xs_tcp_data_recv+0x0/0x560
[18663.918652]  [<ffffffff805b2b41>] xs_tcp_data_ready+0x71/0x90
[18663.918652]  [<ffffffff80568bec>] __tcp_ack_snd_check+0x5c/0xa0
[18663.918652]  [<ffffffff8056a458>] tcp_rcv_established+0x3c8/0x800
[18663.918652]  [<ffffffff80571451>] tcp_v4_do_rcv+0x2e1/0x4e0
[18663.918652]  [<ffffffff80573cb1>] tcp_v4_rcv+0x721/0x850
[18663.918652]  [<ffffffff80553d63>] ip_local_deliver_finish+0xd3/0x250
[18663.918652]  [<ffffffff8055433b>] ip_local_deliver+0x3b/0x90
[18663.918652]  [<ffffffff80553988>] ip_rcv_finish+0x118/0x420
[18663.918652]  [<ffffffff8022e313>] enqueue_task_fair+0x73/0xd0
[18663.918652]  [<ffffffff80554236>] ip_rcv+0x226/0x2f0
[18663.918652]  [<ffffffff80537576>] netif_receive_skb+0x1d6/0x280
[18663.918652]  [<ffffffff8053a1ea>] process_backlog+0x8a/0xf0
[18663.918652]  [<ffffffff80539e84>] net_rx_action+0xb4/0x130
[18663.918652]  [<ffffffff8023d624>] __do_softirq+0x84/0x110
[18663.918652]  [<ffffffff8020c82c>] call_softirq+0x1c/0x30
[18663.918652]  [<ffffffff8020eaa5>] do_softirq+0x65/0xc0
[18663.918652]  [<ffffffff8023d595>] irq_exit+0x95/0xa0
[18663.918652]  [<ffffffff8020ebbf>] do_IRQ+0x8f/0x100
[18663.918652]  [<ffffffff8020a4b0>] default_idle+0x0/0x80
[18663.918652]  [<ffffffff8020bb26>] ret_from_intr+0x0/0xf
[18663.918652]  <EOI>  [<ffffffff80252310>]
__atomic_notifier_call_chain+0x0/0xa0
[18663.918652]  [<ffffffff8020a4f3>] default_idle+0x43/0x80
[18663.918652]  [<ffffffff8020a4f1>] default_idle+0x41/0x80
[18663.918652]  [<ffffffff8020a4b0>] default_idle+0x0/0x80
[18663.918652]  [<ffffffff8020a59c>] cpu_idle+0x6c/0xa0
[18663.918652]  [<ffffffff808109b8>] start_secondary+0x2f8/0x420
[18663.918652]
[18663.918652]
[18663.918652] Code: 48 8b 3b 0f 18 0f 74 75 8b 93 a0 00 00 00 45 89 ec 44 2b 63
[18663.918652] RIP  [<ffffffff8055f2e8>] tcp_read_sock+0x58/0x1b0
[18663.918652]  RSP <ffff81007ff4fb60>
[18663.918652] CR2: 0000000000000000
[18663.918680] ---[ end trace 1dc6b1bf3734ac14 ]---

(gdb) list *0xffffffff8055f2e8
0xffffffff8055f2e8 is in tcp_read_sock (net/ipv4/tcp.c:1173).
1168    static inline struct sk_buff *tcp_recv_skb(struct sock *sk,
u32 seq, u32 *off)
1169    {
1170            struct sk_buff *skb;
1171            u32 offset;
1172
1173            skb_queue_walk(&sk->sk_receive_queue, skb) {
1174                    offset = seq - TCP_SKB_CB(skb)->seq;
1175                    if (tcp_hdr(skb)->syn)
1176                            offset--;
1177                    if (offset < skb->len || tcp_hdr(skb)->fin) {

(gdb) list *0xffffffff805b2b41
0xffffffff805b2b41 is in xs_tcp_data_ready (net/sunrpc/xprtsock.c:1079).
1074                    goto out;
1075
1076            /* We use rd_desc to pass struct xprt to xs_tcp_data_recv */
1077            rd_desc.arg.data = xprt;
1078            rd_desc.count = 65536;
1079            tcp_read_sock(sk, &rd_desc, xs_tcp_data_recv);
1080    out:
1081            read_unlock(&sk->sk_callback_lock);
1082    }
1083

I will see what vanilla -rc6 will do...

Torsten
--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
2.6.24-rc6-mm1, Andrew Morton, (Sun Dec 23, 3:30 am)
Re: 2.6.24-rc6-mm1, Dave Young, (Wed Dec 26, 4:37 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sun Dec 23, 12:27 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Fri Dec 28, 6:53 pm)
Re: 2.6.24-rc6-mm1, Andrew Morton, (Fri Dec 28, 7:07 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sat Dec 29, 12:51 pm)
Re: 2.6.24-rc6-mm1, Herbert Xu, (Sat Dec 29, 9:30 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sat Dec 29, 11:34 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Mon Dec 31, 4:15 pm)
Re: 2.6.24-rc6-mm1, Herbert Xu, (Tue Jan 1, 8:04 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Wed Jan 2, 2:29 pm)
Re: 2.6.24-rc6-mm1, Herbert Xu, (Wed Jan 2, 5:51 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Fri Jan 4, 6:23 am)
Re: 2.6.24-rc6-mm1, Jarek Poplawski, (Fri Jan 4, 9:30 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Fri Jan 4, 11:21 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Fri Jan 4, 5:24 pm)
Re: 2.6.24-rc6-mm1, Jarek Poplawski, (Fri Jan 4, 8:07 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sat Jan 5, 4:01 am)
Re: 2.6.24-rc6-mm1, Jarek Poplawski, (Sat Jan 5, 6:13 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sat Jan 5, 10:52 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sat Jan 5, 6:10 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sat Jan 5, 11:16 pm)
Re: 2.6.24-rc6-mm1, Andrew Morton, (Sat Jan 5, 9:25 pm)
Re: 2.6.24-rc6-mm1, FUJITA Tomonori, (Sat Jan 5, 11:28 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sun Jan 6, 6:41 am)
Re: 2.6.24-rc6-mm1, FUJITA Tomonori, (Sun Jan 6, 7:23 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sun Jan 6, 7:35 am)
Re: 2.6.24-rc6-mm1, FUJITA Tomonori, (Sun Jan 6, 9:33 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sun Jan 6, 4:03 pm)
Re: 2.6.24-rc6-mm1, FUJITA Tomonori, (Mon Jan 7, 2:16 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Fri Jan 25, 5:06 pm)
Re: 2.6.24-rc6-mm1, Ingo Molnar, (Tue Jan 8, 11:59 am)
Re: 2.6.24-rc6-mm1, FUJITA Tomonori, (Tue Jan 8, 7:57 pm)
Re: 2.6.24-rc6-mm1, Jarek Poplawski, (Wed Jan 9, 5:04 am)
Re: 2.6.24-rc6-mm1, FUJITA Tomonori, (Wed Jan 9, 8:54 pm)
Re: 2.6.24-rc6-mm1, Andrew Morton, (Tue Jan 8, 8:27 pm)
Re: 2.6.24-rc6-mm1, FUJITA Tomonori, (Tue Jan 8, 8:54 pm)
Re: 2.6.24-rc6-mm1, Andrew Morton, (Tue Jan 8, 9:07 pm)
Re: 2.6.24-rc6-mm1, Jarek Poplawski, (Sun Jan 6, 4:27 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sun Jan 6, 6:30 am)
Re: 2.6.24-rc6-mm1, Jarek Poplawski, (Sun Jan 6, 10:52 am)
Re: 2.6.24-rc6-mm1, J. Bruce Fields, (Wed Jan 2, 5:57 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Thu Jan 3, 11:37 am)
Re: 2.6.24-rc6-mm1, J. Bruce Fields, (Thu Jan 3, 2:52 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Thu Jan 3, 1:02 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Tue Jan 1, 8:59 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Tue Jan 1, 2:29 pm)
Re: 2.6.24-rc6-mm1, Randy Dunlap, (Sun Dec 30, 1:41 am)
Re: 2.6.24-rc6-mm1, J. Bruce Fields, (Sun Dec 30, 5:24 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Sun Dec 30, 5:35 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Mon Dec 31, 9:17 am)
Re: 2.6.24-rc6-mm1, Andrew Morton, (Sun Dec 23, 4:39 pm)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Thu Dec 27, 7:42 am)
Re: 2.6.24-rc6-mm1, Torsten Kaiser, (Thu Dec 27, 10:30 am)
Re: 2.6.24-rc6-mm1 Kernel panics at different functions (), Kamalesh Babulal, (Thu Dec 27, 4:49 am)
Re: 2.6.24-rc6-mm1 Kernel panics at different functions (), Kamalesh Babulal, (Fri Dec 28, 5:11 am)
Re: 2.6.24-rc6-mm1 Kernel panics at different functions (), Kamalesh Babulal, (Thu Dec 27, 6:25 am)
Re: 2.6.24-rc6-mm1 - e1000 breakage, James Morris, (Wed Dec 26, 7:39 pm)
Re: 2.6.24-rc6-mm1 (driver core/sysfs), Randy Dunlap, (Mon Dec 31, 4:11 pm)
Re: 2.6.24-rc6-mm1 (driver core/sysfs), Greg KH, (Fri Jan 11, 9:05 pm)
[patch] auto-qa Kconfig, Ingo Molnar, (Mon Jan 14, 12:11 pm)
Re: [patch] auto-qa Kconfig, Pavel Machek, (Tue Jan 15, 6:13 pm)
Re: 2.6.24-rc6-mm1 (build problem: gpio/W1), Randy Dunlap, (Mon Dec 31, 2:19 pm)
Re: 2.6.24-rc6-mm1 (build problem: gpio/W1), Evgeniy Polyakov, (Sat Jan 5, 11:29 am)
Re: 2.6.24-rc6-mm1 (build problem: gpio/W1), Ville , (Sat Jan 5, 12:16 pm)
Re: 2.6.24-rc6-mm1 (build problem: gpio/W1), Randy Dunlap, (Sat Jan 5, 1:18 pm)
Re: 2.6.24-rc6-mm1 (build problem: gpio_keys), Randy Dunlap, (Mon Dec 31, 2:18 pm)
Re: 2.6.24-rc6-mm1 (build problem: gpio_keys), David Brownell, (Mon Dec 31, 2:40 pm)
[PATCH -mm] gpio: fix x86 build problem: gpio_keys, Randy Dunlap, (Mon Dec 31, 3:10 pm)
Re: [PATCH -mm] gpio: fix x86 build problem: gpio_keys, Ingo Molnar, (Tue Jan 1, 11:32 am)
Re: 2.6.24-rc6-mm1 (build problem: v4l / i2c), Randy Dunlap, (Mon Dec 31, 2:18 pm)
[PATCH -mm] driver core: build with SYSFS=n, Randy Dunlap, (Mon Dec 31, 2:05 pm)
[PATCH -mm] crypto: scatterwalk.h needs sched.h, Randy Dunlap, (Mon Dec 31, 2:05 pm)
Re: [PATCH -mm] crypto: scatterwalk.h needs sched.h, Herbert Xu, (Mon Dec 31, 6:31 pm)
Re: 2.6.24-rc6-mm1: __raw_spin_is_contended undefined, Joseph Fannin, (Wed Dec 26, 10:21 pm)
Re: 2.6.24-rc6-mm1: __raw_spin_is_contended undefined, Nick Piggin, (Thu Dec 27, 1:21 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, Mariusz Kozlowski, (Wed Dec 26, 8:29 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, David Miller, (Wed Dec 26, 11:05 pm)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, Adrian Bunk, (Fri Dec 28, 7:22 pm)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, David Miller, (Sat Dec 29, 4:14 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, Adrian Bunk, (Sat Dec 29, 4:48 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, David Miller, (Sat Dec 29, 4:54 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, Adrian Bunk, (Sat Dec 29, 5:06 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, David Miller, (Sat Dec 29, 5:18 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, Adrian Bunk, (Sat Dec 29, 5:53 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, David Miller, (Sat Dec 29, 5:15 am)
Re: 2.6.24-rc6-mm1: some section mismatches on sparc64, David Miller, (Sat Dec 29, 4:27 am)
Re: 2.6.24-rc6-mm1, Andreas Mohr, (Tue Dec 25, 5:51 pm)
Re: 2.6.24-rc6-mm1: suspend broken on HP nx6325 due to cpufr..., Rafael J. Wysocki, (Sun Dec 23, 6:54 pm)
Re: 2.6.24-rc6-mm1: suspend broken on HP nx6325 due to cpufr..., Rafael J. Wysocki, (Mon Dec 24, 10:13 am)
Re: 2.6.24-rc6-mm1, Rafael J. Wysocki, (Sun Dec 23, 8:35 am)
Re: 2.6.24-rc6-mm1, H. Peter Anvin, (Sun Dec 23, 7:09 pm)
Re: 2.6.24-rc6-mm1, Ingo Molnar, (Sun Dec 23, 9:00 am)
Re: 2.6.24-rc6-mm1, Rafael J. Wysocki, (Sun Dec 23, 9:48 am)
Re: 2.6.24-rc6-mm1, Rafael J. Wysocki, (Sun Dec 23, 9:53 am)
Re: 2.6.24-rc6-mm1, Sam Ravnborg, (Sun Dec 23, 4:09 pm)
Re: 2.6.24-rc6-mm1, Rafael J. Wysocki, (Sun Dec 23, 6:44 pm)
Re: 2.6.24-rc6-mm1, Ingo Molnar, (Sun Dec 23, 7:04 am)
Re: 2.6.24-rc6-mm1, Ingo Molnar, (Sun Dec 23, 7:10 am)
Re: 2.6.24-rc6-mm1, Andrew Morton, (Sun Dec 23, 7:34 am)
Re: 2.6.24-rc6-mm1, Ingo Molnar, (Sun Dec 23, 7:57 am)
Re: 2.6.24-rc6-mm1, Christoph Hellwig, (Sun Dec 23, 8:12 am)