Re: 2.6.24 BUG: soft lockup - CPU#X

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Brandeburg, Jesse <jesse.brandeburg@...>
Cc: Jarek Poplawski <jarkao2@...>, <netdev@...>
Date: Thursday, March 27, 2008 - 7:45 pm

Brandeburg, Jesse wrote:

I have seen the lockup on kernels 2.6.18 and newer mostly on TX traffic. 
I have seen it on another 10G driver (off the tree niu driver sibling, 
nxge).  The nxge driver doesn't use any TX interrupts and I have seen it 
with UDP TX, irqbalance disabled, with no irq activity at all.  some 
example traces included.
Regards
Matheos


nsn57-110 login: BUG: soft lockup - CPU#4 stuck for 11s! [uperf.x86_64:6697]
CPU 4:
Modules linked in: nxge nfs lockd nfs_acl autofs4 hidp rfcomm l2cap 
bluetooth sunrpc ipv6 cpufreq_ondemand rdma_ucm ib_ucm rdma_cm iw_cm 
ib_addr ib_srp scsi_transport_srp ib_cm ib_ipoib ib_sa ib_uverbs ib_umad 
ib_mad ib_core dm_multipath battery ac parport_pc lp parport joydev 
sr_mod sg e1000 i2c_nforce2 button i2c_core shpchp pcspkr dm_snapshot 
dm_zerodm_mirror dm_mod usb_storage mptsas mptscsih mptbase 
scsi_transport_sas sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
Pid: 6697, comm: uperf.x86_64 Not tainted 2.6.24-mati #3
RIP: 0010:[<ffffffff80316c64>]  [<ffffffff80316c64>] 
_raw_spin_unlock+0x37/0x7f
RSP: 0018:ffff8101e00c3af8  EFLAGS: 00000246
RAX: ffff8101e084e000 RBX: ffff8101f0908000 RCX: ffff8101f0908780
RDX: 0000000000000000 RSI: 0000000000000116 RDI: ffff8101f0908300
RBP: 00000000000003d5 R08: 0000000000000004 R09: 0000000000000115
R10: ffff8101e00c3968 R11: 0000000000000000 R12: ffff8101e00c3968
R13: 0000000000000000 R14: ffff8101f0908000 R15: ffffffff882e77ef
FS:  0000000041001940(0063) GS:ffff8102fba13580(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000004f81cf48 CR3: 00000001e05b4000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400

Call Trace:
 [<ffffffff80406dca>] __qdisc_run+0x96/0x174
 [<ffffffff803f8139>] dev_queue_xmit+0x141/0x270
 [<ffffffff80417faf>] ip_push_pending_frames+0x32c/0x3a0
 [<ffffffff80419676>] ip_generic_getfrag+0x0/0x8b
 [<ffffffff8043359f>] udp_push_pending_frames+0x2ba/0x337
 [<ffffffff80434794>] udp_sendmsg+0x4c8/0x606
 [<ffffffff803eafbb>] sock_sendmsg+0xe2/0xff
 [<ffffffff8029e1a1>] iput+0x42/0x7b
 [<ffffffff802480e0>] autoremove_wake_function+0x0/0x2e
 [<ffffffff80275d0c>] find_extend_vma+0x16/0x59
 [<ffffffff8045e4d3>] _spin_lock_irqsave+0x9/0xe
 [<ffffffff80311d88>] __up_read+0x13/0x8a
 [<ffffffff803eba5c>] sys_sendto+0x128/0x151
 [<ffffffff8045e3ed>] _spin_unlock_bh+0x9/0x15
 [<ffffffff8020b7fc>] tracesys+0xdc/0xe1

BUG: soft lockup - CPU#4 stuck for 11s! [uperf.x86_64:6697]
CPU 4:
Modules linked in: nxge nfs lockd nfs_acl autofs4 hidp rfcomm l2cap 
bluetooth sunrpc ipv6 cpufreq_ondemand rdma_ucm ib_ucm rdma_cm iw_cm 
ib_addr ib_srp scsi_transport_srp ib_cm ib_ipoib ib_sa ib_uverbs ib_umad 
ib_mad ib_core dm_multipath battery ac parport_pc lp parport joydev 
sr_mod sg e1000 i2c_nforce2 button i2c_core shpchp pcspkr dm_snapshot 
dm_zero dm_mirror dm_mod usb_storage mptsas mptscsih mptbase 
scsi_transport_sas sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
Pid: 6697, comm: uperf.x86_64 Not tainted 2.6.24-mati #3
RIP: 0010:[<ffffffff80316d29>]  [<ffffffff80316d29>] 
_raw_spin_lock+0x7d/0xf6
RSP: 0018:ffff8101e00c3ad8  EFLAGS: 00000246
RAX: 0000000000000000 RBX: 0000000000000036 RCX: 00000000dabeab42
RDX: 0000000000000e24 RSI: 0000000000000392 RDI: 0000000000000001
RBP: ffffffff882e77ef R08: 0000000000000004 R09: 0000000000000391
R10: ffff8101e00c3968 R11: 0000000000000000 R12: ffffffffffffff10
R13: 0000000000000011 R14: ffff8102f9db8010 R15: 00000000000005de
FS:  0000000041001940(0063) GS:ffff8102fba13580(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000004f81cf48 CR3: 00000001e05b4000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400

Call Trace:
 [<ffffffff80316d3a>] _raw_spin_lock+0x8e/0xf6
 [<ffffffff80406fa1>] pfifo_fast_dequeue+0x24/0x59
 [<ffffffff80406dd2>] __qdisc_run+0x9e/0x174
 [<ffffffff803f8139>] dev_queue_xmit+0x141/0x270
 [<ffffffff80417faf>] ip_push_pending_frames+0x32c/0x3a0
 [<ffffffff80419676>] ip_generic_getfrag+0x0/0x8b
 [<ffffffff8043359f>] udp_push_pending_frames+0x2ba/0x337
 [<ffffffff80434794>] udp_sendmsg+0x4c8/0x606
 [<ffffffff803eafbb>] sock_sendmsg+0xe2/0xff
 [<ffffffff8029e1a1>] iput+0x42/0x7b
 [<ffffffff802480e0>] autoremove_wake_function+0x0/0x2e
 [<ffffffff80275d0c>] find_extend_vma+0x16/0x59
 [<ffffffff8045e4d3>] _spin_lock_irqsave+0x9/0xe
 [<ffffffff80311d88>] __up_read+0x13/0x8a
 [<ffffffff803eba5c>] sys_sendto+0x128/0x151
 [<ffffffff8045e3ed>] _spin_unlock_bh+0x9/0x15
 [<ffffffff8020b7fc>] tracesys+0xdc/0xe1

BUG: soft lockup - CPU#4 stuck for 11s! [uperf.x86_64:6697]
CPU 4:
Modules linked in: nxge nfs lockd nfs_acl autofs4 hidp rfcomm l2cap 
bluetooth sunrpc ipv6 cpufreq_ondemand rdma_ucm ib_ucm rdma_cm iw_cm 
ib_addr ib_srp scsi_transport_srp ib_cm ib_ipoib ib_sa ib_uverbs ib_umad 
ib_mad ib_core dm_multipath battery ac parport_pc lp parport joydev 
sr_mod sg e1000 i2c_nforce2 button i2c_core shpchp pcspkr dm_snapshot 
dm_zero dm_mirror dm_mod usb_storage mptsas mptscsih mptbase 
scsi_transport_sas sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
Pid: 6697, comm: uperf.x86_64 Not tainted 2.6.24-mati #3
RIP: 0010:[<ffffffff80316d29>]  [<ffffffff80316d29>] 
_raw_spin_lock+0x7d/0xf6
RSP: 0018:ffff8101e00c3ad8  EFLAGS: 00000246
RAX: 0000000000000000 RBX: 0000000000000026 RCX: 000000002a2092e7
RDX: 0000000000000e28 RSI: 00000000000001d8 RDI: 0000000000000001
RBP: ffffffff882e77ef R08: 0000000000000004 R09: 00000000000001d7
R10: ffff8101e00c3968 R11: 0000000000000000 R12: ffffffffffffff10
R13: 0000000000000011 R14: ffff8102f686e010 R15: 00000000000005de
FS:  0000000041001940(0063) GS:ffff8102fba13580(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000004f81cf48 CR3: 00000001e05b4000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400

Call Trace:
 [<ffffffff80316d3a>] _raw_spin_lock+0x8e/0xf6
 [<ffffffff80406e8a>] __qdisc_run+0x156/0x174
 [<ffffffff80406dd2>] __qdisc_run+0x9e/0x174
 [<ffffffff803f8139>] dev_queue_xmit+0x141/0x270
 [<ffffffff80417faf>] ip_push_pending_frames+0x32c/0x3a0
 [<ffffffff80419676>] ip_generic_getfrag+0x0/0x8b
 [<ffffffff8043359f>] udp_push_pending_frames+0x2ba/0x337
 [<ffffffff80434794>] udp_sendmsg+0x4c8/0x606
 [<ffffffff803eafbb>] sock_sendmsg+0xe2/0xff
 [<ffffffff8029e1a1>] iput+0x42/0x7b
 [<ffffffff802480e0>] autoremove_wake_function+0x0/0x2e
 [<ffffffff80275d0c>] find_extend_vma+0x16/0x59
 [<ffffffff8045e4d3>] _spin_lock_irqsave+0x9/0xe
 [<ffffffff80311d88>] __up_read+0x13/0x8a
 [<ffffffff803eba5c>] sys_sendto+0x128/0x151
 [<ffffffff8045e3ed>] _spin_unlock_bh+0x9/0x15
 [<ffffffff8020b7fc>] tracesys+0xdc/0xe1





--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
2.6.24 BUG: soft lockup - CPU#X, Matheos Worku, (Wed Mar 26, 12:46 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Wed Mar 26, 4:14 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Matheos Worku, (Wed Mar 26, 4:26 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Thu Mar 27, 6:33 am)
RE: 2.6.24 BUG: soft lockup - CPU#X, Brandeburg, Jesse, (Thu Mar 27, 7:18 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Matheos Worku, (Thu Mar 27, 7:45 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, David Miller, (Thu Mar 27, 8:02 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Matheos Worku, (Thu Mar 27, 8:19 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, David Miller, (Thu Mar 27, 8:34 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Thu Mar 27, 9:22 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Matheos Worku, (Thu Mar 27, 9:58 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Fri Mar 28, 6:38 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Fri Mar 28, 9:38 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Fri Mar 28, 9:53 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Fri Mar 28, 10:39 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Fri Mar 28, 10:56 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Fri Mar 28, 11:29 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Fri Mar 28, 9:06 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Sat Mar 29, 5:11 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Fri Mar 28, 11:47 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, jamal, (Fri Mar 28, 6:33 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Matheos Worku, (Fri Mar 28, 1:00 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, David Miller, (Thu Mar 27, 9:38 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Fri Mar 28, 6:29 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Ingo Molnar, (Fri Mar 28, 6:56 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Fri Mar 28, 7:06 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Ingo Molnar, (Fri Mar 28, 10:09 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Fri Mar 28, 7:29 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, David Miller, (Fri Mar 28, 7:25 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, jamal, (Fri Mar 28, 8:19 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Herbert Xu, (Fri Mar 28, 9:26 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Ingo Molnar, (Fri Mar 28, 10:12 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, jamal, (Fri Mar 28, 10:07 am)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Wed Mar 26, 5:46 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Jarek Poplawski, (Wed Mar 26, 5:53 pm)
Re: 2.6.24 BUG: soft lockup - CPU#X, Rick Jones, (Wed Mar 26, 1:31 pm)