Re: Weird rcu lockdep warning

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: Frederic Weisbecker
Date: Tuesday, April 13, 2010 - 5:02 pm

On Tue, Apr 13, 2010 at 04:40:43PM -0700, Paul E. McKenney wrote:



Yeah :-/





No, for example I just found the same problem in x86 in -tip:


===================================================
[ INFO: suspicious rcu_dereference_check() usage. ]
---------------------------------------------------
kernel/perf_event.c:2236 invoked rcu_dereference_check() without protection!

other info that might help us debug this:


rcu_scheduler_active = 1, debug_locks = 0
2 locks held by perf/3466:
 #0:  (&ctx->mutex){+.+...}, at: [<c10bc567>] sys_perf_event_open+0x2a7/0x420
 #1:  (&ctx->lock){-.....}, at: [<c10b940f>] __perf_install_in_context+0x6f/0x160

stack backtrace:
Pid: 3466, comm: perf Not tainted 2.6.34-rc3-atom #411
Call Trace:
 [<c150f95f>] ? printk+0x1d/0x1f
 [<c1075f8a>] lockdep_rcu_dereference+0xaa/0xb0
 [<c10b8c01>] perf_event_update_userpage+0x151/0x190
 [<c10b8ab0>] ? perf_event_update_userpage+0x0/0x190
 [<c1010931>] x86_perf_event_set_period+0x101/0x1d0
 [<c1010cf2>] intel_pmu_save_and_restart+0x12/0x20
 [<c1013743>] intel_pmu_handle_irq+0x1d3/0x4e0
 [<c1069b08>] ? sched_clock_cpu+0x128/0x170
 [<c1074e8b>] ? trace_hardirqs_off+0xb/0x10
 [<c1069b9f>] ? cpu_clock+0x4f/0x60
 [<c1074e8b>] ? trace_hardirqs_off+0xb/0x10
 [<c1069b9f>] ? cpu_clock+0x4f/0x60
 [<c1078105>] ? __lock_acquire+0x1c5/0x1900
 [<c1069942>] ? sched_clock_local+0xd2/0x170
 [<c100f180>] perf_event_nmi_handler+0x40/0x50
 [<c1068885>] notifier_call_chain+0x35/0x70
 [<c1068eec>] __atomic_notifier_call_chain+0x6c/0xb0
 [<c1068e80>] ? __atomic_notifier_call_chain+0x0/0xb0
 [<c1068f4f>] atomic_notifier_call_chain+0x1f/0x30
 [<c1068f8d>] notify_die+0x2d/0x30
 [<c100428c>] do_nmi+0x16c/0x350
 [<c1074f36>] ? lock_release_holdtime+0xa6/0x1a0
 [<c151458d>] nmi_stack_correct+0x28/0x2d
 [<c10104cc>] ? intel_pmu_enable_all+0x8c/0x110
 [<c1010c5a>] hw_perf_enable+0x1ba/0x240
 [<c10b7df5>] perf_enable+0x25/0x30
 [<c10b94b7>] __perf_install_in_context+0x117/0x160
 [<c10807f6>] smp_call_function_single+0x76/0x170
 [<c10b93a0>] ? __perf_install_in_context+0x0/0x160
 [<c10bb34d>] perf_install_in_context+0x7d/0x80
 [<c10bc575>] sys_perf_event_open+0x2b5/0x420
 [<c1002c4c>] sysenter_do_call+0x12/0x32





I fear it's too easily reproducible (for me at least) and too well localized
(always the same place) to be a random interrupt there.

I just have a guess though....
This seems to always happen from NMI path, and lockdep is disabled on NMI.
I suspect the lock_acquire() performed by rcu_read_lock() is just ignored
and then the rcu_read_lock_held() check has the wrong result...

--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
Weird rcu lockdep warning, Frederic Weisbecker, (Tue Apr 13, 1:04 pm)
Re: Weird rcu lockdep warning, Paul E. McKenney, (Tue Apr 13, 4:40 pm)
Re: Weird rcu lockdep warning, Frederic Weisbecker, (Tue Apr 13, 5:02 pm)
Re: Weird rcu lockdep warning, David Miller, (Tue Apr 13, 5:13 pm)
Re: Weird rcu lockdep warning, Paul E. McKenney, (Tue Apr 13, 6:49 pm)
Re: Weird rcu lockdep warning, David Miller, (Tue Apr 13, 6:51 pm)
Re: Weird rcu lockdep warning, Lai Jiangshan, (Tue Apr 13, 8:34 pm)
Re: Weird rcu lockdep warning, Paul E. McKenney, (Wed Apr 14, 8:43 am)
Re: Weird rcu lockdep warning, Frederic Weisbecker, (Wed Apr 14, 8:51 am)
Re: Weird rcu lockdep warning, Paul E. McKenney, (Wed Apr 14, 9:00 am)
Re: Weird rcu lockdep warning, Paul E. McKenney, (Wed Apr 14, 9:24 pm)
Re: Weird rcu lockdep warning, Frederic Weisbecker, (Thu Apr 15, 11:57 am)
Re: Weird rcu lockdep warning, Paul E. McKenney, (Thu Apr 15, 12:47 pm)