Re: oprofile + hibernation = badness

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Rafael J. Wysocki <rjw@...>
Cc: Pavel Machek <pavel@...>, Robert Richter <robert.richter@...>, Ingo Molnar <mingo@...>, Andi Kleen <ak@...>, Philippe Elie <phil.el@...>, Linux Kernel Mailing List <linux-kernel@...>
Date: Monday, August 18, 2008 - 5:08 pm

On Mon, Aug 18, 2008 at 10:51 PM, Rafael J. Wysocki <rjw@sisk.pl> wrote:

That is a good suggestion :-)

Here is offlining:

CPU 1 is now offline
lockdep: fixing up alternatives.
SMP alternatives: switching to UP code
CPU0 attaching NULL sched-domain.
WQ on CPU0, prefer CPU1
CPU1 attaching NULL sched-domain.
CPU0 attaching sched-domain:
 domain 0: span 0 level CPU
  groups: 0
WQ on CPU0, prefer CPU1
WQ on CPU0, prefer CPU1
WQ on CPU0, prefer CPU1
[repeat last message indefinitely]

Here is onlining:

Booting processor 1/1 ip 6000
Initializing CPU#1
WQ on CPU0, prefer CPU1
WQ on CPU0, prefer CPU1
Calibrating delay using timer specific routine.. 5986.15 BogoMIPS (lpj=29930790)
CPU: Trace cache: 12K uops, L1 D cache: 16K
CPU: L2 cache: 2048K
CPU: Physical Processor ID: 0
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
CPU1: Intel P4/Xeon Extended MCE MSRs (24) available
CPU1: Thermal monitoring enabled
x86 PAT enabled: cpu 1, old 0x7040600070406, new 0x7010600070106
CPU1: Intel(R) Pentium(R) 4 CPU 3.00GHz stepping 05
checking TSC synchronization [CPU#0 -> CPU#1]:
Measured 120 cycles TSC warp between CPUs, turning off TSC clock.
Marking TSC unstable due to check_tsc_sync_source failed
APIC error on CPU1: 00(40)
Clockevents: could not switch to one-shot mode:<7>APIC error on CPU1: 40(40)
 lapic is not functional.
Could not switch to high resolution mode on CPU 0
Clockevents: could not switch to one-shot mode: lapic is not functional.
Could not switch to high resolution mode on CPU 1
APIC error on CPU1: 40(40)
[sched domains messages
WQ on CPU0, prefer CPU1
APIC error on CPU1: 40(40)
[repeat last message 9 times]

Then follows this pattern indefinitely:

WQ on CPU0, prefer CPU1
APIC error on CPU1: 40(40)
[repeat last message 9 times]

That's basically the same thing as I saw with suspend. So it can be
reproduced easily with CPU hotplug.


Vegard

-- 
"The animistic metaphor of the bug that maliciously sneaked in while
the programmer was not looking is intellectually dishonest as it
disguises that the error is the programmer's own creation."
	-- E. W. Dijkstra, EWD1036
--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
oprofile + hibernation = badness, Vegard Nossum, (Mon Aug 18, 4:32 pm)
Re: oprofile + hibernation = badness, Andi Kleen, (Mon Aug 18, 9:13 pm)
Re: oprofile + hibernation = badness, Robert Richter, (Mon Sep 1, 12:34 pm)
Re: oprofile + hibernation = badness, Ingo Molnar, (Fri Sep 5, 1:58 pm)
Re: oprofile + hibernation = badness, Robert Richter, (Fri Sep 5, 2:59 pm)
Re: oprofile + hibernation = badness, Ingo Molnar, (Fri Sep 5, 4:31 pm)
Re: oprofile + hibernation = badness, Vegard Nossum, (Tue Aug 19, 3:12 am)
Re: oprofile + hibernation = badness, Ingo Molnar, (Tue Aug 19, 5:49 am)
Re: oprofile + hibernation = badness, Andi Kleen, (Tue Aug 19, 8:12 am)
Re: oprofile + hibernation = badness, Ingo Molnar, (Tue Aug 19, 9:18 am)
Re: oprofile + hibernation = badness, Johannes Weiner, (Tue Aug 19, 8:56 am)
Re: oprofile + hibernation = badness, Andi Kleen, (Tue Aug 19, 9:18 am)
Re: oprofile + hibernation = badness, Robert Richter, (Tue Aug 19, 8:37 am)
Re: oprofile + hibernation = badness, Rafael J. Wysocki, (Mon Aug 18, 4:51 pm)
Re: oprofile + hibernation = badness, Vegard Nossum, (Mon Aug 18, 5:08 pm)
Re: oprofile + hibernation = badness, Rafael J. Wysocki, (Mon Aug 18, 5:15 pm)
Re: oprofile + hibernation = badness, Andrew Morton, (Mon Aug 18, 5:29 pm)