Re: Severe reproducible nouveau breakage in 2.6.36 (and maybe .35)

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: Andrew Lutomirski
Date: Wednesday, November 10, 2010 - 1:06 pm

On Wed, Nov 10, 2010 at 2:28 PM, Andrew Lutomirski <andy@luto.us> wrote:



I tracked it down.  The interrupt code in 2.6.36 is totally broken ---
it acknowledges the interrupt *in the bottom half*.  This might work
by accident if the bottom half gets queued on a different CPU, but
something probably changed (concurrency-managed workqueues?) that make
the BH end up on the same cpu.  So the cpu starves the BH and there
goes a cpu.

Then the clocksource watchdog hits and takes the whole system down
when it calls stop_machine, which also gets starved on that cpu.

Patch coming.

--Andy
--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
Severe reproducible nouveau breakage in 2.6.36 (and maybe .35), Andrew Lutomirski, (Wed Nov 10, 12:28 pm)
Re: Severe reproducible nouveau breakage in 2.6.36 (and ma ..., Andrew Lutomirski, (Wed Nov 10, 1:06 pm)
[PATCH 0/2] Fix nouveau-related freezes, Andy Lutomirski, (Wed Nov 10, 2:21 pm)
[PATCH 0/2] Fix nouveau-related freezes, Andy Lutomirski, (Wed Nov 10, 2:32 pm)
[PATCH 1/2] Use existing defines for NV50 hotplug registers, Andy Lutomirski, (Wed Nov 10, 2:32 pm)
Re: [PATCH 2/2] nouveau: Acknowledge HPD irq in handler, n ..., Andrew Lutomirski, (Wed Nov 10, 3:25 pm)
Re: [PATCH 2/2] nouveau: Acknowledge HPD irq in handler, n ..., Andrew Lutomirski, (Wed Nov 10, 3:51 pm)
Re: [PATCH 2/2] nouveau: Acknowledge HPD irq in handler, n ..., Andrew Lutomirski, (Wed Nov 10, 4:01 pm)