On Wed, 2008-05-14 at 15:43 -0700, Ray Lee wrote:
quoted text > On Wed, May 14, 2008 at 12:27 PM, Lee Howard <faxguy@howardsilvan.com> wr=
ote:
quoted text > > But, without kernel messages indicating where to look to debug... what=
is
quoted text > > the best approach to start troubleshooting and debugging this condition=
? Is
quoted text > > there some general debug feature that can be enabled in the kernel that
> > would help hone in on the culprit?
>=20
> There's something called the NMI watchdog, that will print debugging
> messages out if it finds the system has hard locked. The short version
> is that you should add "nmi_watchdog=3D1" (no quotes) to the line in
> GRUB that has the kernel options. That assumes you have an APIC on the
> system. If that's not the case (you're on Uniprocessor, and no APIC)
> then you can try nmi_watchdog=3D2 instead. That'll only work on some
> systems, though.
>=20
> Better docs (than my cheesy writeup) are in
> Documentation/nmi_watchdog.txt in the kernel source distribution.
I was once told to add these to the kernel command line as well when
using NMI watchdog and they do seem to help it trigger more reliably:=20
"idle=3Dpoll nohz=3Doff"
--=20
Zan Lynx <zlynx@acm.org>