Re: [Bug #10860] total system freeze at boot with 2.6.26-rc

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Rafael J. Wysocki <rjw@...>, Christian Casteyde <casteyde.christian@...>
Cc: Linux Kernel Mailing List <linux-kernel@...>
Date: Saturday, June 7, 2008 - 6:24 pm

On Sat, Jun 7, 2008 at 10:42 PM, Rafael J. Wysocki <rjw@sisk.pl> wrote:

I looked at the bugzilla entry, but it said not to post any replies in
there. I hope it's okay to reply here, because I couldn't find the
original discussion on LKML.

Christian, did you try the nmi watchdog parameter on boot? It is
really quite simple -- add nmi_watchdog=1 to the kernel parameters.
When the machine freezes, leave it for a minute or two in that state.
The NMI watchdog code might be able to give us a backtrace and tell us
exactly where the machine is hanging. While waiting, you can prepare
the camera... ;-)

(BTW, why isn't this nmi watchdog trick the "standard" reply to hung
kernels? It seems that very few are actually aware of it, or using it
to debug these cases.)

Anyway, good luck with that! :-)


Vegard

-- 
"The animistic metaphor of the bug that maliciously sneaked in while
the programmer was not looking is intellectually dishonest as it
disguises that the error is the programmer's own creation."
	-- E. W. Dijkstra, EWD1036
--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
2.6.26-rc5-git2: Reported regressions from 2.6.25, Rafael J. Wysocki, (Sat Jun 7, 4:38 pm)
[Bug #10861] 2.6.26-rc4-git2 - long pause during boot, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10872] x86_64 boot hang when CONFIG_NUMA=n, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
Re: [Bug #10872] x86_64 boot hang when CONFIG_NUMA=n, Randy Dunlap, (Wed Jun 11, 4:30 pm)
[Bug #10874] blackfin drivers/net/smc91x.c build error, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10873] serial/bfin_5xx.c build error, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10863] kvm causing memory corruption? now 2.6.26-rc4, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10862] forcedeth: lockdep warning on ethtool -s, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10860] total system freeze at boot with 2.6.26-rc, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
Re: [Bug #10860] total system freeze at boot with 2.6.26-rc, Vegard Nossum, (Sat Jun 7, 6:24 pm)
Re: [Bug #10860] total system freeze at boot with 2.6.26-rc, Rafael J. Wysocki, (Sun Jun 8, 12:58 pm)
[Bug #10830] two different oopses with 2.6.26-rc4, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10823] stuck localhost TCP connections, v2.6.26-rc3+, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10827] 2.6.26rc4 GFS2 oops., Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10826] NFS oops in 2.6.26rc4, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10825] appletouch after wakeup, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
Re: [Bug #10825] appletouch after wakeup, Oliver Neukum, (Mon Jun 9, 5:07 am)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Mon Jun 9, 12:18 pm)
Re: [Bug #10825] appletouch after wakeup, Oliver Neukum, (Mon Jun 9, 3:53 pm)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Mon Jun 9, 4:29 pm)
Re: [Bug #10825] appletouch after wakeup, Oliver Neukum, (Mon Jun 9, 4:31 pm)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Mon Jun 9, 4:52 pm)
Re: [Bug #10825] appletouch after wakeup, Oliver Neukum, (Mon Jun 9, 5:16 pm)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Mon Jun 9, 6:04 pm)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Mon Jun 9, 6:36 pm)
Re: [Bug #10825] appletouch after wakeup, Oliver Neukum, (Mon Jun 9, 6:40 pm)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Mon Jun 9, 8:03 pm)
Re: [Bug #10825] appletouch after wakeup, Jiri Kosina, (Mon Jun 9, 8:06 pm)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Mon Jun 9, 9:11 pm)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Sat Jun 7, 6:54 pm)
Re: [Bug #10825] appletouch after wakeup, Rafael J. Wysocki, (Sun Jun 8, 12:53 pm)
Re: [Bug #10825] appletouch after wakeup, Justin Mattock, (Sun Jun 8, 2:26 pm)
[Bug #10819] Fatal DMA error with b43 driver since 2.6.26, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10816] vt/fbcon: fix background color on line feed, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
Re: [Bug #10816] vt/fbcon: fix background color on line feed, Rafael J. Wysocki, (Sun Jun 8, 12:52 pm)
[Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Adrian Bunk, (Fri Jun 13, 9:52 am)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Sat Jun 14, 10:42 am)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Oleg Nesterov, (Sat Jun 14, 10:58 am)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Sat Jun 14, 2:12 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Alexey Dobriyan, (Sat Jun 14, 3:43 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Sat Jun 14, 11:30 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Alexey Dobriyan, (Sun Jun 15, 12:21 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Sun Jun 15, 2:17 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Sun Jun 15, 7:26 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Linus Torvalds, (Sun Jun 15, 4:32 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Sun Jun 15, 7:27 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Linus Torvalds, (Sun Jun 15, 7:38 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Alexey Dobriyan, (Sun Jun 15, 11:01 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Vegard Nossum, (Mon Jun 16, 9:53 am)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Sun Jun 15, 11:31 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Alexey Dobriyan, (Sun Jun 15, 11:46 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Mon Jun 16, 11:42 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Alexey Dobriyan, (Mon Jun 23, 8:50 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Tue Jun 24, 8:04 am)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Tue Jun 24, 5:08 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Ingo Molnar, (Tue Jun 24, 5:15 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Paul E. McKenney, (Wed Jun 25, 5:04 am)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Linus Torvalds, (Mon Jun 23, 9:31 pm)
Re: [Bug #10815] 2.6.26-rc4: RIP find_pid_ns+0x6b/0xa0, Nick Piggin, (Mon Jun 23, 9:51 pm)
[Bug #10799] sky2 general protection fault, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10794] mips: CONF_CM_DEFAULT build error, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
Re: [Bug #10794] mips: CONF_CM_DEFAULT build error, Adrian Bunk, (Wed Jun 11, 2:51 pm)
[Bug #10787] pcie hotplug bootup crash fix, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
Re: [Bug #10787] pcie hotplug bootup crash fix, Kenji Kaneshige, (Mon Jun 9, 4:22 am)
[Bug #10764] some serial configurations are now broken, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10786] 2.6.26-rc3 64bit SMP does not boot on J5600, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
Re: [Bug #10761] hackbench regression with 2.6.26-rc2 on tul..., Rafael J. Wysocki, (Sun Jun 8, 12:51 pm)
[Bug #10748] dhclient fails to run; capabilities error, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10741] bug in `tty: BKL pushdown'?, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10726] x86-64 NODES_SHIFT compile failure., Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10724] ACPI: EC: GPE storm detected, disabling EC GPE, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10725] Write protect on on, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10714] Badness seen on 2.6.26-rc2 with lockdep enabled, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10629] 2.6.26-rc1-$sha1: RIP __d_lookup+0x8c/0x160, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10616] Horrendous Audio Stutter - current git, Rafael J. Wysocki, (Sat Jun 7, 4:42 pm)
[Bug #10493] mips BCM47XX compile error, Rafael J. Wysocki, (Sat Jun 7, 4:38 pm)
Re: [Bug #10493] mips BCM47XX compile error, Adrian Bunk, (Wed Jun 11, 12:17 pm)