login
Header Space

 
 

Re: [BUG] 2.6.20-rc3-mm1: can not mount root

Previous thread: ACPI bay - 2.6.20-rc3-mm1 hangs on boot by Jiri Kosina on Friday, January 5, 2007 - 9:19 am. (7 messages)

Next thread: Re: Linux Redhat 9.0 - SATA HDD compatibility by Robert Hancock on Friday, January 5, 2007 - 10:25 am. (2 messages)
To: Andrew Morton <akpm@...>, Ingo Molnar <mingo@...>
Cc: <linux-kernel@...>
Date: Friday, January 5, 2007 - 10:20 am

I can not boot machine with 2.6.20-rc3-mm1 and  2.6.20-rc2-mm1.
I made binary search, patch bellow cause this bug:
$quilt top
patches/sched-improve-sched_clock-on-i686.patch

backtrace which I got by connecting "gdb" to machine:

_raw_spin_lock (lock=0xc06c0c60) at lib/spinlock_debug.c:108
108                     for (i = 0; i &lt; loops; i++) {
(gdb) bt
#0  _raw_spin_lock (lock=0xc06c0c60) at lib/spinlock_debug.c:108
#1  0xc056ac42 in _spin_lock (lock=0xc06c0c60) at kernel/spinlock.c:182
#2  0xc011c3bb in vprintk (fmt=0xc0649c00 "&lt;0&gt;BUG: spinlock %s on
CPU#%d, %s/%d\n",
    args=0xc1167a84 "") at kernel/printk.c:534
#3  0xc011c6c7 in printk (fmt=0xc0649c00 "&lt;0&gt;BUG: spinlock %s on
CPU#%d, %s/%d\n")
    at kernel/printk.c:508
#4  0xc027be42 in spin_bug (lock=0xc06c0c60, msg=0xc065fc00
"recursion") at lib/spinlock_debug.c:61
#5  0xc027c178 in _raw_spin_lock (lock=0xc06c0c60) at lib/spinlock_debug.c:79
#6  0xc056ac42 in _spin_lock (lock=0xc06c0c60) at kernel/spinlock.c:182
#7  0xc011c3bb in vprintk (fmt=0xc0626ed0 "&lt;1&gt;BUG: unable to handle
kernel paging request",
    args=0xc1167b8c "") at kernel/printk.c:534
#8  0xc011c6c7 in printk (fmt=0xc0626ed0 "&lt;1&gt;BUG: unable to handle
kernel paging request")
    at kernel/printk.c:508
#9  0xc0116de4 in do_page_fault (regs=0xc1167bcc, error_code=0) at
arch/i386/mm/fault.c:555
#10 0xc056b11c in page_fault ()
#11 0xc0808160 in ?? ()
#12 0xc0626ed0 in kallsyms_token_index ()
#13 0xc1167cac in ?? ()
#14 0x00000001 in ?? ()
#15 0xc0808163 in printk_buf.19225 ()
#16 0xc1167c0c in ?? ()
#17 0x00000000 in ?? ()
-
To: Tomasz Kvarsin <kvarsin@...>
Cc: Ingo Molnar <mingo@...>, <linux-kernel@...>
Date: Tuesday, January 30, 2007 - 4:12 am

On Fri, 5 Jan 2007 17:20:38 +0300

Tomasz, is this still happening in 2.6.20-rc6-mm3?

err.  We merged that patch.  So perhaps 2.6.20-rc6 now crashes in the same
manner?

-
To: Andrew Morton <akpm@...>
Cc: Ingo Molnar <mingo@...>, <linux-kernel@...>
Date: Tuesday, January 30, 2007 - 5:04 am

Have no idea, I tryied to test 2.6.20-rc6-mm2, but


if you merge his patch also all should be fine.
-
To: Andrew Morton <akpm@...>
Cc: Ingo Molnar <mingo@...>, <linux-kernel@...>
Date: Tuesday, January 30, 2007 - 10:45 am

2.6.20-rc6-mm3 seems to work ok.
-
To: Andrew Morton <akpm@...>
Cc: Tomasz Kvarsin <kvarsin@...>, <linux-kernel@...>
Date: Tuesday, January 30, 2007 - 4:22 am

no, we havent merged that patch yet, but it's:

  x86_64-mm-improve-sched_clock-on-i686.patch

I bet this is due to Qemu simulating a CPU that does not truly exist. 
I'll try to reproduce this.

	Ingo
-
To: Ingo Molnar <mingo@...>
Cc: Andrew Morton <akpm@...>, <linux-kernel@...>
Date: Tuesday, January 30, 2007 - 10:48 am

I supposed you was agree that
                               ^^^^^^^^^^^^^^^^
-
To: Andrew Morton <akpm@...>
Cc: Tomasz Kvarsin <kvarsin@...>, <linux-kernel@...>
Date: Tuesday, January 30, 2007 - 4:30 am

cannot see the crash in qemu - i suspect it's .config dependent. Tomasz, 
could you send me the .config you used?

	Ingo
-
To: Tomasz Kvarsin <kvarsin@...>
Cc: Ingo Molnar <mingo@...>, <linux-kernel@...>
Date: Friday, January 5, 2007 - 2:49 pm

On Fri, 5 Jan 2007 17:20:38 +0300


It looks like the machine was trying to oops, only it gets stuck on
logbuf_lock.  Perhaps it hit an oops while running printk_clock() inside
vprintk() then tried to go recursive.

oopses while holding logbuf_lock are rare, and appear to be fatal.  Perhaps
we should ignore logbuf_lock if oops_in_progress, but the chances are we'll
just hit the same oops again..

Do you have "time" on the kernel boot command line?  If so, does removing
that option make the hang go away?  
-
To: Andrew Morton <akpm@...>
Cc: Ingo Molnar <mingo@...>, <linux-kernel@...>
Date: Friday, January 5, 2007 - 3:41 pm

No, I have no "time" option,
I use grub, the config looks like:
kernel /boot/kernel root=/dev/hda1 init=/linuxrc
-
To: Andrew Morton <akpm@...>, Ingo Molnar <mingo@...>
Cc: <linux-kernel@...>
Date: Friday, January 5, 2007 - 1:42 pm

Some details:
system hangs after it print:
VFS: Mounted root (ext3 filesystem) readonly.
And if connect gdb to machine, I see the trace that I include into
previous letter.
My kernel config in attachment.

If you need some more info please tell.
Previous thread: ACPI bay - 2.6.20-rc3-mm1 hangs on boot by Jiri Kosina on Friday, January 5, 2007 - 9:19 am. (7 messages)

Next thread: Re: Linux Redhat 9.0 - SATA HDD compatibility by Robert Hancock on Friday, January 5, 2007 - 10:25 am. (2 messages)
speck-geostationary