Re: [Bug #11608] 2.6.27-rc6 BUG: unable to handle kernel paging request

From: Nick Piggin
Date: Wednesday, September 24, 2008 - 8:03 pm

On Wed, Sep 24, 2008 at 08:46:55PM -0400, Chuck Ebbert wrote:

[54384.988151] BUG: unable to handle kernel paging request at ffff8800601dd000
[54384.992095] IP: [<ffffffff80375457>] clear_page_c+0x7/0x10
[54384.992095] PGD 202063 PUD 8067 PMD 65d54163 PTE 80002020601dd163
[54384.992095] Oops: 000b [1] SMP DEBUG_PAGEALLOC

I initially suspected PAT (maybe via DEBUG_PAGEALLOC)... but let's see if
the 3rd line of the oops (the page table entries) is useful.

     xRRRRRRRRRRRRRRRRRRRRRRR|40b|<--MAXPHYS     PHYS-->|...RR.actuwp
PGD:                                         001000000010000001100011

     xRRRRRRRRRRRRRRRRRRRRRRR|40b|<--MAXPHYS     PHYS-->|...RR.actuwp
PUD:                                                 1000000001100111

     xRRRRRRRRRRRRRRRRRRRRRRR|40b|<--MAXPHYS     PHYS-->|...Rs.actuwp
PMD:                                 01100101110101010100000101100011

     xRRRRRRRRRRRRRRRRRRRRRRR|40b|<--MAXPHYS     PHYS-->|...gP.actuwp
PTE: 1000000000000000001000000010000001100000000111011101000101100011
     3210987654321098765432109876543210987654321098765432109876543210

Is this a 36-bit physical address CPU? In that case you have 2 bits set
in the pte (bits 37 and 45) that are outside "maxphys". Or if it is a
40-bit CPU, then you have just 1 bit (bit 45) outside maxphys, in which
case I'd say it is memory corruption (maybe a hardware bug, maybe a
scribble from elsewhere). So I'm wrong about PAT.
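
The bit counting above can be checked mechanically. What follows is a minimal sketch, not kernel code: the helper names are made up for illustration, the PTE value is copied from the oops, and it assumes the address field of a 4 KiB PTE spans bits 12 to 51, so anything at or above the CPU's physical address width counts as outside "maxphys".

```python
# Sketch: find address-field bits of an x86-64 PTE that lie at or above an
# assumed physical address width. Helper names are illustrative only.

PTE = 0x80002020601DD163  # PTE value from the oops above

ADDR_FIELD_LOW = 12   # address field starts at bit 12 (4 KiB pages)
ADDR_FIELD_HIGH = 51  # assumed architectural top of the address field


def bits_outside_maxphys(entry, maxphys_bits):
    """Return set bit positions in [maxphys_bits, ADDR_FIELD_HIGH]."""
    return [bit for bit in range(maxphys_bits, ADDR_FIELD_HIGH + 1)
            if entry & (1 << bit)]


def physical_address(entry, maxphys_bits):
    """Return the address bits of entry that fit below maxphys_bits."""
    mask = ((1 << maxphys_bits) - 1) & ~((1 << ADDR_FIELD_LOW) - 1)
    return entry & mask


for width in (36, 40):
    print(width, bits_outside_maxphys(PTE, width))
```

With a 36-bit width this reports bits 37 and 45; with a 40-bit width only bit 45. Masking to 36 bits gives 0x601dd000, which lines up with the low part of the faulting address ffff8800601dd000.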

Interestingly, the PMD also has a 1 set in a reserved bit (page global),
but according to the Intel docs, the CPU doesn't check that bit, so it
is not faulting there.
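
The PMD observation can be reproduced by decoding the low nine bits of the entry. This is a rough sketch with a hypothetical helper; the flag names follow the usual leaf-PTE meanings, which is an assumption here, since some of these bits are defined differently or ignored in an entry that points to a page table.

```python
# Sketch: list which of the low flag bits are set in a page table entry.
# Names follow the usual leaf-PTE meanings; in a non-leaf entry some of
# these bits mean something else or are ignored by the CPU.

PMD = 0x65D54163  # PMD value from the oops above

FLAG_NAMES = {
    0: "present",
    1: "writable",
    2: "user",
    3: "write-through",
    4: "cache-disable",
    5: "accessed",
    6: "dirty",
    7: "page-size",
    8: "global",
}


def set_flags(entry):
    """Return the names of the flag bits (0..8) that are set in entry."""
    return [name for bit, name in sorted(FLAG_NAMES.items())
            if entry & (1 << bit)]


print(set_flags(PMD))
```

For this PMD the "global" bit (bit 8) is set while "page-size" (bit 7) is clear, meaning the entry points to a page table rather than mapping a large page.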

Does the machine survive memtest? Is the bug reproducible? If the
answer is no to either of these, I think we can take it off the
regression list. Otherwise, is it possible to track down to a specific
commit?

Thanks,
Nick
