Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: David Miller
Date: Wednesday, September 24, 2008 - 12:36 am

From: "Dave Airlie" <airlied@gmail.com>
Date: Wed, 24 Sep 2008 15:45:46 +1000


Sure.  My theory is that it's a recent xorg change causing this,
so I've been going through GIT history for xserver, libpciaccess,
and the intel driver for the past year looking for clues.

If there is usually a gap after the video device, there would just
be no response from the PCI bus, and the way that's handled is
chipset specific.  At least a while back, most x86 systems would
silently ignore writes and return all 1's in such a case, but
they may be generating bus error events these days.  I simply don't
know.


The e1000e side here is reproducable way too easily for it to be the
same case, as far as I see it.

The e1000 driver has probably had this problem for years and we've
only recently had some concrete cases of it triggering.

Also, what utility are you running on your system that is even
accessing the NVRAM on the e1000e card?  Knowing that might help
us understand why this problem has appeared now.  Maybe there is
some diagnostic or monitoring tool that is now becoming prevalent
in these distributions where it triggers.

This problem started happening seemingly "all of a sudden", even to
people who have been keeping sort-of recent with their kernels, such
as yourself.

Yet we can't get any sense yet what range of kernel versions are in
use when the problem triggers.

I'm about to leave for a week or so in Paris for the netfilter
workshop, so I hope that someone other than myself will do some data
mining like I have instead of (merely) tossing theories around and
finger pointing.
--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
2.6.27-rc6-git6: Reported regressions from 2.6.26, Rafael J. Wysocki, (Sun Sep 21, 11:52 am)
[Bug #11207] VolanoMark regression with 2.6.27-rc1, Rafael J. Wysocki, (Sun Sep 21, 11:52 am)
[Bug #11210] libata badness, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11215] INFO: possible recursive locking detected ps2 ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11220] Screen stays black after resume, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11264] Invalid op opcode in kernel/workqueue, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11237] corrupt PMD after resume, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11224] Only three cores found on quad-core machine., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11230] Kconfig no longer outputs a .config with fres ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11308] tbench regression on each kernel release from ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11335] 2.6.27-rc2-git5 BUG: unable to handle kernel ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11272] BUG: parport_serial in 2.6.27-rc1 for NetMos ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11271] BUG: fealnx in 2.6.27-rc1, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11380] lockdep warning: cpu_add_remove_lock at:cpu_m ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11357] Can not boot up with zd1211rw USB-Wlan Stick, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11340] LTP overnight run resulted in unusable box, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11404] BUG: in 2.6.23-rc3-git7 in do_cciss_intr, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11459] kernel crash after wifi connection established, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11442] btusb hibernation/suspend breakage in current ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11439] [2.6.27-rc4-git4] compilation warnings, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11407] suspend: unable to handle kernel paging request, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11465] Linux-2.6.27-rc5, drm errors in log, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11476] failure to associate after resume from suspen ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11501] Failed to open destination file: Permission d ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11516] severe performance degradation on x86_64 goin ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11506] oops during unmount - ext3? (2.6.27-rc5), Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11507] usb: sometimes dead keyboard after boot, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11505] oltp ~10% regression with 2.6.27-rc5 on stoak ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11548] kernel BUG at drivers/pci/intel-iommu.c:1373!, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11543] kernel panic: softlockup in tick_periodic() ???, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11552] Disabling IRQ #23, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11551] Semi-repeatable hard lockup on 2.6.27-rc6, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11549] 2.6.27-rc5 acpi: EC Storm error message on bootup, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11568] spontaneous reboot on resume with 2.6.27, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11590] Nokia 5310 Xpress usb-storage not mounting, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11569] Don't complain about disabled irqs when the s ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11609] oops in find_get_page, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11610] Problem with kernel commit 664d080c41463570b9 ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11608] 2.6.27-rc6 BUG: unable to handle kernel pagin ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11611] Commit 2344abbcbdb82140050e8be29d3d55e4f6fe86 ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
Re: 2.6.27-rc6-git6: Reported regressions from 2.6.26, Alexey Starikovskiy, (Sun Sep 21, 2:57 pm)
Re: [Bug #11610] Problem with kernel commit 664d080c414635 ..., Michal 'vorner' Vaner, (Sun Sep 21, 4:10 pm)
Re: [Bug #11552] Disabling IRQ #23, Justin Mattock, (Sun Sep 21, 4:16 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Sun Sep 21, 4:51 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Dave Airlie, (Sun Sep 21, 11:59 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Mon Sep 22, 12:01 am)
Re: [Bug #11552] Disabling IRQ #23, Alan Stern, (Mon Sep 22, 3:53 am)
Re: [Bug #11552] Disabling IRQ #23, Justin Mattock, (Mon Sep 22, 9:20 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Mon Sep 22, 3:28 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Mon Sep 22, 6:59 pm)
Re: [Bug #11568] spontaneous reboot on resume with 2.6.27, Andy Wettstein, (Mon Sep 22, 7:13 pm)
Re: [Bug #11543] kernel panic: softlockup in tick_periodic ..., Rafael J. Wysocki, (Tue Sep 23, 6:52 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Renato S. Yamane, (Tue Sep 23, 9:38 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Tue Sep 23, 2:05 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Tue Sep 23, 3:05 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Tue Sep 23, 3:07 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jeff Kirsher, (Tue Sep 23, 3:12 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Tue Sep 23, 9:12 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Dave Airlie, (Tue Sep 23, 10:45 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Newall, (Tue Sep 23, 11:02 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Wed Sep 24, 12:36 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Wed Sep 24, 2:01 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jonathan Corbet, (Wed Sep 24, 9:27 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jiri Kosina, (Wed Sep 24, 11:10 am)
Re: [Bug #11548] kernel BUG at drivers/pci/intel-iommu.c:1373!, Rafael J. Wysocki, (Wed Sep 24, 11:23 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Kyle McMartin, (Wed Sep 24, 12:10 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Brandeburg, (Wed Sep 24, 12:22 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Wed Sep 24, 12:52 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Theodore Tso, (Wed Sep 24, 1:47 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Parag Warudkar, (Wed Sep 24, 3:54 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Wed Sep 24, 5:26 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Wed Sep 24, 9:00 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Brandeburg, (Wed Sep 24, 9:25 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Thu Sep 25, 9:08 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Krzysztof Halasa, (Thu Sep 25, 9:26 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jiri Kosina, (Thu Sep 25, 10:24 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, H. Peter Anvin, (Thu Sep 25, 11:39 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, H. Peter Anvin, (Thu Sep 25, 11:46 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Thu Sep 25, 11:56 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jiri Kosina, (Thu Sep 25, 12:01 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Krzysztof Halasa, (Thu Sep 25, 12:23 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Thu Sep 25, 12:36 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Thu Sep 25, 12:43 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Thu Sep 25, 1:06 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Brandeburg, (Thu Sep 25, 2:42 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, H. Peter Anvin, (Thu Sep 25, 3:57 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Alexey Rempel, (Fri Sep 26, 12:06 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Krzysztof Halasa, (Fri Sep 26, 11:55 am)
Re: [Bug #11220] Screen stays black after resume, Pavel Machek, (Tue Sep 30, 3:25 pm)