Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: Jiri Kosina
Date: Wednesday, September 24, 2008 - 6:27 pm

On Thu, 25 Sep 2008, Dave Airlie wrote:


Uh oh. Shouldn't we put something like the patch below in Linus' tree 
unless we get this sorted out? Otherwise more and more people who use -rc 
kernels will run into this, and will get their hardware [hopefully 
temporarily, but not all users are able to re-flash their network card 
EEPROMs, right] bricked.

I know that it is quite aggressive and is going to disable wired 
networking on a lot of systems that have been functioning properly, 
therefore RFC ...



From: Jiri Kosina <jkosina@suse.cz>
Subject: [PATCH] [RFC] E1000E: temporarily disable e1000e driver

E1000E: temporarily disable e1000e driver

There is a serious bug somewhere, that renders e1000e network cards 
unusable on certain hardware configurations by rewriting EEPROM with 0xff 
all over. Debugging this is not trivial, because:

- it is not yet even clear whether the bug is caused by userspace (new 
  version of xorg drivers, bad interaction with PAT, ...) or some bug in 
  kernel code; it's even not yet certain at which exact combination of 
  software versions and hardware configuration this started to trigger
- you have only one attempt to test potential fix. If the fix doesn't 
  work, the eeprom of the card is hosed

and therefore fixing this has potential to take some time.

The tool that will safely restore the previous contents of EEPROM is 
currently being written, but even this is not trivial (Dave Airlie has 
turned his notebook into brick while trying to restore the EEPROM 
contents).

Let's therefore mark this driver as broken (though it is very well 
possible that this particular driver is not at fault at all) until this 
gets resolved, so that users of -rc kernels don't get their network cards 
totally unusable.

References (information about sw/hw configurations of affected systems 
might be found in the bugzillas):

	http://lkml.org/lkml/2008/8/8/123
	http://lkml.org/lkml/2008/9/22/23

	http://bugzilla.kernel.org/show_bug.cgi?id=11382

	https://bugzilla.novell.com/show_bug.cgi?id=425480
	https://bugzilla.redhat.com/show_bug.cgi?id=459202
	https://bugs.launchpad.net/ubuntu/+source/linux/+bug/263555
	https://qa.mandriva.com/show_bug.cgi?id=44147

Signed-off-by: Jiri Kosina <jkosina@suse.cz>

--- 

diff --git a/drivers/net/Kconfig b/drivers/net/Kconfig
index 4a11296..2d7a7f2 100644
--- a/drivers/net/Kconfig
+++ b/drivers/net/Kconfig
@@ -1938,7 +1938,7 @@ config E1000_DISABLE_PACKET_SPLIT
 
 config E1000E
 	tristate "Intel(R) PRO/1000 PCI-Express Gigabit Ethernet support"
-	depends on PCI && (!SPARC32 || BROKEN)
+	depends on PCI && BROKEN
 	---help---
 	  This driver supports the PCI-Express Intel(R) PRO/1000 gigabit
 	  ethernet family of adapters. For PCI or PCI-X e1000 adapters,

--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
2.6.27-rc6-git6: Reported regressions from 2.6.26, Rafael J. Wysocki, (Sun Sep 21, 11:52 am)
[Bug #11207] VolanoMark regression with 2.6.27-rc1, Rafael J. Wysocki, (Sun Sep 21, 11:52 am)
[Bug #11210] libata badness, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11215] INFO: possible recursive locking detected ps2 ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11220] Screen stays black after resume, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11264] Invalid op opcode in kernel/workqueue, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11237] corrupt PMD after resume, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11224] Only three cores found on quad-core machine., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11230] Kconfig no longer outputs a .config with fres ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11308] tbench regression on each kernel release from ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11335] 2.6.27-rc2-git5 BUG: unable to handle kernel ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11272] BUG: parport_serial in 2.6.27-rc1 for NetMos ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11271] BUG: fealnx in 2.6.27-rc1, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11380] lockdep warning: cpu_add_remove_lock at:cpu_m ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11357] Can not boot up with zd1211rw USB-Wlan Stick, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11340] LTP overnight run resulted in unusable box, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11404] BUG: in 2.6.23-rc3-git7 in do_cciss_intr, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11459] kernel crash after wifi connection established, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11442] btusb hibernation/suspend breakage in current ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11439] [2.6.27-rc4-git4] compilation warnings, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11407] suspend: unable to handle kernel paging request, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11465] Linux-2.6.27-rc5, drm errors in log, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11476] failure to associate after resume from suspen ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11501] Failed to open destination file: Permission d ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11516] severe performance degradation on x86_64 goin ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11506] oops during unmount - ext3? (2.6.27-rc5), Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11507] usb: sometimes dead keyboard after boot, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11505] oltp ~10% regression with 2.6.27-rc5 on stoak ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11548] kernel BUG at drivers/pci/intel-iommu.c:1373!, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11543] kernel panic: softlockup in tick_periodic() ???, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11552] Disabling IRQ #23, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11551] Semi-repeatable hard lockup on 2.6.27-rc6, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11549] 2.6.27-rc5 acpi: EC Storm error message on bootup, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11568] spontaneous reboot on resume with 2.6.27, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11590] Nokia 5310 Xpress usb-storage not mounting, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11569] Don't complain about disabled irqs when the s ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11609] oops in find_get_page, Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11610] Problem with kernel commit 664d080c41463570b9 ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11608] 2.6.27-rc6 BUG: unable to handle kernel pagin ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
[Bug #11611] Commit 2344abbcbdb82140050e8be29d3d55e4f6fe86 ..., Rafael J. Wysocki, (Sun Sep 21, 11:54 am)
Re: 2.6.27-rc6-git6: Reported regressions from 2.6.26, Alexey Starikovskiy, (Sun Sep 21, 2:57 pm)
Re: [Bug #11610] Problem with kernel commit 664d080c414635 ..., Michal 'vorner' Vaner, (Sun Sep 21, 4:10 pm)
Re: [Bug #11552] Disabling IRQ #23, Justin Mattock, (Sun Sep 21, 4:16 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Sun Sep 21, 4:51 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Dave Airlie, (Sun Sep 21, 11:59 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Mon Sep 22, 12:01 am)
Re: [Bug #11552] Disabling IRQ #23, Alan Stern, (Mon Sep 22, 3:53 am)
Re: [Bug #11552] Disabling IRQ #23, Justin Mattock, (Mon Sep 22, 9:20 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Mon Sep 22, 3:28 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Mon Sep 22, 6:59 pm)
Re: [Bug #11568] spontaneous reboot on resume with 2.6.27, Andy Wettstein, (Mon Sep 22, 7:13 pm)
Re: [Bug #11543] kernel panic: softlockup in tick_periodic ..., Rafael J. Wysocki, (Tue Sep 23, 6:52 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Renato S. Yamane, (Tue Sep 23, 9:38 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Tue Sep 23, 2:05 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Tue Sep 23, 3:05 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Tue Sep 23, 3:07 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jeff Kirsher, (Tue Sep 23, 3:12 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Tue Sep 23, 9:12 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Dave Airlie, (Tue Sep 23, 10:45 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Newall, (Tue Sep 23, 11:02 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Wed Sep 24, 12:36 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Wed Sep 24, 2:01 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jonathan Corbet, (Wed Sep 24, 9:27 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jiri Kosina, (Wed Sep 24, 11:10 am)
Re: [Bug #11548] kernel BUG at drivers/pci/intel-iommu.c:1373!, Rafael J. Wysocki, (Wed Sep 24, 11:23 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Kyle McMartin, (Wed Sep 24, 12:10 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Brandeburg, (Wed Sep 24, 12:22 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Wed Sep 24, 12:52 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Theodore Tso, (Wed Sep 24, 1:47 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Parag Warudkar, (Wed Sep 24, 3:54 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Wed Sep 24, 5:26 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jiri Kosina, (Wed Sep 24, 6:27 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Wed Sep 24, 9:00 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Brandeburg, (Wed Sep 24, 9:25 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Thu Sep 25, 9:08 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Krzysztof Halasa, (Thu Sep 25, 9:26 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jiri Kosina, (Thu Sep 25, 10:24 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, H. Peter Anvin, (Thu Sep 25, 11:39 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, H. Peter Anvin, (Thu Sep 25, 11:46 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Thu Sep 25, 11:56 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jiri Kosina, (Thu Sep 25, 12:01 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Krzysztof Halasa, (Thu Sep 25, 12:23 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Thu Sep 25, 12:36 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Barnes, (Thu Sep 25, 12:43 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, David Miller, (Thu Sep 25, 1:06 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Jesse Brandeburg, (Thu Sep 25, 2:42 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, H. Peter Anvin, (Thu Sep 25, 3:57 pm)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Alexey Rempel, (Fri Sep 26, 12:06 am)
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM, Krzysztof Halasa, (Fri Sep 26, 11:55 am)
Re: [Bug #11220] Screen stays black after resume, Pavel Machek, (Tue Sep 30, 3:25 pm)