Re: e1000e NVM corruption issue status

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Brandeburg, Jesse <jesse.brandeburg@...>
Cc: LKML <linux-kernel@...>, <agospoda@...>, Ronciak, John <john.ronciak@...>, Allan, Bruce W <bruce.w.allan@...>, Graham, David <david.graham@...>, <kkiel@...>, Thomas Gleixner <tglx@...>, <chris.jones@...>, <arjan@...>
Date: Monday, September 29, 2008 - 11:52 am

On Thu, 25 Sep 2008, Brandeburg, Jesse wrote:

> in case your mailer hoses something apply in this order:

When using this patchset (plus patch that adds check for address range in
pci_mmap_resource() by Jesse Barnes), the machine (that already has
corrupted (but not completely erased)) hangs after dumping eeprom
contents:

0000:00:19.0: 0000:00:19.0: The NVM Checksum Is Not Valid
/*********************/
Current EEPROM Checksum : 0x2259
Calculated : 0xa259
Offset Values
======== ======
00000000: 00 15 58 c6 4a ff 00 08 ff ff 30 00 ff ff ff ff
00000010: ff ff ff ff c7 10 b9 20 aa 17 49 10 86 80 00 00
00000020: 01 0d 00 00 00 00 05 16 20 50 00 38 00 00 8b 0d
00000030: 02 06 c1 01 03 08 00 00 00 00 00 00 00 00 00 00
00000040: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00000050: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00000060: 00 01 00 40 28 12 07 40 ff ff ff ff ff ff ff ff
00000070: ff ff ff ff ff ff ff ff ff ff ff ff ff ff 59 22
/*********************/

after this, alt-sysrq-p indicates that it's somehow running in the loops
around r1000_read_nvm_ich8lan and e1000_release_swflag_uch8lan. Below
there are several subsequent alt-sysrq-p outputs on this frozen system

SysRq : Show Regs
CPU 1:
Modules linked in: pcmcia_core v4l1_compat pcspkr(+) e1000e(+) button(+)
joydev led_class parport soundcore sg sr_mod cdrom sd_mod crc_t10r
Pid: 841, comm: modprobe Tainted: G 2.6.27-rc6-7.10-default #1
RIP: 0010:[] []
e1000_release_swflag_ich8lan+0x15/0x3c [e1000e]
RSP: 0018:ffff88003adb5b48 EFLAGS: 00000286
RAX: ffffc20004540f00 RBX: ffff88003adb5b48 RCX: ffff88003adb5b7e
RDX: 0000000000000022 RSI: 000000000000431c RDI: ffff88003c44cb28
RBP: 0000000000000000 R08: ffff88003adb5b7e R09: ffff88003c44cb28
R10: ffff88003adb5b7e R11: ffff88003adb5ad8 R12: ffff88003c44cb28
R13: ffffc20004520008 R14: ffffffff8020c394 R15: ffff88003adb5b08
FS: 00007f398e1eb6f0(0000) GS:ffff88003e1e93c0(0000)
knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000061ee78 CR3: 000000003b125000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400

Call Trace:
Inexact backtrace:

[] ? e1000_release_swflag_ich8lan+0x9/0x3c [e1000e]
[] ? e1000_read_nvm_ich8lan+0xf2/0x10a [e1000e]
[] ? mcount_call+0x5/0x31
[] ? e1000e_validate_nvm_checksum_generic+0x34/0x62
[e1000e]
[] ? e1000_validate_nvm_checksum_ich8lan+0x6c/0x73
[e1000e]
[] ? e1000_probe+0x5a4/0xb7e [e1000e]
[] ? e1000_probe+0x5a4/0xb7e [e1000e]
[] ? set_cpus_allowed_ptr+0x119/0x126
[] ? kobject_get+0x1a/0x22
[] ? pci_device_probe+0xc9/0x120
[] ? driver_probe_device+0xc5/0x173
[] ? __driver_attach+0x54/0x7e
[] ? __driver_attach+0x0/0x7e
[] ? bus_for_each_dev+0x54/0x8e
[] ? driver_attach+0x21/0x23
[] ? bus_add_driver+0xbc/0x206
[] ? driver_register+0xad/0x12d
[] ? __pci_register_driver+0x6b/0xa5
[] ? e1000_init_module+0x0/0x75 [e1000e]
[] ? e1000_init_module+0x59/0x75 [e1000e]
[] ? _stext+0x4c/0x151
[] ? sys_init_module+0xae/0x1cc
[] ? system_call_fastpath+0x16/0x1b

SysRq : Show Regs
CPU 1:
Modules linked in: pcmcia_core v4l1_compat pcspkr(+) e1000e(+) button(+)
joydev led_class parport soundcore sg sr_mod cdrom sd_mod crc_t10r
Pid: 841, comm: modprobe Tainted: G 2.6.27-rc6-7.10-default #1
RIP: 0010:[] []
e1000_flash_cycle_ich8lan+0x3d/0x6d [e1000e]
RSP: 0018:ffff88003adb5ae8 EFLAGS: 00000282
RAX: ffffc20004526009 RBX: ffff88003adb5b08 RCX: 000000006dae9ce2
RDX: 000000006dae9ce2 RSI: 000000000000431c RDI: 00000000000007bc
RBP: 0000000000000018 R08: ffff88003adb5b7e R09: ffff88003c44cb28
R10: ffff88003adb5b7e R11: ffff88003adb5ac8 R12: ffff88003adb5a68
R13: 0000000000000282 R14: 0000000000000010 R15: ffffffff802135d0
FS: 00007f398e1eb6f0(0000) GS:ffff88003e1e93c0(0000)
knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000061ee78 CR3: 000000003b125000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400

Call Trace:
Inexact backtrace:

[] ? e1000_flash_cycle_ich8lan+0x4d/0x6d [e1000e]
[] ? e1000_read_flash_data_ich8lan+0xab/0x104 [e1000e]
[] ? e1000_read_nvm_ich8lan+0xcc/0x10a [e1000e]
[] ? mcount_call+0x5/0x31
[] ? e1000e_validate_nvm_checksum_generic+0x34/0x62
[e1000e]
[] ? e1000_validate_nvm_checksum_ich8lan+0x6c/0x73
[e1000e]
[] ? e1000_probe+0x5a4/0xb7e [e1000e]
[] ? e1000_probe+0x5a4/0xb7e [e1000e]
[] ? set_cpus_allowed_ptr+0x119/0x126
[] ? kobject_get+0x1a/0x22
[] ? pci_device_probe+0xc9/0x120
[] ? driver_probe_device+0xc5/0x173
[] ? __driver_attach+0x54/0x7e
[] ? __driver_attach+0x0/0x7e
[] ? bus_for_each_dev+0x54/0x8e
[] ? driver_attach+0x21/0x23
[] ? bus_add_driver+0xbc/0x206
[] ? driver_register+0xad/0x12d
[] ? __pci_register_driver+0x6b/0xa5
[] ? e1000_init_module+0x0/0x75 [e1000e]
[] ? e1000_init_module+0x59/0x75 [e1000e]
[] ? _stext+0x4c/0x151
[] ? sys_init_module+0xae/0x1cc
[] ? system_call_fastpath+0x16/0x1b

SysRq : Show Regs
CPU 1:
Modules linked in: pcmcia_core v4l1_compat pcspkr(+) e1000e(+) button(+)
joydev led_class parport soundcore sg sr_mod cdrom sd_mod crc_t10r
Pid: 841, comm: modprobe Tainted: G 2.6.27-rc6-7.10-default #1
RIP: 0010:[] []
e1000_read_flash_data_ich8lan+0xbb/0x104 [e1000e]
RSP: 0018:ffff88003adb5b18 EFLAGS: 00000282
RAX: 0000000000000000 RBX: ffff88003adb5b48 RCX: 000000002edf089a
RDX: 0000000000000000 RSI: 000000000000431c RDI: 00000000000007bc
RBP: 000000000027e044 R08: ffff88003adb5b7e R09: ffff88003c44cb28
R10: ffff88003adb5b7e R11: ffff88003adb5ac8 R12: 0000000000000000
R13: 0000000000000100 R14: ffffffff8037db03 R15: ffff88003adb5ac8
FS: 00007f398e1eb6f0(0000) GS:ffff88003e1e93c0(0000)
knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000061ee78 CR3: 000000003b125000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400

Call Trace:
Inexact backtrace:

[] ? e1000_read_flash_data_ich8lan+0xab/0x104 [e1000e]
[] ? e1000_read_nvm_ich8lan+0xcc/0x10a [e1000e]
[] ? mcount_call+0x5/0x31
[] ? e1000e_validate_nvm_checksum_generic+0x34/0x62
[e1000e]
[] ? e1000_validate_nvm_checksum_ich8lan+0x6c/0x73
[e1000e]
[] ? e1000_probe+0x5a4/0xb7e [e1000e]
[] ? e1000_probe+0x5a4/0xb7e [e1000e]
[] ? set_cpus_allowed_ptr+0x119/0x126
[] ? kobject_get+0x1a/0x22
[] ? pci_device_probe+0xc9/0x120
[] ? driver_probe_device+0xc5/0x173
[] ? __driver_attach+0x54/0x7e
[] ? __driver_attach+0x0/0x7e
[] ? bus_for_each_dev+0x54/0x8e
[] ? driver_attach+0x21/0x23
[] ? bus_add_driver+0xbc/0x206
[] ? driver_register+0xad/0x12d
[] ? __pci_register_driver+0x6b/0xa5
[] ? e1000_init_module+0x0/0x75 [e1000e]
[] ? e1000_init_module+0x59/0x75 [e1000e]
[] ? _stext+0x4c/0x151
[] ? sys_init_module+0xae/0x1cc
[] ? system_call_fastpath+0x16/0x1b

--
Jiri Kosina
SUSE Labs
--

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 9:50 pm)
Re: e1000e NVM corruption issue status, James Courtier-Dutton, (Sat Oct 18, 3:13 pm)
Re: e1000e NVM corruption issue status, Jiri Kosina, (Sat Oct 18, 6:49 pm)
Re: e1000e NVM corruption issue status, Karsten Keil, (Fri Sep 26, 3:19 am)
Re: e1000e NVM corruption issue status, Jesse Brandeburg, (Fri Sep 26, 1:44 am)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:01 pm)
Re: e1000e NVM corruption issue status, Karsten Keil, (Fri Sep 26, 10:23 am)
Re: e1000e NVM corruption issue status, Jiri Kosina, (Fri Sep 26, 2:13 am)
Re: e1000e NVM corruption issue status, Arjan van de Ven, (Fri Sep 26, 7:49 am)
Re: e1000e NVM corruption issue status, Jesse Barnes, (Fri Sep 26, 1:52 pm)
Re: e1000e NVM corruption issue status, Jesse Barnes, (Fri Sep 26, 2:23 pm)
Re: e1000e NVM corruption issue status, Tim Gardner, (Fri Sep 26, 2:53 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Fri Sep 26, 8:05 pm)
Re: e1000e NVM corruption issue status, Tim Gardner, (Sat Sep 27, 12:20 am)
Re: e1000e NVM corruption issue status, Krzysztof Halasa, (Fri Sep 26, 6:04 pm)
RE: e1000e NVM corruption issue status, Brandeburg, Jesse, (Fri Sep 26, 6:23 pm)
Re: e1000e NVM corruption issue status, Krzysztof Halasa, (Sat Sep 27, 2:45 pm)
Re: e1000e NVM corruption issue status, Jesse Barnes, (Fri Sep 26, 2:39 pm)
Re: e1000e NVM corruption issue status, Jesse Barnes, (Fri Sep 26, 2:43 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:13 pm)
Re: e1000e NVM corruption issue status, Jiri Kosina, (Mon Sep 29, 11:52 am)
Re: e1000e NVM corruption issue status, Jiri Kosina, (Mon Sep 29, 12:20 pm)
RE: e1000e NVM corruption issue status, Brandeburg, Jesse, (Mon Sep 29, 12:24 pm)
RE: e1000e NVM corruption issue status, Jiri Kosina, (Mon Sep 29, 1:18 pm)
RE: e1000e NVM corruption issue status, Jiri Kosina, (Mon Sep 29, 1:36 pm)
RE: e1000e NVM corruption issue status, Jiri Kosina, (Mon Sep 29, 6:43 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:13 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:12 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:12 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:11 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:11 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:10 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:10 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:10 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:09 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:09 pm)
Re: e1000e NVM corruption issue status, Ingo Molnar, (Fri Sep 26, 3:12 am)
Re: e1000e NVM corruption issue status, Chris Snook, (Thu Sep 25, 9:58 pm)
Re: e1000e NVM corruption issue status, Brandeburg, Jesse, (Thu Sep 25, 10:04 pm)