Re: Multipath failover handling (Was: Re: 2.6.24-rc3-mm1)

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: James Bottomley <James.Bottomley@...>
Cc: Hannes Reinecke <hare@...>, Andrew Morton <akpm@...>, Gabriel C <nix.or.die@...>, <linux-kernel@...>, <linux-scsi@...>
Date: Monday, January 7, 2008 - 2:24 pm

James Bottomley wrote:


How about my patches to use new transport error values and make the 
iscsi and fc behave the same.

The problem I think Hannes and I are both trying to solve is this:

1. We do not want to wait for dev_loss_tmo seconds for failover.

2. The FC drivers can hook into fast_io_fail_tmo related callouts and 
with that set that tmo to a very low value like a couple of seconds if 
they are using multipath, so failovers are fast. However, there is a bug 
with where when the fast_io_fail_tmo fires requests that made it to the 
driver get failed and returned to the multipath layer, but commands in 
the blocked request queue are stuck in there until dev_loss_tmo fires.

With my patches here (need to be rediffed and for FC I need to handle 
JamesS's comments about not using a new field for the fast_fail_timeout 
state bit):

http://marc.info/?l=linux-scsi&m=117399843216280&w=2
http://marc.info/?l=linux-scsi&m=117399544112073&w=2
http://marc.info/?l=linux-scsi&m=117399844316771&w=2
http://marc.info/?l=linux-scsi&m=117400203324693&w=2
http://marc.info/?l=linux-scsi&m=117400203324690&w=2

For FC we can use the fast_io_fail_tmo for fast failovers, and commands 
will not get stuck in a blocked queue for dev_loss_tmo seconds because 
when the fast_io_fail_tmo fires the target's queues are unblocked and 
fc_remote_port_chkready() ready kicks in (iSCSI does the same with the 
patches in the links). And with the patches if multipath-tools is 
sending its path testing IO it will get a DID_TRANSPORT_* error code 
that it can use to make a decent path failing decision with.
--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
2.6.24-rc3-mm1, Andrew Morton, (Wed Nov 21, 12:45 am)
Re: 2.6.24-rc3-mm1 - brick my Dell Latitude D820, Dave Young, (Tue Nov 27, 4:25 am)
Re: 2.6.24-rc3-mm1 - brick my Dell Latitude D820, Andrew Morton, (Tue Nov 27, 3:27 am)
Re: 2.6.24-rc3-mm1 - brick my Dell Latitude D820, Ingo Molnar, (Tue Nov 27, 6:25 am)
Re: 2.6.24-rc3-mm1 - brick my Dell Latitude D820, Andrew Morton, (Tue Nov 27, 4:17 am)
Re: 2.6.24-rc3-mm1, Randy Dunlap, (Mon Nov 26, 3:13 pm)
Re: 2.6.24-rc3-mm1, Christoph Lameter, (Mon Nov 26, 3:34 pm)
[PATCH -mm] x86 allnoconfig memory model, Randy Dunlap, (Mon Nov 26, 4:47 pm)
Re: [PATCH -mm] x86 allnoconfig memory model, Christoph Lameter, (Mon Nov 26, 5:00 pm)
Re: [PATCH -mm] x86 allnoconfig memory model, Andrew Morton, (Mon Nov 26, 5:20 pm)
Re: [PATCH -mm] x86 allnoconfig memory model, Christoph Lameter, (Mon Nov 26, 5:52 pm)
Re: [PATCH -mm] x86 allnoconfig memory model, Andrew Morton, (Mon Nov 26, 5:57 pm)
Re: [PATCH -mm] x86 allnoconfig memory model, Christoph Lameter, (Mon Nov 26, 7:19 pm)
Re: [PATCH -mm] x86 allnoconfig memory model, Randy Dunlap, (Mon Nov 26, 5:17 pm)
Re: 2.6.24-rc3-mm1, Randy Dunlap, (Mon Nov 26, 4:40 pm)
Re: 2.6.24-rc3-mm1, Christoph Lameter, (Mon Nov 26, 4:56 pm)
Re: 2.6.24-rc3-mm1, Gabriel C, (Thu Nov 22, 9:39 pm)
Re: 2.6.24-rc3-mm1, Andrew Morton, (Fri Nov 23, 12:12 am)
Re: 2.6.24-rc3-mm1, Gabriel C, (Fri Nov 23, 1:55 am)
Re: 2.6.24-rc3-mm1, Andrew Morton, (Tue Nov 27, 2:15 am)
Re: 2.6.24-rc3-mm1, James Bottomley, (Tue Dec 11, 12:33 pm)
Re: 2.6.24-rc3-mm1, Hannes Reinecke, (Fri Dec 14, 5:00 am)
Re: 2.6.24-rc3-mm1, James Bottomley, (Fri Dec 14, 10:26 am)
Multipath failover handling (Was: Re: 2.6.24-rc3-mm1), Hannes Reinecke, (Mon Jan 7, 10:05 am)
Re: Multipath failover handling (Was: Re: 2.6.24-rc3-mm1), James Bottomley, (Mon Jan 7, 1:57 pm)
Re: Multipath failover handling (Was: Re: 2.6.24-rc3-mm1), Mike Christie, (Mon Jan 7, 2:24 pm)
Re: 2.6.24-rc3-mm1, Boaz Harrosh, (Wed Dec 12, 6:08 am)
Re: 2.6.24-rc3-mm1, Jens Axboe, (Wed Dec 12, 7:36 am)
[PATCH] REQ-flags to/from BIO-flags bugfix, Boaz Harrosh, (Wed Dec 12, 7:03 am)
Re: [PATCH] REQ-flags to/from BIO-flags bugfix, Matthew Wilcox, (Wed Dec 12, 11:18 am)
Re: [PATCH] REQ-flags to/from BIO-flags bugfix, Boaz Harrosh, (Wed Dec 12, 12:06 pm)
Re: [PATCH] REQ-flags to/from BIO-flags bugfix, Matthew Wilcox, (Wed Dec 12, 12:33 pm)
Re: [PATCH] REQ-flags to/from BIO-flags bugfix, Matthew Wilcox, (Wed Dec 12, 11:54 am)
Re: [PATCH] REQ-flags to/from BIO-flags bugfix, David Chinner, (Thu Dec 13, 1:36 am)
Re: 2.6.24-rc3-mm1, Kirill A. Shutemov, (Thu Nov 22, 6:22 am)
Re: 2.6.24-rc3-mm1, Andrew Morton, (Thu Nov 22, 8:18 pm)
Re: 2.6.24-rc3-mm1, Thomas Gleixner, (Thu Nov 22, 8:48 pm)
Re: 2.6.24-rc3-mm1, Kirill A. Shutemov, (Fri Nov 23, 2:05 am)
Re: 2.6.24-rc3-mm1, Andreas Herrmann, (Fri Nov 23, 4:59 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Laurent Riffard, (Wed Nov 21, 5:45 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Andrew Morton, (Wed Nov 21, 6:41 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Laurent Riffard, (Fri Nov 23, 3:29 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Hannes Reinecke, (Fri Nov 23, 3:51 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Hannes Reinecke, (Fri Nov 23, 7:38 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, James Bottomley, (Sat Nov 24, 1:44 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Hannes Reinecke, (Mon Nov 26, 3:54 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Laurent Riffard, (Fri Nov 23, 1:52 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, James Bottomley, (Sat Nov 24, 2:42 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Laurent Riffard, (Sat Nov 24, 8:57 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, James Bottomley, (Sat Nov 24, 9:26 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Laurent Riffard, (Sat Nov 24, 6:59 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, James Bottomley, (Sun Nov 25, 3:37 am)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Laurent Riffard, (Sun Nov 25, 4:39 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Laurent Riffard, (Wed Nov 28, 5:38 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Gabriel C, (Sat Nov 24, 1:54 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, James Bottomley, (Sat Nov 24, 2:04 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Gabriel C, (Sat Nov 24, 2:08 pm)
Re: 2.6.24-rc3-mm1: I/O error, system hangs, Gabriel C, (Sat Nov 24, 2:28 pm)
Re: 2.6.24-rc3-mm1: usb mouse doesn't work, Kirill A. Shutemov, (Wed Nov 21, 2:23 pm)
Re: 2.6.24-rc3-mm1: usb mouse doesn't work, Andrew Morton, (Wed Nov 21, 6:22 pm)
Re: 2.6.24-rc3-mm1: usb mouse doesn't work, Kirill A. Shutemov, (Thu Nov 22, 6:17 am)
Re: [linux-usb-devel] 2.6.24-rc3-mm1: usb mouse doesn't work, Kirill A. Shutemov, (Fri Nov 23, 1:19 am)
Re: 2.6.24-rc3-mm1- powerpc link failure, Kamalesh Babulal, (Wed Nov 21, 4:06 am)
Re: 2.6.24-rc3-mm1- powerpc link failure, Stephen Rothwell, (Wed Nov 21, 6:52 pm)
Re: 2.6.24-rc3-mm1 (sync is slow ?), KAMEZAWA Hiroyuki, (Wed Nov 21, 4:42 am)
Re: 2.6.24-rc3-mm1 (sync is slow ?), Andrew Morton, (Wed Nov 21, 4:49 am)
Re: 2.6.24-rc3-mm1 (sync is slow ?), kosaki, (Sat Nov 24, 8:04 am)
Re: 2.6.24-rc3-mm1 (sync is slow ?), Gabriel C, (Sat Nov 24, 2:04 pm)
Re: 2.6.24-rc3-mm1 (sync is slow ?), KAMEZAWA Hiroyuki, (Mon Nov 26, 3:06 am)
Re: 2.6.24-rc3-mm1 (sync is slow ?), KAMEZAWA Hiroyuki, (Wed Nov 21, 11:06 pm)
Re: 2.6.24-rc3-mm1 (sync is slow ?), KAMEZAWA Hiroyuki, (Wed Nov 21, 4:49 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Kamalesh Babulal, (Wed Nov 21, 4:24 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Andrew Morton, (Tue Nov 20, 8:32 pm)
Re: 2.6.24-rc3-mm1 make headers_check fails, Kamalesh Babulal, (Wed Nov 21, 4:41 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Avi Kivity, (Wed Nov 21, 4:44 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Sam Ravnborg, (Wed Nov 21, 5:58 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Adrian Bunk, (Sat Nov 24, 10:34 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Avi Kivity, (Wed Nov 21, 6:00 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Avi Kivity, (Wed Nov 21, 6:17 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Andrew Morton, (Wed Nov 28, 1:02 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Avi Kivity, (Sun Dec 2, 4:56 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Robert P. J. Day, (Wed Nov 21, 6:31 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Robert P. J. Day, (Wed Nov 21, 4:52 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Andrew Morton, (Wed Nov 21, 5:04 am)
Re: 2.6.24-rc3-mm1 make headers_check fails, Robert P. J. Day, (Wed Nov 21, 5:06 am)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Kamalesh Babulal, (Wed Nov 21, 2:11 am)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Andrew Morton, (Wed Nov 21, 2:18 am)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Rik van Riel, (Mon Nov 26, 3:39 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Andrew Morton, (Mon Nov 26, 4:33 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Christoph Lameter, (Mon Nov 26, 4:56 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Rik van Riel, (Mon Nov 26, 4:54 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Ingo Molnar, (Mon Nov 26, 4:45 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Jiri Slaby, (Mon Nov 26, 6:08 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Andrew Morton, (Mon Nov 26, 6:17 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Jiri Slaby, (Mon Nov 26, 7:14 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Andrew Morton, (Mon Nov 26, 7:28 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Rik van Riel, (Tue Nov 27, 1:50 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Jiri Slaby, (Mon Nov 26, 6:22 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Alexey Dobriyan, (Fri Nov 23, 8:49 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Len Brown, (Wed Nov 21, 3:22 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Torsten Kaiser, (Wed Nov 21, 3:48 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Kamalesh Babulal, (Wed Nov 21, 5:22 am)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Andrew Morton, (Wed Nov 21, 5:29 am)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Torsten Kaiser, (Wed Nov 21, 3:33 pm)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Kirill A. Shutemov, (Thu Nov 22, 6:04 am)
Re: 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC, Kamalesh Babulal, (Wed Nov 21, 5:43 am)
Re: 2.6.24-rc3-mm1 - Build Failure on S390x, Kamalesh Babulal, (Wed Nov 21, 1:56 am)
Re: 2.6.24-rc3-mm1 - Build Failure on S390x, Andrew Morton, (Wed Nov 21, 2:04 am)
Re: 2.6.24-rc3-mm1, KAMEZAWA Hiroyuki, (Wed Nov 21, 1:58 am)
Re: 2.6.24-rc3-mm1, Andrew Morton, (Wed Nov 21, 2:08 am)
Re: 2.6.24-rc3-mm1, Rene Herman, (Wed Nov 21, 8:49 am)
Re: 2.6.24-rc3-mm1, Dave Young, (Wed Nov 21, 1:51 am)
Re: 2.6.24-rc3-mm1, Andrew Morton, (Wed Nov 21, 2:00 am)
Re: 2.6.24-rc3-mm1, Dave Young, (Wed Nov 21, 2:03 am)
Re: 2.6.24-rc3-mm1, Rik van Riel, (Mon Nov 26, 2:48 pm)
Re: 2.6.24-rc3-mm1, Jiri Slaby, (Mon Nov 26, 3:33 pm)
Re: 2.6.24-rc3-mm1, Andrew Morton, (Wed Nov 21, 2:15 am)
Re: 2.6.24-rc3-mm1, Kirill A. Shutemov, (Wed Nov 21, 2:35 pm)
Re: 2.6.24-rc3-mm1, Andrew Morton, (Wed Nov 21, 6:25 pm)
Re: 2.6.24-rc3-mm1, Dave Young, (Wed Nov 21, 2:22 am)