Re: 2.6.23.1: mdadm/raid5 hung/d-state

From: Carlos Carvalho
Date: Thursday, November 8, 2007 - 2:40 pm

Jeff Lessem (Jeff@Lessem.org) wrote on 6 November 2007 22:00:
 >Dan Williams wrote:
 > > The following patch, also attached, cleans up cases where the code looks
 > > at sh->ops.pending when it should be looking at the consistent
 > > stack-based snapshot of the operations flags.
 >
 >I tried this patch (against a stock 2.6.23), and it did not work for
 >me.  Not only did I/O to the affected RAID5 & XFS partition stop, but
 >also I/O to all other disks.  I was not able to capture any debugging
 >information, but I should be able to do that tomorrow when I can hook
 >a serial console to the machine.
 >
 >I'm not sure if my problem is identical to these others, as mine only
 >seems to manifest with RAID5+XFS.  The RAID rebuilds with no problem,
 >and I've not had any problems with RAID5+ext3.

Us too! We're stuck trying to build a disk server with several disks
in a raid5 array, and the rsync from the old machine stops writing to
the new filesystem. It only happens under heavy IO. We can make it
lock without rsync, using 8 simultaneous dd's to the array. All IO
stops, including the resync after a newly created raid or after an
unclean reboot.

We could not trigger the problem with ext3 or reiser3; it only happens
with xfs.
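
For anyone who wants to try a similar load, here is a rough sketch of the "8 simultaneous dd's" test in Python. It is only an illustration, not the exact commands we ran: the function names, file names, block size and block count are all assumptions, and the demo at the bottom writes tiny files to a temporary directory. To stress a real array you would pass a directory on the filesystem under test and a much larger block count.

```python
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor

BLOCK = 1024 * 1024  # 1 MiB per write, roughly like dd bs=1M (assumed value)


def write_stream(path, blocks, block_size=BLOCK):
    """Write `blocks` zero-filled blocks to `path`; return bytes written."""
    buf = bytes(block_size)
    written = 0
    # buffering=0 so every write() goes straight to the kernel, as dd does
    with open(path, "wb", buffering=0) as f:
        for _ in range(blocks):
            written += f.write(buf)
        os.fsync(f.fileno())  # force the data out to the device
    return written


def parallel_writers(target_dir, writers=8, blocks=1024, block_size=BLOCK):
    """Run `writers` sequential write streams at once, one file each."""
    # Hypothetical file names, chosen only for this sketch
    paths = [os.path.join(target_dir, f"stress-{i}.bin") for i in range(writers)]
    with ThreadPoolExecutor(max_workers=writers) as pool:
        futures = [pool.submit(write_stream, p, blocks, block_size) for p in paths]
        return [f.result() for f in futures]


if __name__ == "__main__":
    # Harmless demo: 8 writers, 4 blocks of 4 KiB each, in a temp directory
    with tempfile.TemporaryDirectory() as d:
        print(parallel_writers(d, writers=8, blocks=4, block_size=4096))
```

If the array hangs as described above, the writers will simply never return, so run it from a console you can afford to lose.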
