Re: sata_sil24 broken since 2.6.23-rc4-mm1

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Tejun Heo <htejun@...>
Cc: Jeff Garzik <jeff@...>, <linux-kernel@...>, <akpm@...>, Matt Mackall <mpm@...>
Date: Wednesday, October 3, 2007 - 11:55 am

[CC added to author of the bad patch]

Short recap:
Since 2.6.23-rc4-mm1 all mm-kernel randomly fail one of two drives on
my Silicon Image 3132. This failure happens when my initramfs wants to
start the RAID that is on these drives.

The first error libata throws is:
Oct  3 16:56:46 treogen [   63.320000] ata2.00: exception Emask 0x0
SAct 0x1 SErr 0x0 action 0x6 frozen
Oct  3 16:56:46 treogen [   63.320000] ata2.00: cmd
61/08:00:09:d6:42/00:00:25:00:00/40 tag 0 cdb 0x0 data 4096 out
Oct  3 16:56:46 treogen [   63.320000]          res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Oct  3 16:56:46 treogen [   63.320000] ata2.00: status: {DRDY }

Resetting the sata link fails, the drive is no longer reachable until a reboot.

I then bisected the mm-patches from 2.6.23-rc4-mm1 with the following result:

On 10/3/07, Torsten Kaiser <just.for.lkml@googlemail.com> wrote:

The simplify-patch just seems to move some code around, but I see a
real change in the other one:
This patch removes clear_refs_smap() from fs/proc/task_mmu.c by moving
its code to a new function. But during the move the main for-loop from
clear_refs_smap was changed:

old:
	for (vma = mm->mmap; vma; vma = vma->vm_next)
		if (vma->vm_mm && !is_vm_hugetlb_page(vma))
			walk_page_range(vma->vm_mm, vma->vm_start, vma->vm_end,
					&clear_refs_walk, vma);

new:
	for (vma = mm->mmap; vma; vma = vma->vm_next)
		if (!is_vm_hugetlb_page(vma))
			walk_page_range(mm, vma->vm_start, vma->vm_end,
					&clear_refs_walk, vma);

The walk_page_range() is no longer called on vma->vm_mm, but on mm directly.
I don't know how this can kill the sata_sil24-driver, but at least it
looks suspicious.
As I'm not really a kernel hacker, I defer this question to the ones that are.

Torsten
-
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Wed Sep 26, 4:26 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Tejun Heo, (Thu Sep 27, 12:54 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Tejun Heo, (Thu Sep 27, 12:57 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Thu Sep 27, 2:14 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Jeff Garzik, (Thu Sep 27, 2:24 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Thu Sep 27, 1:34 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Tejun Heo, (Thu Sep 27, 4:22 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Fri Sep 28, 1:36 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Sun Sep 30, 2:00 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Tejun Heo, (Sun Sep 30, 10:34 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Sun Sep 30, 12:19 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Tejun Heo, (Sun Sep 30, 1:39 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Sun Sep 30, 2:39 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Mon Oct 1, 2:00 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Wed Oct 3, 11:21 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Wed Oct 3, 11:55 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Matt Mackall, (Wed Oct 3, 12:38 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Thu Oct 4, 1:32 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Matt Mackall, (Thu Oct 4, 1:05 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Fri Oct 5, 2:06 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Sun Oct 7, 4:44 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Sun Oct 7, 10:39 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Tejun Heo, (Wed Oct 10, 11:25 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Jens Axboe, (Thu Oct 11, 4:26 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Tejun Heo, (Thu Oct 11, 4:36 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Jens Axboe, (Thu Oct 11, 6:28 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Thu Oct 11, 1:54 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Tejun Heo, (Thu Oct 11, 2:26 am)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Thu Oct 11, 1:51 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Wed Oct 3, 1:36 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Matt Mackall, (Wed Oct 3, 1:51 pm)
Re: sata_sil24 broken since 2.6.23-rc4-mm1, Torsten Kaiser, (Wed Oct 3, 2:06 pm)