[Adding Jens Axboe, the author of what looks like the probable cause] On 10/7/07, Torsten Kaiser <just.for.lkml@googlemail.com> wrote:Looking closer at http://git.kernel.org/?p=linux/kernel/git/axboe/linux-2.6-block.git;a=commitdiff;h=ec6... the change to libata.h seems bogus : in ata_qc_first_sg: old new return qc->__sg return qc->__sg qc->__sg - qc->__sg == 0 qc->n_iter=0 -> sg - qc->__sg corresponds to qc->n_iter in ata_qc_next_sg: sg++; sg_next(sg); qc->n_iter++; sg - qc->__sg < qc->n_elem qc->n_iter < qc->nelem -> sg - qc->__sg corresponds to qc->n_iter but in ata_sg_is_last: (sg - qc->__sg) +1 == qc->n_elem qc->n_iter == qc->n_elem if sg - qc->__sg corresponds to qc->n_iter then shoudn't it be qc->n_iter+1 == qc->n_elem? That missing +1 would explain, why the SGE_TRM never gets set. And it would fit the symptoms, that the boot would fail at random. If the "correct" garbage was in place to where the sglist runs off it hangs the drive. And that would even fit the two different errors that I only got one time each: * a completely illegal access (PCI master abort while fetching SGT) * wrong alignment of the SGT (SGT no on qword boundary) At that that times the garbage seemed to point invalid addresses. But I'm still not understanding, how the kernel could only fail sometimes at bootup, but after that working without any visible errors? Is the sil-chip rather intelligent about detecting corrupted sglists and silently ignoring them? Torsten -
| Tarkan Erimer | Re: Dual-Licensing Linux Kernel with GPL V2 and GPL V3 |
| Greg KH | [GIT PATCH] driver core patches against 2.6.24 |
| Linus Torvalds | Re: init's children list is long and slows reaping children. |
| Kohei KaiGai | [PATCH 0/3] exporting capability name/code pairs (final#2) |
git: | |
| Gerrit Renker | [PATCH 33/37] dccp: Initialisation framework for feature negotiation |
| Jarek Poplawski | Re: [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
| Arjan van de Ven | Re: [GIT]: Networking |
| Mark Ryden | Re: Linux Wireless Mini-Summit -- Ottawa -- July 22, 2008 |
