On Friday 12 October 2007 10:56, Berkley Shands wrote:OK, it does sound suspiciously like a hardware bug, or some unrelated software bug that is causing memory scribbles... A few things you could do. One is that you could verify that it indeed is the kswapd_wait spinlock that it is spinning on, and then when you see the lockup, you could verify that no other tasks are holding the lock. (it is quite an inner lock, so you shouldn't have to wade through call chains...). That would confirm corruption. Dumping the lock contents and the fields in the structure around the lock might give a clue. You could put the spinlock somewhere else and see what happens (move it around in the structure, or get even more creative...). or do something like have 2 spinlocks, and when you encounter the lockup, verify whether or not they agree. (It sounds like you're pretty capable, but if you want me to have a look at doing a patch or two to help, let me know.) Another is to bisect the problem, however as you say the kernel is going slower, so you may just bisect to the point where it is sustaining enough load to trigger the bug, so this may not be worth you time just yet. You could _try_ turning on slab debugging. If there is random corruption, it might get caught. Maybe it will just change things enough to hide the problem though. Thanks for reporting! -
| James Bruce | Re: Dual-Licensing Linux Kernel with GPL V2 and GPL V3 |
| Peter Zijlstra | [PATCH 00/23] per device dirty throttling -v8 |
| Jan Engelhardt | intel iommu (Re: -mm merge plans for 2.6.23) |
| Peter Zijlstra | [RFC/PATCH 0/4] CPUSET driven CPU isolation |
git: | |
| Gerrit Renker | [PATCH 18/37] dccp: Support for Mandatory options |
| Rick Jones | Re: Network latency regressions from 2.6.22 to 2.6.29 |
| David Miller | [GIT]: Networking |
| Josip Rodin | bnx2_poll panicking kernel |
