On Thu, 8 May 2008, Zhang, Yanmin wrote:Well, that shows that it was the BKL. That said, "idle 0%" is easy when you spin. Do you also have actual performance numbers? I'd hope that not only do we use full CPU time, it's also at least as fast as the old semaphores were? While I've been dissing sleeping locks (because their overhead is so high), at least in _theory_ they can get better behavior when not spinning. Now, that's not going to happen with the BKL, I'm 99.99% sure, but I'd still like to hear actual performance numbers too, just to be sure. Anyway, at least the "was it the BKL or some other semaphore user" question is entirely off the table. So we need to - fix the BKL. My patch may be a good starting point, but there are alternatives: (a) reinstate the old BKL code entirely Quite frankly, I'd prefer not to. Not only did it have three totally different cases, some of them were apparently broken (ie BKL+regular preempt didn't cond_resched() right), and I just don't think it's worth maintaining three different versions, when distro's are going to pick one anyway. *We* should pick one, and maintain it. (b) screw the special BKL preemption - it's a spinlock, we don't preempt other spinlocks, but at least fix BKL+preempt+cond_resched thing. This would be "my patch + fixes" where at least one of the fixes is the known (apparently old) cond_preempt() bug. (c) Try to keep the 2.6.25 code as closely as possible, but just switch over to mutexes instead. I dunno. I was never all that enamoured with the BKL as a sleeping lock, so I'm biased against this one, but hey, it's just a personal bias. - get rid of the BKL anyway, at least in anything that is noticeable. Matthew's patch to file locking is probably worth doing as-is, simply because I haven't heard any better solutions. The BKL certainly can't be it, and whatever comes out of the NFSD discussion will almost certainly involve just making sure that those leases just use the new fs/locks.c lock. This is also why I'd actually prefer the simplest possible (non-preempting) spinlock BKL. Because it means that we can get rid of all that "saved_lock_depth" crud (like my patch already did). We shouldn't aim for a clever BKL, we should aim for a BKL that nobody uses. I'm certainly open to anything. Regardless, we should decide fairly soon, so that we have the choice made before -rc2 is out, and not drag this out, since regardless of the choice it needs to be tested and people comfy with it for the 2.6.26 release. Linus --
| david | Re: Dual-Licensing Linux Kernel with GPL V2 and GPL V3 |
| Greg KH | [2.6.22.2 review 05/84] Fix deadlocks in sparc serial console. |
| Greg KH | [GIT PATCH] driver core patches against 2.6.24 |
| Andrew Morton | -mm merge plans for 2.6.23 |
git: | |
| Jeff Kirsher | [RESEND][NET-NEXT PATCH 01/29] ixgbe: fix bug where using wake queue instead of st... |
| Gerrit Renker | [PATCH 27/37] dccp: Integration of dynamic feature activation - part 2 (server side) |
| Jarek Poplawski | [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
| Patrick McHardy | Re: [GIT]: Networking |
| Manuel Bouyer | Re: Interactive performance in -current |
| Christian Limpach | Re: newfs: determining file system parameters |
| YAMAMOTO Takashi | Re: statvfs(2) replacement for statfs(2) patch |
| Charles M. Hannum | Re: kern/22869: Slave IDE drive not detected |
