On Wed, Jun 20, 2007 at 10:34:15AM -0700, Linus Torvalds wrote:I think we can agree it was more about some minimal fairness. ... IMHO, we shouldn't try to blame the hardware until we know exactly the source of this bug. ... I could agree with this, but only when we know exactly what place should be fixed if we don't care about speed. Then, of course, the cost could be estimated. But after last Ingo's patch I'm not sure he, or anybody else here, has this knowledge. I'd also remind that adding one smp_mb() also did the work, and it doesn't look like a big performance hit. We should only better know why this works. Not necessarily so. Until the exact reason isn't known "for sure", this one place could be fixed, but the same problem could appear somewhere else in more masked form or is far less repeatable. BTW, I've looked a bit at these NMI watchdog traces, and now I'm not even sure it's necessarily the spinlock's problem (but I don't exclude this possibility yet). It seems both processors use task_rq_lock(), so there could be also a problem with that loop. The way the correctness of the taken lock is verified is racy: there is a small probability that if we have taken the wrong lock the check inside the loop is done just before the value is beeing changed elsewhere under the right lock. Another possible problem could be a result of some wrong optimization or wrong propagation of change of this task_rq(p) value. Thanks for response & best regards, Jarek P. -
| Eric Anholt | [PATCH] Export kmap_atomic_pfn for DRM-GEM. |
| Rafael J. Wysocki | 2.6.27-rc4-git1: Reported regressions from 2.6.26 |
| Robin Lee Powell | NFS hang + umount -f: better behaviour requested. |
| Avi Kivity | [PATCH] x86: default to reboot via ACPI |
git: | |
| Shawn O. Pearce | Re: MinGW port - initial work uploaded |
| Pierre Habouzit | git submodules |
| Mike Hommey | Re: Minor annoyance with git push |
| H. Peter Anvin | kde.git is now online |
| Chris Bullock | OpenBSD isakmpd and pf vs Cisco PIX or ASA |
| Brandon Lee | Re: DELL PERC 5iR slow performance |
| peter | ntpd not synching |
| bofh | Re: Load balancing with DSR |
| Jim Winstead Jr. | Re: Root Disk/Book Disk Compatibility |
| Peter Grehan | TCP SYN_RECV state: stuck forever in accept() ? |
| Brandon S. Allbery | Re: mkdir says "no space left on device" and more problems... |
| Theodore Ts'o | Re: Stabilizing Linux |
