>>>>> "Roland" == Roland Dreier <rdreier@cisco.com> writes:

Roland> Cool... I assume you do this for mutex_unlock() etc?

Roland> Is there any reason why ia64 can't do this too so we can kill
Roland> mmiowb and save everyone a lot of hassle? (mips, sh and frv
Roland> have non-empty mmiowb() definitions too but I'd guess that
Roland> these are all bugs based on misunderstandings of the mmiowb()
Roland> semantics...)

Hi Roland,

That's not going to solve the problem on Altix. On Altix the issue is
that there can be multiple paths through the NUMA fabric from cpu X to
the PCI bridge.

Consider this uber-cool<tm> ascii art - NR is my abbreviation for NUMA
router:

    -------          -------
    |cpu X|          |cpu Y|
    -------          -------
     |    \____  ____/    |
     |         \/         |
     |     ____/\____     |
     |    /          \    |
    ------          ------
    |NR 1|          |NR 2|
    ------          ------
         \          /
          \        /
           -------
           | PCI |
           -------

The problem is that the two writel()s in your example, despite both
being issued on cpu X because of the spin lock, can end up with the
first one going through NR 1 and the second one going through NR 2.
If there's contention on NR 1, the write going via NR 2 may hit the PCI
bridge before the one going via NR 1.

Of course, the bigger the system, the worse the problem....

The only way to guarantee ordering in the above setup is to either
make writel() fully ordered or add an mmiowb() between the two
writel()s. On Altix you have to go and read from the PCI bridge to
ensure all writes to it have been flushed, which is also what mmiowb()
is doing. If writel() were to guarantee this ordering, it would make
every writel() call extremely expensive :-(

Cheers,
Jes
