> Well that's exactly right. For threaded programs (and maybe even
For some CPUs, replacing an conditional branch with a conditional move is a
*huge* win because it cannot be mispredicted. In general, compilers should
optimize for unshared data since that's much more common in typical code.
Even for shared data, the usual case is that you are going to access the
data few times, so pulling the cache line to the CPU is essentially free
since it will happen eventually.
Heuristics may show that the vast majority of such constructs write anyway.
So the optimization may also be valid based on such heuristics.
A better question is whether it's legal for a compiler that claims to
support POSIX threads. I'm going to post on comp.programming.threads, where
the threading experts hang out.
A very interesting case to be sure.
DS
-