> Another conclusion from the cited text is that in contrast with whatIt all depends upon what threading standard you are using. If GCC is going to support POSIX threading, it cannot require that thread-shared data be marked 'volatile' since POSIX does not require this. It can offer semantic guarantees for volatile-qualified data if it wants to. But POSIX provides a set of guarantees that do not require marking data as 'volatile' and if GCC is going to support POSIX threading, it has to support providing those guarantees. As far as I know, no threading standard either requires 'volatile' or states that it is sufficient for any particular purpose. So there seems to be no reason to declare thread-shared variables as volatile except as some kind of platform-specific optimization. POSIX mutexes are sufficient. They are necessary if there is no other way to get the guarantees you need. Nothing prevents GCC from providing any guarantees it wants for 'volatile' qualified data. But POSIX mutexes must work as POSIX specifies or GCC cannot support POSIX threading. This is the nightmare scenario (thanks to Hans-J. Boehm): int x; bool need_to_lock; pthread_mutex_t mutex; for(int i=0; i<50; i++) { if(unlikely(need_to_lock)) pthread_mutex_lock(&mutex); x++; if(unlikely(need_to_lock)) pthread_mutex_unlock(&mutex); } Now suppose the compiler optimizes this as follows: register=x; for(int i=0; i<50; i++) { if(need_to_lock) { x=register; pthread_mutex_lock(&mutex) register=x; } register++; if(need_to_lock) { x=register; pthread_mutex_unlock(&mutex); register=x; } } x=register; This is a perfectly legal optimization for single-threaded code. It may in fact be an actual optimization. Clearly, it totally destroys threaded code. This shows that, unfortunately, the normal assumption that not knowing anything about the pthread functions ensures that optimizations won't break them is incorrect. DS -
| Linus Torvalds | Linux 2.6.27-rc5 |
| Ingo Molnar | [announce] "kill the Big Kernel Lock (BKL)" tree |
| Christoph Lameter | Re: [RFC 00/15] x86_64: Optimize percpu accesses |
| Greg KH | [GIT PATCH] driver core patches against 2.6.24 |
git: | |
| Gerrit Renker | [PATCH 15/37] dccp: Set per-connection CCIDs via socket options |
| David Miller | Re: [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
| Arjan van de Ven | Re: [GIT]: Networking |
| Jarek Poplawski | [PATCH iproute2] Re: HTB accuracy for high speed |
