Btw, I do think that the whole NOPL issue is separate from all the other
issues. There can be _other_ cases where it really is worth doing some
"generic" optimizations or being more "specific", and my argument really
is that NOPL is _not_ one of those cases.
So I'm still not sure that X86_GENERIC is necessarily the answer. The
answer may be:
- never use NOPL statically unless we _know_ it works (eg x86-64)
- never allow such a stupid decision by gcc as to use NOPL on x86-32.
..and then leave X86_GENERIC alone wrt everything else.
Peter - does gcc actually use NOPL in _32-bit_ code too? It really seems
to be a stupid decision to make a binary not run on other CPU's over
something as trivial as that. That's something I'd expect out of an Intel
compiler just to mess with AMD, not out of gcc.
Linus
--