hm, an extra optimization barrier might have worse effects than even an
extra instruction. I think if the detection and patching can be made
100% safe, we dont care about the remaining 4% of markers that gcc
somehow reorders.
and in parallel gcc folks might want to start helping us achieve
single-instruction branch points? Currently there's no way to get flags
values out of inline assembly, except via a register intermediary which
adds another instruction. For flags that are unaffected by gcc's
input/output constraint generation code it would make sense to allow
them to be exported out of inline assembly.
Ingo
--