Cc: H. Peter Anvin <hpa@...>, Richard Guenther <richard.guenther@...>, Joe Buck <Joe.Buck@...>, Andrew Haley <aph@...>, Aurelien Jarno <aurelien@...>, <linux-kernel@...>, <gcc@...>
Only an assumption, and in fact wrong. See upthread for a benchmark.
IIRC Uros also made measurements to justify the removal of cld (on P4 I
think), where it helps tremendously on small memcpy loops.
Ciao,
Michael.
--