You have to do something pretty weird for memcpy() not to work when
dst <= src, even with overlap. A plain front-to-back copy reads each
source byte before the destination pointer can overwrite it. The usual
way to break this is on architectures that have explicit cache control
instructions to establish the dst line in the cache, if those are used
before the src has been read.
This is not an issue on x86, though.
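
To illustrate the dst <= src case, here is a minimal sketch of a strictly
front-to-back byte copy. forward_copy() and overlap_demo() are
hypothetical names made up for this example; this is not the kernel's
memcpy() or any libc's. ISO C leaves memcpy() on overlapping buffers
undefined, so the sketch uses its own helper rather than calling
memcpy() directly.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper: copy n bytes strictly front to back,
 * one byte at a time, with no cache-establishing tricks.
 */
static void *forward_copy(void *dst, const void *src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;

    while (n--)
        *d++ = *s++;    /* src byte is read before dst can reach it */
    return dst;
}

/*
 * Overlapping copy with dst < src: move the last six bytes of the
 * string two positions towards the front. Returns 1 on success.
 */
static int overlap_demo(void)
{
    char buf[] = "abcdefgh";

    forward_copy(buf, buf + 2, 6);
    assert(memcmp(buf, "cdefghgh", 8) == 0);
    return 1;
}
```

With dst > src the same loop would overwrite source bytes before reading
them, which is the case memmove() exists to handle.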
-hpa
--