Hi,
[reordering quoted text for convenience]
Bo Yang wrote:
Thanks! What you said is much more coherent than the vague things I
wrote.
[...]
If the code is copied verbatim from elsewhere, this is something ‘git
blame’ is already very good at. See [1].
Fuzzy matching is a big pain. ‘git blame’ knows how to ignore
whitespace. Dscho suggested counting common words. Maybe there are
some other ways. I think there is a real danger of getting lost in this
problem and wasting a lot of time, so although it is very interesting, I
would consider any progress in this area a bonus rather than a goal.
If you can make a heuristic along these lines this work well, I think it
would be great. I imagine it might work very well for commits that made
nice, small changes (like many of those in git.git). Jakub pointed out
some of the difficulties, and I like to hope your idea of “when in doubt,
include more lines” may work well in many cases in git.git still.
Good luck, and thank you for taking my crazy ideas seriously. :)
Regards,
Jonathan
[1] See v1.4.4-rc1~2 (Merge branch 'jc/pickaxe', 2006-11-07) and the
commits preceding it. About that series, Junio wrote:
Actually the plan is to make it do _true_ pickaxe,
although it will most likely end up either in dustbin or
replace blame.
It replaced blame.
I am not actually sure, but I assume “true pickaxe” refers to the
goals described in <http://gitster.livejournal.com/35628.html>
and the linked-to message.
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html