Re: How to efficiently blame an entire repo?

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: Jeff King
Date: Friday, April 30, 2010 - 2:21 pm

On Thu, Apr 29, 2010 at 07:12:27PM -0400, Jay Soffian wrote:


With the exception of your "blame only those files that you know your
authors have touched" optimization, I think you pretty much have to do
this. Anything else will just be reimplementing blame. You can't throw
away most content prematurely, because it may end up blaming to your
authors of interest eventually.

I think this is also what Junio ended up doing when presenting at
GitTogether '08:

  http://userweb.kernel.org/~junio/200810-Chron.pdf

In theory you might be able to do multi-file blame faster.  I would be
curious to see the performance difference between:

  $ git blame file1 file2 ;# not actually implemented

and

  $ for i in file1 file2; do git blame $i; done

Much of the work is O(content), but there is some overlap in walking the
history and generating diffs.

-Peff
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
How to efficiently blame an entire repo?, Jay Soffian, (Thu Apr 29, 4:12 pm)
Re: How to efficiently blame an entire repo?, Avery Pennarun, (Fri Apr 30, 12:45 pm)
Re: How to efficiently blame an entire repo?, Jay Soffian, (Fri Apr 30, 1:16 pm)
Re: How to efficiently blame an entire repo?, Jeff King, (Fri Apr 30, 2:21 pm)