Hi,
On Sat, 7 Jun 2008, Geoffrey Irving wrote:
quoted text > On Mon, Jun 2, 2008 at 11:15 AM, Johannes Schindelin
> <Johannes.Schindelin@gmx.de> wrote:
>
> > On Mon, 2 Jun 2008, Geoffrey Irving wrote:
> >
> >> On Mon, Jun 2, 2008 at 9:18 AM, Johannes Schindelin
> >> <Johannes.Schindelin@gmx.de> wrote:
> >>
> >> > On Mon, 2 Jun 2008, Geoffrey Irving wrote:
> >> >
> >> >> On Mon, Jun 2, 2008 at 8:37 AM, Johannes Schindelin
> >> >> <Johannes.Schindelin@gmx.de> wrote:
> >> >>
> >> >> > Another issue that just hit me: this cache is append-only, so if
> >> >> > it grows too large, you have no other option than to scratch and
> >> >> > recreate it. Maybe this needs porcelain support, too? (git gc?)
> >> >>
> >> >> If so, the correct operation is to go through the hash and remove
> >> >> entries that refer to commits that no longer exist. I can add
> >> >> this if you want. Hopefully somewhere along the way git-gc
> >> >> constructs an easy to traverse list of extant commits, and this
> >> >> will be straightforward.
> >> >
> >> > I don't know... if you have created a cached patch-id for every
> >> > commit (by mistake, for example) and do not need it anymore, it
> >> > might make git-cherry substantially faster to just scrap the cache.
> >>
> >> Well, ideally hash maps are O(1), but it could be a difference
> >> between a "compare 40 bytes" constant and a "read a 4k block into
> >> memory" constant, so in practice yes. Scrapping it entirely will
> >> also make the implementation much simpler.
> >>
> >> It seems a little sad to wipe all that effort each time, but
> >> regenerating the cache is likely to be less expensive than a git-gc,
> >> so it shouldn't change any amortized complexities.
> >
> > Well, how about only scrapping the cache if it is older than, say, 2
> > weeks, and is larger than, say, 200kB? That should help.
>
> That heuristic is insufficient, since it doesn't do anything in the
> normal case where a new entry appears every few days (e.g., when syncing
> between two branches with cherry-pick).
Right, it is insufficient in such a case, but then, it does not really
matter, methinks. The cache is small enough anyway, and I think that many
people will not really use it as much as you do.
However, I realized one very real issue with your patch: you do not
provide a way to _disable_ the caching. I think at least a config
variable is needed, and while at it, a fallback when you cannot write to
the repository.
Ciao,
Dscho
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to
majordomo@vger.kernel.org
More majordomo info at
http://vger.kernel.org/majordomo-info.html