Re: [PATCH] gitattributes.txt: mention exceptions to gitignore rules

Previous thread: weird github capitalization problem by bolfo on Tuesday, January 4, 2011 - 6:04 am. (1 message)

Next thread: PostScript files: textconv and "git add -p" by Matthieu Moy on Tuesday, January 4, 2011 - 8:50 am. (1 message)
From: =?UTF-8?q?Nguy=E1=BB=85n=20Th=C3=A1i=20Ng=E1=BB=8Dc=20Duy?=
Date: Tuesday, January 4, 2011 - 6:31 am

gitattr and .gitignore are supposed to use the same rules for matching
patterns. Unfortunately it's not exactly the same in reality. Mention
the differences so users won't be surprised, until gitattr gets
updates.

Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com>
---
 2011/1/4 Marcin Wiśnicki <mwisnicki@gmail.com>:
 > I think that for the time being at least the manual page must change to
 > reflect reality.

 Looks like changes will be more than just a few lines because path_matches()
 needs to learn about directories (iow less likely to get fixed right away).
 So, yes, good idea.

 I skimmed through excluded_from_list() (gitignore) and path_matches (gitattr).
 Seems no other differences.

 Documentation/gitattributes.txt |    1 +
 1 files changed, 1 insertions(+), 0 deletions(-)

diff --git a/Documentation/gitattributes.txt b/Documentation/gitattributes.txt
index 5a7f936..cfaf107 100644
--- a/Documentation/gitattributes.txt
+++ b/Documentation/gitattributes.txt
@@ -56,6 +56,7 @@ When more than one pattern matches the path, a later line
 overrides an earlier line.  This overriding is done per
 attribute.  The rules how the pattern matches paths are the
 same as in `.gitignore` files; see linkgit:gitignore[5].
+However patterns that end with a slash is not supported.
 
 When deciding what attributes are assigned to a path, git
 consults `$GIT_DIR/info/attributes` file (which has the highest
-- 
1.7.3.4.878.g439c7

--

From: Michael J Gruber
Date: Tuesday, January 4, 2011 - 7:50 am

Cheers,
Michael
--

From: Marcin =?iso-8859-2?q?Wi=B6nicki?=
Date: Tuesday, January 4, 2011 - 8:40 am

I'm afraid that is not all. The rules I've inferred:

  1. No pattern will match directory tree.
  2. It is only possible to match on path components.
  3. If pattern contains slash it is treated as absolute.

Example for file: d1/d2/f1.c

Patterns that match:
  *.c
  d1/d2/*
  /d1/d2/*
  */d2/*
  */*/*

Patterns that do not match but should:
  d2/*
  d2/
  d2
  d1/d2
  /d1/d2


--

From: Junio C Hamano
Date: Tuesday, January 4, 2011 - 12:17 pm

Not really.  I'd rather see a handful of test cases added to t0003 to help
interested parties to see what is broken and what is not.

Quoting from Marcin's other message, assuming that "Patterns" are stored


With slashes, so this is anchored at the toplevel of the working tree, and

The same as above;, the leading '/' is only to make it explicit that it is



This shouldn't match unless it appears in d1/.gitattributes.

The presense of '/' makes the pattern anchored to the directory it appear


We somehow don't do leading path match like we do for gitignore, but I do
not think this was intended.  My gut feeling is that these should match.

The thinking back, when we wrote the code, could have been that, unlike
gitignore that maintains only one bit (either "ignored" or "not"),
attributes are richer and giving the same attribute (say "whitespace
checking criteria") to files inside a directory and the containing
directory itself was nonsensical.  But if that was the reason, it is
faulty, as we do not track directories anyway.

Wouldn't it be sufficient to teach attr.c:path_matches() that a pattern
could also match with leading path?  That would automatically cover the
case where a pattern is terminated with a slash, as pattern "d/e/" would
never match path "d/e" but does match "d/e/f"?
--

From: Marcin =?iso-8859-2?q?Wi=B6nicki?=
Date: Tuesday, January 4, 2011 - 2:17 pm

After more carefully re-reading gitignore(5) I think that now I get it.

I presume that is is not possible to match certain pattern occurring 
*anywhere* in the path.

Would it be possible to extend pattern format to include double-star 
wildcard that matches anything including slashes ?

Like: **/whatever/**

Many tools (in java at least) and libraries support such extension to 
globs. Unfortunately standard fnmatch(3) that's used by git is not one of 
them, but glibc's implementation looks portable and self-contained so it 


--

Previous thread: weird github capitalization problem by bolfo on Tuesday, January 4, 2011 - 6:04 am. (1 message)

Next thread: PostScript files: textconv and "git add -p" by Matthieu Moy on Tuesday, January 4, 2011 - 8:50 am. (1 message)