On 9/4/07, Theodore Tso <tytso@mit.edu> wrote:It is very easy to get bogged down in performance arguments on database design when the correct answer is that there are always lots of different ways to achieve the same goal. I wanted to defer debating performance until we closely looked at the relationships between the data at an abstract level. Since git hasn't stored all of the fields in the object table (the path is encoded in the index) we are never going to be able to build an alternative way of indexing the object table. Not being able to build alternative indexes is likely to cause problems when the database starts getting really big. Without an index every query that can't use the path name index is reduced to doing full table scans. A few things that could benefit from alternative indexing, blame, full-text search, automating the Maintainers file, etc. I'm just asking if we really want to make full table scans the only possible way to implement these types of queries. If the answer is no, then let's first explore how to fix things at an abstract level before diving into the performance arguments. An obvious parallel from the file system world is the locate database and how it is forced to continuously rescan the file system and store full path names. -- Jon Smirl jonsmirl@gmail.com - To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
| Greg Kroah-Hartman | [PATCH 001/196] Chinese: Add the known_regression URI to the HOWTO |
| Andy Whitcroft | clam |
| Vladislav Bolkhovitin | Re: Integration of SCST in the mainstream Linux kernel |
git: | |
| Jarek Poplawski | [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
| David Miller | [GIT]: Networking |
| Gerrit Renker | [PATCH 27/37] dccp: Integration of dynamic feature activation - part 2 (server side) |
| Evgeniy Polyakov | Re: [BUG] New Kernel Bugs |
