Re: [reiserfs-dev] Re: Ext2 directory index: ALS paper and benchmarks

Daniel Phillips (phillips@bonn-fries.net)
Fri, 7 Dec 2001 19:18:37 +0100


On December 7, 2001 07:03 pm, Ragnar Kjørstad wrote:
> > With ReiserFS we see slowdown due to random access even with small
> > directories. I don't think this is a cache effect.
>
> I can't see why the benefit from read-ahead on the file-data should be
> affected by the directory-size?
>
> I forgot to mention another important effect of hash-ordering:
> If you mostly add new files to the directory it is far less work if you
> almost always can add the new entry at the end rather than insert it in
> the middle. Well, it depends on your implementation of course, but this
> effect is quite noticable on reiserfs. When untaring a big directory of
> maildir the performance difference between the tea hash and a special
> maildir hash was approxemately 20%. The choice of hash should not affect
> the performance on writing the data itself, so it has to be related to
> the cost of the insert operation.

Yes, I think you're on the right track. HTree on the other hand is optimized
for inserting in arbitrary places, it takes no advantage at all of sequential
insertion. (And doesn't suffer from this, because it all happens in cache
anyway - a million-file indexed directory is around 30 meg.)

--
Daniel
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/