Re: stochastic fair queueing in the elevator [Re: [BENCHMARK] 2.4.20-ck3 / aa / rmap with contest]

Andrea Arcangeli (andrea@suse.de)
Mon, 10 Feb 2003 12:21:04 +0100


On Mon, Feb 10, 2003 at 02:48:10AM -0800, Andrew Morton wrote:
> Hans Reiser <reiser@namesys.com> wrote:
> >
> > readahead seems to be less effective for non-sequential objects. Or at
> > least, you don't get the order of magnitude benefit from doing only one
> > seek, you only get the better elevator scheduling from having more
> > things in the elevator, which isn't such a big gain.
> >
> > For the spectators of the thread, one of the things most of us learn
> > from experience about readahead is that there has to be something else
> > trying to interject seeks into your workload for readahead to make a big
> > difference, otherwise a modern disk drive (meaning, not 30 or so years
> > old) is going to optimize it for you anyway using its track cache.
> >
>
> The biggest place where readahead helps is when there are several files being
> read at the same time. Without readahead the disk will seek from one file
> to the next after having performed a 4k I/O. With readahead, after performing
> a (say) 128k I/O. It really can make a 32x difference.

definitely, but again the most important by far is to submit big
commands, if you get the 512k commands interleaved by different task
it isn't a panic situation.

> Readahead is brute force. The anticipatory scheduler solves this in a
> smarter way.

If you decrease readahead with anticipatory scheduler you will submit
the small commands, if you decrease the readahead you can stall 10
minutes any write in the queue, and still you won't grow the command in
the head of the queue.

> vmm:/mnt/hda6> for i in $(seq 1 8)
> for> do
> for> dd if=/dev/zero of=file$i bs=1M count=10
> for> sync
> for> fadvise file$i 0 999999999 dontneed

what is this fadvise doing?

> for> done
>
> vmm:/home/akpm> 0 blockdev --setra 240 /dev/hda1

> for i in $(seq 1 8)
> do
> time cat file$i >/dev/null&
> done
> cat file$i > /dev/null 0.00s user 0.01s system 0% cpu 4.422 total
> cat file$i > /dev/null 0.00s user 0.01s system 0% cpu 5.801 total
> cat file$i > /dev/null 0.00s user 0.01s system 0% cpu 6.012 total
> cat file$i > /dev/null 0.00s user 0.01s system 0% cpu 6.261 total
> cat file$i > /dev/null 0.00s user 0.02s system 0% cpu 6.401 total
> cat file$i > /dev/null 0.00s user 0.01s system 0% cpu 6.494 total
> cat file$i > /dev/null 0.00s user 0.01s system 0% cpu 6.754 total
> cat file$i > /dev/null 0.00s user 0.01s system 0% cpu 6.757 total
>
> vmm:/mnt/hda6> for i in $(seq 1 8)
> do
> fadvise file$i 0 999999999 dontneed
> done
> vmm:/home/akpm> 0 blockdev --setra 4 /dev/hda1
> vmm:/mnt/hda6> for i in $(seq 1 8)
> do
> time cat file$i >/dev/null&
> done
> cat file$i > /dev/null 0.00s user 0.02s system 0% cpu 13.373 total
> cat file$i > /dev/null 0.00s user 0.04s system 0% cpu 14.849 total
> cat file$i > /dev/null 0.00s user 0.02s system 0% cpu 15.816 total
> cat file$i > /dev/null 0.00s user 0.04s system 0% cpu 16.808 total
> cat file$i > /dev/null 0.00s user 0.05s system 0% cpu 17.824 total
> cat file$i > /dev/null 0.00s user 0.03s system 0% cpu 19.004 total
> cat file$i > /dev/null 0.00s user 0.03s system 0% cpu 19.274 total
> cat file$i > /dev/null 0.00s user 0.03s system 0% cpu 19.301 total
>
> So with eight files, it's a 300% drop from not using readahead.

yes, and anticipatory scheduler isn't going to fix it. furthmore with
IDE the max merging is 64k so you can see little of the whole picture.

Andrea
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/