Re: [NFS] Re: [PATCH] zerocopy NFS for 2.5.36

Andi Kleen (ak@suse.de)
Fri, 20 Sep 2002 03:23:46 +0200

Messages sorted by: [ date ][ thread ][ subject ][ author ]
Next message: Ulrich Drepper: "Re: [ANNOUNCE] Native POSIX Thread Library 0.1"
Next message: David S. Miller: "Re: [NFS] Re: [PATCH] zerocopy NFS for 2.5.36"
Previous message: Greg KH: "Re: [BK PATCH] More PCI Hotplug changes for 2.4.20-pre7"

On Thu, Sep 19, 2002 at 06:09:34PM -0700, Andrew Morton wrote:
> > Very interesting IMHO would be to find a heuristic to switch between
> > a write combining copy and a cache hot copy. Write combining is good
> > for blasting huge amounts of data quickly without killing your caches.
> > Cache hot is good for everything else.
>
> I expect that caching userspace and not pagecache would be
> a reasonable choice.

Normally yes, but not always. e.g. for squid you don't really want to
cache user space.

But I guess it would be a reasonable heuristic. Or at least worth a try :-)

>
> > But it'll need hints from the higher level code. e.g. read and write
> > could turn on write combining for bigger writes (let's say >8K)
> > I discovered that just unconditionally turning it on for all copies
> > is not good because it forces data out of cache. But I still have hope
> > that it helps for selected copies.
>
> Well if it's a really big read then bypassing the CPU cache on
> the userspace-side buffer would make sense.
>
> Can you control the cachability of the memory reads as well?

SSE2 has hints for that (prefetchnti and even prefetcht0,1 etc. for different
cache hierarchies), but it's not completely clear on how much
the CPUs follow these.

For writing it's much more obvious and usually documented even.

>
> What restrictions are there on these instructions? Would
> they force us to bear the cost of the aligment problem?

They should be aligned, otherwise it makes no sense. When you assume it's
more likely that one target or destination are unaligned then you can easily
align either target or destination. Trick is to chose the right one,
it varies on the call site.
(these are for big copies so a small alignment function is lost in the noise)

x86-64 copy_*_user currently aligns the destination, but hardcoding that
is a bit dumb and I'm not completely happy with it.

-Andi
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Ulrich Drepper: "Re: [ANNOUNCE] Native POSIX Thread Library 0.1"
Next message: David S. Miller: "Re: [NFS] Re: [PATCH] zerocopy NFS for 2.5.36"
Previous message: Greg KH: "Re: [BK PATCH] More PCI Hotplug changes for 2.4.20-pre7"