Please don't.
There's no way gcc will EVER get the SSE2 cases right. It just cannot do 
it. In fact, I live in fear that we will have to turn off the compiler 
intrisics entirely some day just because there is always the worry that 
gcc will start using FP.
So the advantage of doing our own memcpy() is not that it's necessarily 
faster than the gcc built-in, but simply because I do not believe that the 
gcc people care enough about the kernel to let them make the choice.
		Linus
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/