checksumming with mmx, comment in arch/i386/lib/mmx.c

nick black (dank@suburbanjihad.net)
Tue, 11 Feb 2003 13:37:07 -0500


i want to speed up my product's checksum verification code, and was
pondering the use of mmx (ip_fast_csum as implemented by cwik and
gulbrandsen from asm-i386/checksum.h is fast enough for my needs, but i
don't want to violate the gpl 8) ).

i'm refreshing myself on mmx currently, but noticed the following
comment from arch/i386/lib/mmx.c's _mmx_memcpy:

"Checksums are not a win with MMX on any CPU tested so far for any MMX
solution figured."

firstly, to what domain of checksums does this comment apply? secondly,
why is it true? it seems the PADDW family of instructions could work
well here; is the slowdown a result of the kernel's need to muck with
fpu state (from what i can tell, mmx uses the fp registers)?

thanks so much for any help!

-- 
nick black <dank@reflexsecurity.com>
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/