Re: [OT] 2.6 not 3.0 - (NUMA)

Linus Torvalds (torvalds@transmeta.com)
Fri, 4 Oct 2002 17:36:43 -0700 (PDT)


On Fri, 4 Oct 2002, Martin J. Bligh wrote:
>
> It would be nice to have it all working and in place by the time Hammer arrives
> and makes this much more widespread ;-)

I agree, the Hammer is going to be interesting. But one of the most
interesting things to do will be to see if using it as a per-CPU memory
NUMA machine is slower or faster than using it with the memory interleaved
across CPU's (in which case it won't look NUMA at all).

My personal guess (assuming hypertransport works well) is that you'd
actually en dup interleaving at least for dual setups, and quite possibly
for quads as well. The per-node non-interleaved setup probably makes for
best _aggregate_ memory throughput if you have a load that has very
NUMA-friendly behaviour, but interleaving should make for best sustained
throughput for not-very-balanced-loads.

> Just an order of magnitude figure for you ... number of seconds spent in kernel
> space across all CPUs during a kernel compile on a 16-way NUMA-Q ...
>
> 2.4 with every patch I had (including O(1) sched + NUMA mods) ... 120s.
> On 2.5.40-mm1 with one small NUMA scheduler patch ... 38s.

Yeah, looking good..

Linus

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/