When this message occurs, one of the 2 CPUs is "dead", meaning that any
processes running on it are copletly stalled and the rtc for the "dead"
processor in /proc/interupts does not increase.
An interesting datapoint is that the air conditioner failed one weekend
and some machines died because of the heat and these machines failed
with the same tlb flush timeouts.
Currently we are running 2.4.9 (redhat modified) kernel, and have tried
others up to and including 2.4.18 with the same results.
Any insight would be greatly appreciated.
I am not a subscriber on this list, so CCing me would be greatly
appreciated.
Mike
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/