Re: 2.4.8 NFS Problems

Mike Black (mblack@csihq.com)
Fri, 7 Sep 2001 09:13:29 -0400


But my timeouts were only 10 seconds -- well below the timeo and retrans
timeout periods.
And my network traffic shows that this is the client causing the problem NOT
the server.
It's the read() that pauses for 10 seconds and then the NFS write
immediately returns EIO.
So...I don't think soft mounts has anything to do with it.
Also...I've now seen this error once more even with the 4096 read/write
sizes.
________________________________________
Michael D. Black Principal Engineer
mblack@csihq.com 321-676-2923,x203
http://www.csihq.com Computer Science Innovations
http://www.csihq.com/~mike My home page
FAX 321-676-2355
----- Original Message -----
From: "Trond Myklebust" <trond.myklebust@fys.uio.no>
To: "Mike Black" <mblack@csihq.com>
Cc: "linux-kernel" <linux-kernel@vger.kernel.org>
Sent: Friday, September 07, 2001 7:49 AM
Subject: Re: 2.4.8 NFS Problems

>>>>> " " == Mike Black <mblack@csihq.com> writes:

> I've been getting random NFS EIO errors for a few months but
> now it's repeatable. Trying to copy a large file from one
> 2.4.8 SMP box to another is consistently failing (at different
> offsets each time). This doesn't appear to be a network
> problem as the last comm between the machines looks OK. By the
> timestamps it appears that a read() is taking too long and
> causing a timeout?

Morale: Don't use soft mounts: they are prone to these things. If you
insist on using them, then try playing around with the `timeo' and
`retrans' mount variables.

Soft mount timeouts are not only due to network problems, but can
equally well be due to internal congestion. The rate at which the
network can transmit requests is usually (unless you are using
Gigabit) way below the rate at which your machine can generate them.

Cheers,
Trond

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/