RE: select for UNIX sockets?

David Schwartz (davids@webmaster.com)
Mon, 9 Jun 2003 10:55:35 -0700


> "David Schwartz" <davids@webmaster.com> writes:

> > It really doesn't matter. UDP applications have to control
> > the transmit
> > pacing at application level. There is absolutely no way for the
> > kernel to
> > know whether the path to the recipient is congested or not.

> Because what? The kernel knows everything it has to know - i.e. complete
> state of socket queue in question.

For the last time, there is no socket queue. You wouldn't want there to be
one.

Consider a UDP application that is sending packets to two destinations, one
over a 56Kbps serial link running PPP and one over gigabit Ethernet. If
there were a socket send queue, the packets going over the 56Kbps serial
link would block the packets going over the gigabit Ethernet.

> But if select() on sockets is illegal, should we make it return -Esth
> instead of success. Certainly, we should get rid of invalid kernel code,
> right?

No, it is legal, you are just misusing it. If you don't want your socket
operations to ever block, use non-blocking socket operations. If you use
UDP, or another connectionless protocol, you should understand that *you*
are responsible for transmit pacing.

> > The kernel can't tell you when to send because that depends upon
> > factors
> > that are remote.

> Such as?

Such as where the packet you send is actually *going*.

> > Yes, it would be nice of the kernel helped more. But the application
> > has to
> > deal with remote packet loss as well.

> Could you please show me a place in the kernel which could cause such
> a loss on local datagram sockets?

I guess I'm not getting through. The fact is, you don't have the guarantee
that you think you have. I'm giving you examples to show you why you don't
have that guarantee. You argue that the examples don't apply to your
specific case. I'm not saying they do. I'm saying that because there are
unavoidable cases where what you're trying to do won't work, then what
you're trying to do is not guaranteed to work in all cases and you shouldn't
try to do it.

The kernel does not remember that you got a write hit on 'select' and use
it to somehow ensure that your next 'write' doesn't block. A 'write' hit
from 'select' is just a hint and not an absolute guarantee that whatever
'write' operation you happen to choose to do won't block.

DS

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/