Re: O_DIRECT foolish question

Andrew Morton (akpm@digeo.com)
Wed, 12 Feb 2003 14:03:38 -0800


Bruno Diniz de Paula <diniz@cs.rutgers.edu> wrote:
>
> Hi,
>
> I am trying to use O_DIRECT to read ordinary files and read syscall
> always returns 0, unless when the file size equals the fs block size.

It should be returning -1, with errno set to EINVAL.

> Is
> it true that I can only use O_DIRECT when the size of the file written
> in the inode is a multiple of block size?
>

The file can be of any size - the kernel will zero-fill any remaining bytes.

The address and length which you pass into the read() or write() system call
must both be a multiple of the filesystem block size.

It is always safe to just use the machine's page size for alignment
calculations - no filesystem has a blocksize larger than the pagesize.

A good way to do this is to run getpagesize(), and to then malloc a buffer
which is one page larger than you need. Then round that address up to the
next page boundary. And perform I/O into that memory with
multiple-of-page-size requests.

In the 2.5 kernel the "must be a multiple of blocksize" requirement was
relaxed. We now support alignments and lengths down to the minimum which is
supported by the underlying device. Typically 512 bytes, but not always.

Portable applications should not assume that 512-byte alignment is supported.
They should query the device's aligment requirements via the BLKSSZGET ioctl
against (say) /dev/hda1. Or they can simply try 512, 1024, 2048, ... at
initialisation time.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/