Re: hdc: dma_intr: status=0x51 { DriveReady SeekComplete Error }

Jens Axboe (axboe@suse.de)
Thu, 27 Dec 2001 15:54:03 +0100


On Sun, Dec 23 2001, Andre Hedrick wrote:
> the content is primarily the FS. Should an APP close a file but it is
> still in buffer_cache, there is no way to notify the app or the user or
> anything associated with the creation/closing of that file, if a write
> error occurs.
>
> So we have user-space believing it is success.

We have a buggy user-space app believing it is a success -- do you
really believe programs like eg mta's ignorantly closes a file and just
hopes for the best? fsync.

> FS doing an initial ACK of success.
> BLOCK generating the request to the low_level.
> LOW_LEVEL goes OH CRAP, I am having a problem and can not complete.
>
> LOW_LEVEL goes, HEY BLOCK we have a problem.
> BLOCK, that is nice whatever ....

What does this _mean_?

> This is a bad model, an worse is
>
> LOW_LEVEL goes, HEY BLOCK we have a problem.
> BLOCK goes, HEY FS we have an annoying LOW_LEVEL asking for reissue.
> FS, duh which way did the rabbit go ...

retries belong at the low level, once you pass up info of failure to the
upper layers it's fatal. time for FS to shut down.

> > Incidentally the EVMS IBM volume manager code does support bad block
> > remapping in some situations.
>
> Well managing badblock can be a major pain, but it is the right thing to
> do. Now what is the cost, since there is surge in journaling FS's that
> have logs. The cost is coming up w/ a sane way to manage the mess.
> Even before we get to managing the mess, we have to be able to reissue the
> request to a reallocated location, and make all kinds of noise that we are
> doing heroic attempts to save the data. These may include --

Irk, software managed bad block remapping is horrible.

> The issue is we are doing nothing to address the point, and it is arrogant
> for the maintainers of the various storage classes and the supported upper
> layers not willing to address this issue.

How about showing solutions in form of patches instead bitching about
this again and again? Frankly, I'm pretty sick of just seeing pointless
talk about the issue.

-- 
Jens Axboe

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/