Re: ext3 journal/IDE problems ? (softirq + sched assert ?)

Bill Huey (billh@gnuppy.monkey.org)
Fri, 9 Aug 2002 02:34:55 -0700


On Fri, Aug 09, 2002 at 12:37:25AM -0600, Andreas Dilger wrote:
> On Aug 08, 2002 21:04 -0700, Bill Huey wrote:
> > What's going on with this ?
> >
> > I get:
> > EXT3-fs error (device ide0(3,5)): ext3_free_blocks: bit already cleared for block 753210
> > :
> > :
> > EXT3-fs error (device ide0(3,5)): ext3_free_blocks: bit already cleared for block 753273
> > ext3_free_blocks: aborting transaction: Journal has aborted in __ext3_journal_get_undo_access<2>EXT3-fs error (device ide0(3,5)) in ext3_free_blocks: Journal has aborted
>
> Looks like you got a block of zeros from disk when it should have been a
> block bitmap, or your filesystem is otherwise corrupted. You need to do
> a full fsck on this filesystem.
>
> As for cause, I have no idea. IDE DMA, IDE cables, memory, kernel bug...

It's certainly a kernel bug. This machine has been reliable for years with
stable kernels and I started to run the 2.5 series over the last couple
of days just to test out the softirq stuff (+ preempt) and latency. I
applied Mingo's patch for a recent scheduler assertion bug and it looks
like some kind of race is possibly happening in the IDE layer.

And Mingo, yes, the patch you sent me did help, but it looks like there's
other problems.

Thanks

bill

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/