LinuxQuestions.org
Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question? If it is not in the man pages or the how-to's this is the place!

Old 12-29-2008, 11:05 AM   #1
garyg007
Member
 
Registered: Aug 2008
Location: north-east ohio
Distribution: Debian-squeeze/stable;
Posts: 279
Blog Entries: 1

Rep: Reputation: 31
ext3 file system journal error


Twice in the last 10 days, a journal failure has occurred on the partition that is my /home.

Is this a hard drive starting to go bad?
Or are there other things that could cause this?

The system re-mounts the partition that it has occurred on as read-only.
I have recovered by going to init 1 and unmounting the partition and running fsck. Will things have been lost?

I recall reading about an application that checks a hard drive, marks bad tracks, and reassigns them elsewhere. Does such a program exist?

(My OS is Slackware 12.2.)

Thanks,

Gary
 
Old 12-29-2008, 11:18 AM   #2
MS3FGX
LQ Guru
 
Registered: Jan 2004
Location: NJ, USA
Distribution: Slackware, Debian
Posts: 5,852

Rep: Reputation: 361
The system will force a filesystem check if the volume was not unmounted properly. Are you sure you are correctly shutting down every time and not just powering off while the system is up?

If nothing you are doing is causing the checks, it is possible there is some corruption or hardware failure.

You can check a drive for bad blocks with the aptly named "badblocks" command. As you have already found, "fsck" is used to check for and fix filesystem level errors.

It is important to understand though that you can't fix a bad block on the drive; the best you can do is tell the filesystem to not use it. Once you start losing blocks on your drive, you need to be focused on backing up your data and putting a new drive in the system. The problem will only get worse with time, until eventually you might not be able to use the drive at all.
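
As a rough sketch of the two checks mentioned above, the Python snippet below only assembles the command lines and prints them. Nothing touches a disk unless you run the printed commands yourself, as root, on an unmounted partition. The device name /dev/sdXN and the output file name are placeholders, and the helper names are made up for illustration. By default badblocks does a read-only scan; -n selects the slower non-destructive read-write test, and e2fsck -c runs a badblocks scan and adds any hits to the filesystem's bad block list so they are not allocated again.

```python
import shlex


def badblocks_cmd(device, nondestructive=False, output=None):
    """Build a badblocks command line (hypothetical helper, runs nothing)."""
    cmd = ["badblocks", "-s", "-v"]   # -s: show progress, -v: verbose
    if nondestructive:
        cmd.append("-n")              # non-destructive read-write test
    if output:
        cmd += ["-o", output]         # write the list of bad blocks to a file
    cmd.append(device)
    return cmd


def e2fsck_cmd(device):
    """Build an e2fsck command line that also scans for bad blocks."""
    # -f: force a check even if the filesystem is marked clean
    # -c: run a read-only badblocks scan and record hits in the bad block list
    return ["e2fsck", "-f", "-c", device]


if __name__ == "__main__":
    dev = "/dev/sdXN"  # placeholder: substitute the unmounted partition
    print(shlex.join(badblocks_cmd(dev, output="bad.txt")))
    print(shlex.join(e2fsck_cmd(dev)))
```

Printing rather than executing is deliberate: it leaves the decision to run anything against the disk with the person at the keyboard.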
 
Old 12-29-2008, 11:28 AM   #3
jiml8
Senior Member
 
Registered: Sep 2003
Posts: 3,171

Rep: Reputation: 116
Usually the symptoms you describe herald a hardware problem. It could be a bad cable or a bad controller. It could also be a bad drive. I would rate a controller fault about as likely as a drive fault, and a cable fault less likely. However, since cables are cheap, I would start by changing the cable.

Badblocks is indeed a way to test the drive. You should start running the non-destructive test routinely and watching for changes on the drive. It is possible that you had a head crash because, for instance, the cat bumped the box as it ran past, fleeing the 2-year-old who was pulling its tail. In that case, you have a bad spot on the drive that more than likely won't spread, so if you map out the bad spot, you'll be OK.

OTOH, if the bad blocks list is growing, then as you have already been told, this is your cue to buy a new drive rather soon.
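
One way to tell whether the list is growing is to save each scan with badblocks -o and compare the files. This is a minimal sketch, assuming each file holds one block number per line (the format badblocks writes) and that both scans used the same block size. The function names are made up for illustration.

```python
import sys


def read_block_list(path):
    """Read a badblocks -o output file: one block number per line."""
    with open(path) as fh:
        return {int(line) for line in fh if line.strip()}


def new_bad_blocks(old_path, new_path):
    """Return blocks present in the newer scan but not in the older one."""
    return sorted(read_block_list(new_path) - read_block_list(old_path))


if __name__ == "__main__":
    if len(sys.argv) == 3:
        grown = new_bad_blocks(sys.argv[1], sys.argv[2])
        print(f"{len(grown)} new bad block(s): {grown}")
    else:
        print("usage: compare.py OLD_LIST NEW_LIST")
```

An empty result across several scans suggests a one-off bad spot; a list that keeps gaining entries is the cue to replace the drive.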

If you can get your hands on a copy, or if you are willing to spend the money, get SpinRite. This product is brilliant and will definitively tell you the condition of your drive, and very often will fix it.
 
Old 12-29-2008, 12:15 PM   #4
garyg007
Member
 
Registered: Aug 2008
Location: north-east ohio
Distribution: Debian-squeeze/stable;
Posts: 279

Original Poster
Blog Entries: 1

Rep: Reputation: 31
Thanks jiml8;
The drive is an IDE drive. By "bad controller", do you mean a controller on the hard drive, or are you referring to the IDE port on the motherboard? I have two SATA drives and the IDE drive, and frequently after powering off, when I power back on, none of the hard drives are found until I re-seat the serial ATA cables.

SpinRite has been around for ages; I never thought of using it for a hard drive. I used it in my Windows dark ages to test a 100 MB Zip drive.

Thanks for pointing me to "badblocks".

Thank you both for answering my query; all replies are appreciated.

Gary
 
Old 12-29-2008, 12:31 PM   #5
jailbait
LQ Guru
 
Registered: Feb 2003
Location: Virginia, USA
Distribution: Debian 12
Posts: 8,338

Rep: Reputation: 548
Quote:
Originally Posted by garyg007 View Post

I have recovered by going to init 1 and unmounting the partition and running fsck. Will things have been lost?
Every ext2/ext3 filesystem has a lost+found directory at its root. fsck places any orphan files, orphan directories, or file fragments in lost+found and uses the inode number as the file name. So look in /home/lost+found for any lost files.
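
Since the recovered entries no longer carry their original names, a useful first step is to see how much ended up there. The sketch below counts regular files and adds up their sizes. The /home/lost+found path comes from this thread, the function name is made up for illustration, and the directory is normally readable only by root.

```python
import os


def summarize(directory):
    """Return (file_count, total_bytes) for regular files under directory."""
    count = 0
    total = 0
    for root, _dirs, files in os.walk(directory):
        for name in files:
            path = os.path.join(root, name)
            # Skip symlinks and special files; count only real file data.
            if os.path.isfile(path) and not os.path.islink(path):
                count += 1
                total += os.path.getsize(path)
    return count, total


if __name__ == "__main__":
    target = "/home/lost+found"  # path from this thread; adjust as needed
    if os.path.isdir(target):
        n, size = summarize(target)
        print(f"{n} files, {size / 1e6:.1f} MB")
    else:
        print(f"{target} not found")
```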

---------------------
Steve Stites
 
Old 12-29-2008, 03:11 PM   #6
Junior Hacker
Senior Member
 
Registered: Jan 2005
Location: North America
Distribution: Debian testing Mandriva Ubuntu
Posts: 2,687

Rep: Reputation: 61
If your ATA drive is on a 40-wire ATA cable, you may be getting crosstalk. Only a very, very old drive can get by without an 80-wire cable.
Just another avenue to walk in your troubleshooting.

The part about having to re-seat the SATA cables is odd. Normally, creeping is associated with too much heat in the box, possibly an expansion/contraction issue also. But I'm thinking it's probably a controller issue, the main chipset on the motherboard.

SpinRite is the only software I'm aware of that can reallocate data in bad sectors/blocks to spare sectors using a tenacious method of reading them, then isolate the bad sectors so they can't be used again, thus saving your data and drive.
 
Old 12-29-2008, 05:37 PM   #7
jiml8
Senior Member
 
Registered: Sep 2003
Posts: 3,171

Rep: Reputation: 116
Quote:
Originally Posted by garyg007 View Post
Thanks jiml8;
The drive is an IDE drive. By "bad controller", do you mean a controller on the hard drive, or are you referring to the IDE port on the motherboard? I have two SATA drives and the IDE drive, and frequently after powering off, when I power back on, none of the hard drives are found until I re-seat the serial ATA cables.

SpinRite has been around for ages; I never thought of using it for a hard drive. I used it in my Windows dark ages to test a 100 MB Zip drive.

Thanks for pointing me to "badblocks".

Thank you both for answering my query; all replies are appreciated.

Gary
I am referring to the chipset on the motherboard.

And SpinRite is at version 6. It works very well.
 
Old 12-29-2008, 05:58 PM   #8
garyg007
Member
 
Registered: Aug 2008
Location: north-east ohio
Distribution: Debian-squeeze/stable;
Posts: 279

Original Poster
Blog Entries: 1

Rep: Reputation: 31
Quote:
Originally Posted by jailbait View Post
Every partition has a lost+found directory. fsck places any orphan files, orphan directories, or file fragments in lost+found and uses the inode number for a file name. So look in the directory called /home/lost+found for any lost files.
Thanks for the info, Steve. I think it's time to replace the drive: there are over 1,800 files containing 77 MB of data in that lost+found folder.

Quote:
Originally Posted by Junior Hacker View Post
The part about having to re-seat the SATA cables is odd. Normally, creeping is associated with too much heat in the box, possibly an expansion/contraction issue also. But I'm thinking it's probably a controller issue, the main chipset on the motherboard.
Information appreciated. I do not feel very confident troubleshooting this stuff, and it is always nice to get confirmation of some of my thoughts, specifically heat and mainboard problems.

Thank you,


Gary
 
Old 12-29-2008, 07:11 PM   #9
jiml8
Senior Member
 
Registered: Sep 2003
Posts: 3,171

Rep: Reputation: 116
Quote:
Originally Posted by garyg007 View Post
Thanks for the info, Steve. I think it's time to replace the drive: there are over 1,800 files containing 77 MB of data in that lost+found folder.

Gary
It could be the drive, but with that kind of lost+found, I for one would be looking at the controller.

You probably have reports of I/O errors in the logs. Why not look, then perhaps post some of them here?
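
A minimal sketch for pulling the relevant lines out of a log follows. The search strings and the /var/log/messages location are assumptions for illustration, so adjust both to match your system. The function name is likewise made up.

```python
import os
import re

# Substrings that often accompany this kind of failure. This list is an
# assumption, not an exhaustive catalogue; extend it to fit your logs.
PATTERN = re.compile(r"I/O error|EXT3-fs error|journal has aborted",
                     re.IGNORECASE)


def suspicious_lines(lines):
    """Return the lines that match any of the patterns above."""
    return [line.rstrip("\n") for line in lines if PATTERN.search(line)]


if __name__ == "__main__":
    log = "/var/log/messages"  # assumed location; may differ on your system
    if os.access(log, os.R_OK):
        with open(log, errors="replace") as fh:
            for hit in suspicious_lines(fh):
                print(hit)
    else:
        print(f"cannot read {log}")
```

The timestamps on the matching lines are worth comparing against the times the partition went read-only.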
 
  

