Linux - GeneralThis Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Hi,
I booted up yesterday and got a S.M.A.R.T MSG:
Your hard disk drive is failing! SMART nessage: Device /dev/sda
537 currently unreadable (pending) sectors
I made an iso disk of my data of course and transferred a few help files to my laptop just in case.
I wanted to run fsck but got scared off by the warning that running it on a mounted disk risks severe damage.
I've only used fsck once before to ascertain that this disk's predescessor had failed. This is the replacement!
Question is: How do I use it safely?
Thank you in advance for your support!
p.s. two recent boot ups had no smart warning....??
Try Googling problems like this or search the forums. People have problems like this just about every day and the answers can be found here before posting their question.
S.M.A.R.T shows when your hard drive starts to go bad. If this is a replacement and there hasn't been a great deal of time between replacements it's possible you just got another bad drive. It's also possible there MIGHT be a bug in the BIOS. Try flashing your BIOS with the most recent version. However, that is an unlikely scenario. It doesn't hurt to have the most recent BIOS though.
To run fsck manually, you have to get into single user mode. You can achieve this after boot by going to a command line and typing in "init 1" wihtout the quotation marks. After that, mount the desired partition as read only and run fsck. Refer to the man pages as they will be able to answer most of your questions. Let us know how it goes.
Thanks for your tip Brandon.
I always formulate the Quest. I have as accurately as I can, enter it into "The new Thread" box and search for like problems. I most often find the exact answer I need ergo, post no thread. However, when there are several answers that sort-of- approach my problem, I choose to post a problem rather than risk galloping down the wrong path.
I guess I could Google the problem but then I wouldn't be using Linux Questions, would I? I find this site to be most helpful.
When you say "going to a command line", do you mean open a terminal?
Since the Err.Msg. refers only to /dev/sda and not a particular partition, I assume that I should enter :
init 1
mount /dev/sda ro ?
matthew
Hi Tredegar, thanks for your help!
As root, I typed in: shutdown -rF now
Suse brought up the splash screen and I Esc out of it and I watched the machine load various things. At several points, I could read that fsck was working through the fstab listing but it went very, very fast and concluded with the full Suse booted up.
I expected it to take longer. Does that sound about right to you?
BTW, I have booted this distro at least 7 times since receiving that ErrMsg. warning about impending disk failure, and it only ever appeared once?
Matthew
@ nx5000
Well, we live in hope!
@ drmjh
If it fsck'd fast, maybe that is good. I don't know.
Quote:
BTW, I have booted this distro at least 7 times since receiving that ErrMsg. warning about impending disk failure, and it only ever appeared once?
Difficult for me to say. The warning of an impending disk failure is not something I would want hanging over my head, but I suppose it depends on how much you value your data: If I were you, I'd:
-Establish regular backups (but we all do this anyway, right? )
-Try & find out if there are any disk-testing utilities for your drive (maybe from the manufacturer), and try them.
-Maybe move to a new hdd, and keep the "Failing" one for a good testing (there's got to be some software out there to thrash a disk & look for errors).
That said, the fact that any disk is working today doesn't mean that it will be working tomorrow
I definitely saw fsck rolling through the root section and later, sda7, the last partition. I got no Msg.s and as mentioned, it was fast.
I'd like to thank you once again for your help, I'll look into some of the options you mentioned. I do regularly backup my data but it's still a hastle when HD goes down.
Matthew
Due to necessity I have been away from my Desktop for almost 2 months. Since I've returned, the error Msg.s continue sporadically. I have downloaded and run Seagate Untilities on the entire hard disk twice and no problems were found. The exit Msg. was "examined disk has passed".
Can I asume that the SMART error Msg. is in Error?
How can I get rid of the Msg.?
Distribution: Mac OS X Leopard 10.6.2, Windows 2003 Server/Vista/7/XP/2000/NT/98, Ubuntux64, CentOS4.8/5.4
Posts: 2,986
Rep:
I also get this message continually on one of my server. I did a check on it and everything is fine. For whatever reason, maybe hardware, it is just misreporting SMART. I just ignore the message now.
If it is a seagate, the SMART message may be in error. I have two Seagate Barracudas (SCSI) that have been reporting themselves as failing since the very first time I powered them up. These drives report too many read errors and therefore claim to be bad. I had purchased a pair of these drives as a new but surplus from Ubid.com, and both drives said the same thing.
So I exercised the hell out of 'em before putting them into service, and neither drive failed. That was 4 years ago; both drives show 24/7/365 service and both work fine. Smartctl shows that both of them DO have high rates of read errors, but no failures after error correction is applied. So this is normal for this drive. I just don't use them in critical roles; they are used for scratchpad and daily backup.
In your case, I would be very worried about the claim of unreadable sectors. Probably SMART has gotten that right, and the number of sectors being shown is very high and does indicate a failure is impending (probably). You should fsck the disk, as a beginning step, but you need to do that from a live CD so that you can do it with the drive not mounted. Beyond that, the tool I would recommend is spinrite. It is entirely possible that running spinrite on that drive will clear all the errors, and often enough those errors will stay cleared.
However, spinrite is not cheap. If you can borrow a copy, do so. If you can't, then replacing the HD is probably about the same price as purchasing spinrite. Personally, I consider the cost of spinrite fully justified the first time it recovers a HD with data on it that you need, or the first time it saves you from having to do a reinstall with all the time and effort that entails. But that is just me; your mileage may vary.
Thanks Jim,
I can try to fsck the drive from a knoppix live dvd. I've only ever used it once before and as I recall, there are some dire warnings about possibly making the drive unusable by employing some of the repair functions. Any words of wisdom here?
I wonder what the Seagate utilities check, since they didn't seem to find anything wrong when I ran them?
Matthew
Gawd, can I burn up time with Linux, spinning my wheels!
After booting live knoppix and using a disk utility, I see listed:
.../dev/hdb which has /dev/hdb1 (windows)
/dev/hdb6 (linux)
/dev/hdb7 (linux)
/dev/hdb5 (Mem swap)
I see a 2nd ... HP /dev/sda (i only have 1 disk, so I assume this is my printer listed as a block device?
In knoppix when I try to umount as root the hard disk I get err. msgs. canot umount. if i look for the device as /dev/hdb1, I get "no such device"
Yes, thank you Tredegar,
I just looked at Msg. # 4 again and forcing a fsck gets me a scary msg. about possibly doing bad damage to the disk and that's why I hesitate.
I did not mount the hard as ro that must be Suse's security idea.
what about if I edit the fstab to make it rw ?
Matthew
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.