Linux - HardwareThis forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
I noticed this the other night when my Samba share suddenly became read only. I reboot the server and things work again for about 24 hours and then it happens again. Drives are kept cool (30-33C).
Here is the hardware:
AMD Athlon XP 2100+
MSI KM4M-L KM400 motherboard (pulled from an eMachines)
512MB Mushkin DDR400
20GB WD PATA HDD (hda OS drive)
500GB Seagate PATA HDD (hdc)
2x500GB Seagate PATA HDDs (RAID 0 /dev/md0, hde, hdg on Promise ULTRA133 TX2 PCI controller card)
HDA and HDC are attached to the mobo on their own channels and HDE and HDG are on their own channels on the RAID card.
Initially, I thought the drives were going bad but how could all of them be going bad at the same time? I've got a new, good quality PSU (Fortron AX450-PN) so I'd hope that's not frying the drives.
Any suggestions? I'm leaning towards a bad motherboard.
Have you recently upgraded the kernel? I've been trying to bring a computer out of retirement, and it boots with the 2.4 series kernel from the Slackware install disk, but not the most recent 2.6 series kernel (after getting similar error messages as you).
I forgot to mention that. The system is running Ubuntu 6.06. Over a month ago I upgraded to 2.6.17.13 kernel so I could use my Gb NIC. These hard drive problems started about 5 days ago.
Why not apt-get install smartmontools and check the status of your Hard drive ? drive probably has errors on it, an indication it may be starting to fail.
I don't think it's the HDD myself, as I've had roughly the same problem. I actually have 2 HDDs installed, and both give the same error. I booted from a knoppix live dvd and it didn't report any issues.
I think it may be a bug in the recent kernels for older Mobos. I'm going to try going back to a prior 2.6 kernel that I know was working fine with that hardware before, and see what happens.
Please let me know what you find out with an older kernel. I've already ordered a different motherboard to replace mine thinking that it would be the problem.
Since it's a VIA KT880 chipset and my current board is VIA KM400 I'm hoping I can just swap boards without reloading Ubuntu.
As for smartctl, I had to enable it on both drives. No errors are logged as of now.
I ran a short test on each and they both passed.
If the drives are good (which I think they are) either the kernel is buggy or the mobo is bad.
Hmmm ... I got similar errors always at boot time on Mandriva 2006/2007. Once kernel is booted everything seems OK though? While run smartctl when I'm at home.
Ok, I finally got around to working on the old machine again. It turns out that I am getting errors booting with a 2.4.29 kernel (the stock kernel with Slack 10.1). The errors I'm getting are different, however:
As for your issue Child of Wonder, there's a kernel config option that specifically addresses the error you are getting, and perhaps that option was enabled on the previous kernel but not the newer one.
It's under Device Drivers -> ATA/ATAPI/MFF/RLL support and it's "Use multi-mode by default". The help document for this option is as follows:
Code:
CONFIG_IDEDISK_MULTI_MODE:
If you get this error, try to say Y here:
hda: set_multmode: status=0x51 { DriveReady SeekComplete Error }
hda: set_multmode: error=0x04 { DriveStatusError }
If in doubt, say N.
If you run "hdparm /dev/hda" and "hdparm /dev/hdc" and the multcount option is set to 16 then this means that multi-mode is being set later, but it might be helpful to have it initially set to 16 using the kernel option above.
Odd thing is, the errors stopped yesterday in /var/log/messages.
The only thing I did yesterday was enabled SMART on the two drives and performed a short self test on each.
Multimode support was not set nor is it "16" when running hdparm on the drives. I'm going to recompile the kernel with this enabled and see what happens.
Just to follow up, I just recently disconnected the old 2GB drive which was hdb, and the errors disappeared. It's interesting that the errors mentioned hda, when it was hdb that was causing the problem. I think it was a message being sent on the primary channel to the master drive that the slave drive was returning an error for.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.