Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux? |
Notices |
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
Are you new to LinuxQuestions.org? Visit the following links:
Site Howto |
Site FAQ |
Sitemap |
Register Now
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
|
 |
|
04-16-2004, 04:49 PM
|
#1
|
LQ Newbie
Registered: Apr 2004
Posts: 3
Rep:
|
dma_timer_expiry when using hard disk
Hi,
I've been getting hard-disk related errors. It first happened with a 1 year old disk, and I thought it was a hardware failure. However I replaced it with a new one, and I still get errors. Here is an excerpt from /var/log/kern.log:
Apr 15 11:14:47 localhost kernel: hda: dma_timer_expiry: dma status == 0x21
Apr 15 11:14:57 localhost kernel: hda: error waiting for DMA
Apr 15 11:14:57 localhost kernel: hda: dma timeout retry: status=0x58 { DriveReady SeekComplete DataRequest }
Apr 15 11:14:57 localhost kernel:
Apr 15 11:14:58 localhost kernel: hda: read_intr: status=0x51 { DriveReady SeekComplete Error }
Apr 15 11:14:58 localhost kernel: hda: read_intr: error=0x04 { DriveStatusError
}
Apr 15 11:14:58 localhost kernel: hda: read_intr: status=0x51 { DriveReady SeekComplete Error }
Apr 15 11:14:58 localhost kernel: hda: read_intr: error=0x04 { DriveStatusError
}
Apr 15 11:14:58 localhost kernel: hda: read_intr: status=0x51 { DriveReady SeekComplete Error }
Apr 15 11:14:58 localhost kernel: hda: read_intr: error=0x04 { DriveStatusError
}
Apr 15 11:14:58 localhost kernel: hda: read_intr: status=0x51 { DriveReady SeekComplete Error }
Apr 15 11:14:58 localhost kernel: hda: read_intr: error=0x04 { DriveStatusError
}
Apr 15 11:14:58 localhost kernel: ide0: reset: success
Apr 15 11:15:18 localhost kernel: hda: dma_timer_expiry: dma status == 0x21
Apr 15 11:15:28 localhost kernel: hda: error waiting for DMA
Apr 15 11:15:28 localhost kernel: hda: dma timeout retry: status=0x58 { DriveReady SeekComplete DataRequest }
Apr 15 11:15:28 localhost kernel:
Apr 15 11:15:28 localhost kernel: hda: status timeout: status=0xd0 { Busy }
Apr 15 11:15:28 localhost kernel:
Apr 15 11:15:28 localhost kernel: hda: no DRQ after issuing WRITE
Apr 15 11:15:29 localhost kernel: ide0: reset: success
Sometimes this completely locks up the system. Since this happened with two disks, I suppose it's some other factor. I tried using another IDE cable, and different kernels (2.4.25 and 2.6.5) without success. Finally, I found out that disabling dma (with ide=nodma) makes the problem disappear, but of course with a big performance penalty. The strange thing is also that the disk worked flawlessly for about a week.
How can I discover the culprit? Thanks for any idea or help.
|
|
|
04-16-2004, 09:01 PM
|
#2
|
LQ Guru
Registered: Feb 2003
Location: Virginia, USA
Distribution: Debian 12
Posts: 8,381
|
"How can I discover the culprit?"
It may be the IDE chipset. You could switch the cable from IDE0 to IDE1 (and make the corresponding changes in /etc/fstab and your bootloader configuration). This only a partial test because IDE0 and IDE1 do have some circuitry in common. But if the problem is in the IDE chipset and in the part of the circuitry dedicated to IDE0 then this swap will show that the problem is in the IDE chipset.
___________________________________
Be prepared. Create a LifeBoat CD.
http://users.rcn.com/srstites/LifeBo...home.page.html
Steve Stites
|
|
|
04-17-2004, 01:22 PM
|
#3
|
LQ Newbie
Registered: Apr 2004
Posts: 3
Original Poster
Rep:
|
Thanks for the suggestion.
|
|
|
11-08-2004, 08:36 PM
|
#4
|
Member
Registered: Feb 2004
Location: Washington, USA
Distribution: Debian
Posts: 265
Rep:
|
I started having these same type of errors all the time when I upgraded to a stock debian 2.6.x kernel. Prior to my upgrade, I was dead-on stable (no glitches, etc) for a full year!
I've even downloaded HDD mfgr's disk scanning utilities and done their exhaustive diagnostics on all of my disks and come up with no errors.
I'm starting to think that the late 2.4 (and upwards) kernels introduced the culprit....
|
|
|
11-13-2004, 03:38 PM
|
#5
|
LQ Newbie
Registered: Nov 2004
Posts: 1
Rep:
|
Like the previous poster, I started getting this error (and a bunch of related ones) on a system that had been formerly stable for several years, and it started the second I installed Fedora Core 3. Before, I had been using the default RedHat 2.4 kernel, and now I'm using the 2.6 kernel. I'm also using reiserfs, and this error has been corrupting the file systems on my second IDE controller quite dramatically. I can't even back the files up.
|
|
|
11-13-2004, 04:23 PM
|
#6
|
Member
Registered: Feb 2004
Location: Washington, USA
Distribution: Debian
Posts: 265
Rep:
|
I found a work around for backing up files while I nail down the source of this problem....
I grabbed an ISO of Knoppix a while ago (version 3.3) and I booted off that Knoppix disc. Then I was able to mount and rsync my important stuff onto other disk drives in case I corrupt my filesystems beyond recovery while trouble-shooting.
I've been playing with re-compiling kernel 2.6.9 but haven't yet been able to nail down which disk option is causing these errors.
Maybe DMA code is changed in new kernels? Maybe it's enabled by default and it wasn't in the older kernels?
My hdparm -I /dev/hde output shows an asterisk next to UDMA5... but does that really mean DMA is on or off? I dunno.....
|
|
|
12-16-2004, 02:56 PM
|
#7
|
LQ Newbie
Registered: Dec 2004
Posts: 3
Rep:
|
I'm having similar problems. I'm wondering if it's the driver for a certain ide controller or something more generic. What ide controler are people using? This is mine:
0000:00:10.0 IDE interface: ALi Corporation M5229 IDE (rev c4)
|
|
|
12-19-2004, 07:44 PM
|
#8
|
Member
Registered: Feb 2004
Location: Washington, USA
Distribution: Debian
Posts: 265
Rep:
|
I've got 2 controllers....
Onboard:
ICH4: IDE controller at PCI slot 0000:00:1f.1
PCI add-in card:
PDC20267: IDE controller at PCI slot 0000:01:05.0
Since my last post, I recompiled dozens of times (literally) from kernel 2.4.22 up through 2.6.10 RC2 and enabling disabling many different options related to IDE. I also swapped controller cards to rule out hardware failure. The best I have come up with was grabbing the .config file from Knoppix and re-compiled 2.6.10 RC2, turning off just enough "options" to get a successful compile.
I'm stable again, but at a serious hit on drive performance.
Timing buffered disk reads: 10 MB in 3.27 seconds = 3.06 MB/sec used to be more like 60 MB in the same 3.27 seconds.
The Knoppix .config file turns off DMA......
|
|
|
12-20-2004, 04:43 PM
|
#9
|
LQ Newbie
Registered: Dec 2004
Posts: 3
Rep:
|
@elfoozo: Do you know at which kernel you started seeing this behavior? I've always used a 2.5/2.6 kernel on this machine and I'm pretty sure this problem was absent before. I do not really remember when it started, because at first I thought it was a failing drive. If it hadn't been a laptop, I probably would have replaced it already...
|
|
|
12-20-2004, 05:21 PM
|
#10
|
Member
Registered: Feb 2004
Location: Washington, USA
Distribution: Debian
Posts: 265
Rep:
|
I first noticed it on kernel 2.6.9 because I upgraded using apt-get.
Other Internet searches I've read mention 2.4.18 when first noted.
|
|
|
01-02-2005, 12:31 AM
|
#11
|
Member
Registered: Feb 2004
Location: Washington, USA
Distribution: Debian
Posts: 265
Rep:
|
Success! I re-compiled 2.6.10 and stripped out everything - including module support and have been stable on 2.6.10 for a full week. I'm even running at UDMA 5, disks are zippy again... Life is good. 
|
|
|
01-05-2005, 05:13 PM
|
#12
|
LQ Newbie
Registered: Dec 2004
Posts: 3
Rep:
|
I'm also running 2.6.10 now. For two days, so I shouldn't be cheering to early, but so far so good!
|
|
|
01-17-2005, 02:57 AM
|
#13
|
Member
Registered: Jan 2005
Distribution: Arch
Posts: 35
Rep:
|
Oh My Gods.
I went through the Exact same experience as the first poster, with the 1 year old drive, exchange, one week wait etc, except I didn't switch cables because this is on my Toshiba Laptop. I'll try installing that 2.6.10-kernel now. Must I "strip out everything" to make it work? I'm fairly n00 to that stuff...
THANKS so much for the tips.
I should've noticed how FC2test3 worked but newer distros (including Skolelinux) all got messed up. I didn't notice until I managed to do a minimal install w/o my HD breaking down, so that was in a terminal and saw the errors come.
|
|
|
01-17-2005, 05:09 AM
|
#14
|
Member
Registered: Jan 2005
Distribution: Arch
Posts: 35
Rep:
|
Scratch that. When my laptop is warm, I can't even reformat the drive from my Partition Magic floppies (the longer it's been on, the . I guess I need a new one... although it seems weird to be dying on me after just one week. How sad.
|
|
|
01-17-2005, 06:41 PM
|
#15
|
Member
Registered: Feb 2004
Location: Washington, USA
Distribution: Debian
Posts: 265
Rep:
|
By "strip out everything" I mean: I selected No on every "option" unless it specifically matched my hardware.
I've since recompiled a few more times turning on more kernel features and module support and USB support and still the disk errors are gone. I'm liking 2.6.10 a lot!
|
|
|
All times are GMT -5. The time now is 05:04 AM.
|
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.
|
Latest Threads
LQ News
|
|