LinuxQuestions.org
LinuxAnswers - the LQ Linux tutorial section.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices

Reply
 
Search this Thread
Old 04-16-2004, 05:49 PM   #1
dbr
LQ Newbie
 
Registered: Apr 2004
Posts: 3

Rep: Reputation: 0
dma_timer_expiry when using hard disk


Hi,

I've been getting hard-disk related errors. It first happened with a 1 year old disk, and I thought it was a hardware failure. However I replaced it with a new one, and I still get errors. Here is an excerpt from /var/log/kern.log:


Apr 15 11:14:47 localhost kernel: hda: dma_timer_expiry: dma status == 0x21
Apr 15 11:14:57 localhost kernel: hda: error waiting for DMA
Apr 15 11:14:57 localhost kernel: hda: dma timeout retry: status=0x58 { DriveReady SeekComplete DataRequest }
Apr 15 11:14:57 localhost kernel:
Apr 15 11:14:58 localhost kernel: hda: read_intr: status=0x51 { DriveReady SeekComplete Error }
Apr 15 11:14:58 localhost kernel: hda: read_intr: error=0x04 { DriveStatusError
}
Apr 15 11:14:58 localhost kernel: hda: read_intr: status=0x51 { DriveReady SeekComplete Error }
Apr 15 11:14:58 localhost kernel: hda: read_intr: error=0x04 { DriveStatusError
}
Apr 15 11:14:58 localhost kernel: hda: read_intr: status=0x51 { DriveReady SeekComplete Error }
Apr 15 11:14:58 localhost kernel: hda: read_intr: error=0x04 { DriveStatusError
}
Apr 15 11:14:58 localhost kernel: hda: read_intr: status=0x51 { DriveReady SeekComplete Error }
Apr 15 11:14:58 localhost kernel: hda: read_intr: error=0x04 { DriveStatusError
}
Apr 15 11:14:58 localhost kernel: ide0: reset: success
Apr 15 11:15:18 localhost kernel: hda: dma_timer_expiry: dma status == 0x21
Apr 15 11:15:28 localhost kernel: hda: error waiting for DMA
Apr 15 11:15:28 localhost kernel: hda: dma timeout retry: status=0x58 { DriveReady SeekComplete DataRequest }
Apr 15 11:15:28 localhost kernel:
Apr 15 11:15:28 localhost kernel: hda: status timeout: status=0xd0 { Busy }
Apr 15 11:15:28 localhost kernel:
Apr 15 11:15:28 localhost kernel: hda: no DRQ after issuing WRITE
Apr 15 11:15:29 localhost kernel: ide0: reset: success

Sometimes this completely locks up the system. Since this happened with two disks, I suppose it's some other factor. I tried using another IDE cable, and different kernels (2.4.25 and 2.6.5) without success. Finally, I found out that disabling dma (with ide=nodma) makes the problem disappear, but of course with a big performance penalty. The strange thing is also that the disk worked flawlessly for about a week.

How can I discover the culprit? Thanks for any idea or help.
 
Old 04-16-2004, 10:01 PM   #2
jailbait
Guru
 
Registered: Feb 2003
Location: Blue Ridge Mountain
Distribution: Debian Wheezy, Debian Jessie
Posts: 7,590

Rep: Reputation: 187Reputation: 187
"How can I discover the culprit?"

It may be the IDE chipset. You could switch the cable from IDE0 to IDE1 (and make the corresponding changes in /etc/fstab and your bootloader configuration). This only a partial test because IDE0 and IDE1 do have some circuitry in common. But if the problem is in the IDE chipset and in the part of the circuitry dedicated to IDE0 then this swap will show that the problem is in the IDE chipset.

___________________________________
Be prepared. Create a LifeBoat CD.
http://users.rcn.com/srstites/LifeBo...home.page.html

Steve Stites
 
Old 04-17-2004, 02:22 PM   #3
dbr
LQ Newbie
 
Registered: Apr 2004
Posts: 3

Original Poster
Rep: Reputation: 0
Thanks for the suggestion.
 
Old 11-08-2004, 09:36 PM   #4
elfoozo
Member
 
Registered: Feb 2004
Location: MI
Distribution: debian unstable
Posts: 252

Rep: Reputation: 32
I started having these same type of errors all the time when I upgraded to a stock debian 2.6.x kernel. Prior to my upgrade, I was dead-on stable (no glitches, etc) for a full year!

I've even downloaded HDD mfgr's disk scanning utilities and done their exhaustive diagnostics on all of my disks and come up with no errors.

I'm starting to think that the late 2.4 (and upwards) kernels introduced the culprit....
 
Old 11-13-2004, 04:38 PM   #5
ptomblin
LQ Newbie
 
Registered: Nov 2004
Posts: 1

Rep: Reputation: 0
Like the previous poster, I started getting this error (and a bunch of related ones) on a system that had been formerly stable for several years, and it started the second I installed Fedora Core 3. Before, I had been using the default RedHat 2.4 kernel, and now I'm using the 2.6 kernel. I'm also using reiserfs, and this error has been corrupting the file systems on my second IDE controller quite dramatically. I can't even back the files up.
 
Old 11-13-2004, 05:23 PM   #6
elfoozo
Member
 
Registered: Feb 2004
Location: MI
Distribution: debian unstable
Posts: 252

Rep: Reputation: 32
I found a work around for backing up files while I nail down the source of this problem....

I grabbed an ISO of Knoppix a while ago (version 3.3) and I booted off that Knoppix disc. Then I was able to mount and rsync my important stuff onto other disk drives in case I corrupt my filesystems beyond recovery while trouble-shooting.

I've been playing with re-compiling kernel 2.6.9 but haven't yet been able to nail down which disk option is causing these errors.

Maybe DMA code is changed in new kernels? Maybe it's enabled by default and it wasn't in the older kernels?

My hdparm -I /dev/hde output shows an asterisk next to UDMA5... but does that really mean DMA is on or off? I dunno.....
 
Old 12-16-2004, 03:56 PM   #7
tdykstra
LQ Newbie
 
Registered: Dec 2004
Posts: 3

Rep: Reputation: 0
I'm having similar problems. I'm wondering if it's the driver for a certain ide controller or something more generic. What ide controler are people using? This is mine:

0000:00:10.0 IDE interface: ALi Corporation M5229 IDE (rev c4)
 
Old 12-19-2004, 08:44 PM   #8
elfoozo
Member
 
Registered: Feb 2004
Location: MI
Distribution: debian unstable
Posts: 252

Rep: Reputation: 32
I've got 2 controllers....

Onboard:
ICH4: IDE controller at PCI slot 0000:00:1f.1

PCI add-in card:
PDC20267: IDE controller at PCI slot 0000:01:05.0

Since my last post, I recompiled dozens of times (literally) from kernel 2.4.22 up through 2.6.10 RC2 and enabling disabling many different options related to IDE. I also swapped controller cards to rule out hardware failure. The best I have come up with was grabbing the .config file from Knoppix and re-compiled 2.6.10 RC2, turning off just enough "options" to get a successful compile.

I'm stable again, but at a serious hit on drive performance.

Timing buffered disk reads: 10 MB in 3.27 seconds = 3.06 MB/sec used to be more like 60 MB in the same 3.27 seconds.

The Knoppix .config file turns off DMA......
 
Old 12-20-2004, 05:43 PM   #9
tdykstra
LQ Newbie
 
Registered: Dec 2004
Posts: 3

Rep: Reputation: 0
@elfoozo: Do you know at which kernel you started seeing this behavior? I've always used a 2.5/2.6 kernel on this machine and I'm pretty sure this problem was absent before. I do not really remember when it started, because at first I thought it was a failing drive. If it hadn't been a laptop, I probably would have replaced it already...
 
Old 12-20-2004, 06:21 PM   #10
elfoozo
Member
 
Registered: Feb 2004
Location: MI
Distribution: debian unstable
Posts: 252

Rep: Reputation: 32
I first noticed it on kernel 2.6.9 because I upgraded using apt-get.

Other Internet searches I've read mention 2.4.18 when first noted.
 
Old 01-02-2005, 01:31 AM   #11
elfoozo
Member
 
Registered: Feb 2004
Location: MI
Distribution: debian unstable
Posts: 252

Rep: Reputation: 32
Success! I re-compiled 2.6.10 and stripped out everything - including module support and have been stable on 2.6.10 for a full week. I'm even running at UDMA 5, disks are zippy again... Life is good.
 
Old 01-05-2005, 06:13 PM   #12
tdykstra
LQ Newbie
 
Registered: Dec 2004
Posts: 3

Rep: Reputation: 0
I'm also running 2.6.10 now. For two days, so I shouldn't be cheering to early, but so far so good!
 
Old 01-17-2005, 03:57 AM   #13
kiwibird
Member
 
Registered: Jan 2005
Distribution: Arch
Posts: 35

Rep: Reputation: 15
Oh My Gods.
I went through the Exact same experience as the first poster, with the 1 year old drive, exchange, one week wait etc, except I didn't switch cables because this is on my Toshiba Laptop. I'll try installing that 2.6.10-kernel now. Must I "strip out everything" to make it work? I'm fairly n00 to that stuff...

THANKS so much for the tips.
I should've noticed how FC2test3 worked but newer distros (including Skolelinux) all got messed up. I didn't notice until I managed to do a minimal install w/o my HD breaking down, so that was in a terminal and saw the errors come.
 
Old 01-17-2005, 06:09 AM   #14
kiwibird
Member
 
Registered: Jan 2005
Distribution: Arch
Posts: 35

Rep: Reputation: 15
Scratch that. When my laptop is warm, I can't even reformat the drive from my Partition Magic floppies (the longer it's been on, the . I guess I need a new one... although it seems weird to be dying on me after just one week. How sad.
 
Old 01-17-2005, 07:41 PM   #15
elfoozo
Member
 
Registered: Feb 2004
Location: MI
Distribution: debian unstable
Posts: 252

Rep: Reputation: 32
By "strip out everything" I mean: I selected No on every "option" unless it specifically matched my hardware.

I've since recompiled a few more times turning on more kernel features and module support and USB support and still the disk errors are gone. I'm liking 2.6.10 a lot!
 
  


Reply

Tags
controller, kernel, pci, promise


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Cloning 1 hard disk to smaller hard disk, dd or ghost? fireman949 Linux - Hardware 8 12-23-2012 01:51 AM
Reiser file system / Hard Disk/ Hard Drive Problems Oxyacetylene Linux - Software 4 10-10-2005 03:24 PM
Reclaim disk partition from hard disk installation EStester Linux - Enterprise 2 03-09-2005 12:46 PM
Reclaim disk partition from hard disk installation EStester Linux - General 1 03-09-2005 12:25 PM
Hard Disk question - Making a big hard disk MrAnonym Linux - General 2 09-15-2003 07:08 PM


All times are GMT -5. The time now is 08:41 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration