LinuxQuestions.org
Review your favorite Linux distribution.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices


Reply
  Search this Thread
Old 12-18-2009, 05:45 AM   #1
fawrell
LQ Newbie
 
Registered: Jul 2008
Location: Slovakia
Posts: 13

Rep: Reputation: 0
Write I/O error on logical drive


Hello.

I have some weird problem with one disk on HP DL385 with Adaptec 2200S SCSI controller. Using Red Hat Enterprise Linux 4 Update 3. The BIOS/driver/firmware for the controller is the newest one. Everything started when I was copyring some data through ftp from another server. The server suddenly got frozen, we couldn't even log on it. After a while it was ok again but in /var/log/messages we saw tons of messages like this:
Code:
kernel: SCSI error : <0 0 3 0> return code = 0x8000002
kernel: Info fld=0x0, Current sdd: sense key Hardware Error
kernel: Additional sense: Internal target failure
kernel: end_request: I/O error, dev sdd, sector 163749039
kernel: Buffer I/O error on device sdd1, logical block 20468622
kernel: lost page write due to I/O error on sdd1
And because the Adaptec diagnostic utility marked the logical drive (the logical drive was not redundant, it contained only one physical disk) and the phys. drive as failed, we replaced the failed drive with the same but new drive (its HP 146GB drive, both have same firmware). The process of deleting old logical disk and creating new one was done without any error messages and we did it without restarting the system. Then we created partition on the log. disk with fdisk. All fine just few error messages in logs:

Code:
kernel: SCSI error : <0 0 3 0> return code = 0x8000002
kernel: Info fld=0x0, Current sdd: sense key Hardware Error
kernel: Additional sense: Internal target failure
kernel: end_request: I/O error, dev sdd, sector 286749480
kernel: Buffer I/O error on device sdd, logical block 35843685
All are identical (same sector/logical block). Because the disk has got same dev name as the disk before, and because the partition was created successfully we think that kernel had some old data and this triggered the error messages. When we were accessing the disk later with fdisk, no error mesages were logged. Then we created file system on the partition without any problems just again some messages in logs like this:

Code:
kernel: Buffer I/O error on device sdd1, logical block 143371968
kernel: lost page write due to I/O error on sdd1
But the fs was created successfully and the forced fs check didn't show any errors. We tried to copy 1GB of data onto it without problems. So we thank its all ok. But at the end of copying 70GB of data the system frozen again. We had to hard restart it and this was in logs:

Code:
kernel: Buffer I/O error on device sdd1, logical block 35834863
kernel: lost page write due to I/O error on sdd1
Lots of them, logical blocks are different. But the phys. and log. drive isn't marked as failed. No other error messages are logged (just fs ones). So why it can write almost 70GB of data without any problems but it fails at the last file? There are 3 other log. drives on the same controller and they are without any problem. It fails just on this one. The phys drive there is a new one, so I don't think it is broken. What may be the reason of the problem?

Sorry for the longer post. Ty for answers!
 
Old 12-18-2009, 07:19 AM   #2
onebuck
Moderator
 
Registered: Jan 2005
Location: Central Florida 20 minutes from Disney World
Distribution: SlackwareŽ
Posts: 13,925
Blog Entries: 44

Rep: Reputation: 3159Reputation: 3159Reputation: 3159Reputation: 3159Reputation: 3159Reputation: 3159Reputation: 3159Reputation: 3159Reputation: 3159Reputation: 3159Reputation: 3159
Hi,

Since you have replaced the hdd with another and the same errors occur then the potential problems indicate a controller.

I would do some testing on the controller with manufacture diagnostics to insure that's not the problem. Check the cable and connectors integrity.

You may need to power down the system to allow the cleaning of the controller edge & connector. Do not use a pencil eraser or the like. You can use one inch folded 20# paper and denatured alcohol to wipe the edge. For the connector a credit card with the paper folded over, wet then push into the connector. One wipe per paper and in one direction for the edge. One insert per piece for the connector. Discard the used paper and use a new piece each time, if you don't then all you will be doing is transferring dirt between wipes.

 
Old 12-21-2009, 03:52 AM   #3
fawrell
LQ Newbie
 
Registered: Jul 2008
Location: Slovakia
Posts: 13

Original Poster
Rep: Reputation: 0
Hey.

Ty for answer. Hmm power down system, cleaning, it doesn't sound good :-P . But prob. we will have to do it. I was thinking about differences between this log. drive and the others 3 there. And I found out that this one has enabled write cache. Can the problem occur because of some problems with the cache? Or can be it due some kernel bug? Using Red Hat Enterprise Linux 4, kernel 2.6.9-34.ELsmp x64.

Again ty for answer!
 
Old 01-04-2010, 02:52 AM   #4
konsolebox
Senior Member
 
Registered: Oct 2005
Distribution: Gentoo, Slackware, LFS
Posts: 2,248
Blog Entries: 8

Rep: Reputation: 235Reputation: 235Reputation: 235
There's a little possibility that it's a kernel prob but you might also upgrade your kernel to at least 2.6.27.XX. 2.6.9 is already a bit too old. It can also be the controller or the wire.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
IBM 3590 Tape Drive, Ubuntu Sever, MT read/write error penguinboy08 Linux - Hardware 11 02-14-2009 09:44 AM
Trying to run fsck and get an error message, hard drive is write protected. RMHusers Linux - Newbie 3 04-01-2008 01:32 PM
hard drive write error aloishis89 Linux - Hardware 4 01-16-2008 08:26 PM
problem writing dvds get ab error Illegal write mode for this drive procfs Linux - Hardware 2 10-10-2006 01:44 AM
Bash - how to write logical expressions? lowpro2k3 Programming 2 03-24-2005 03:39 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware

All times are GMT -5. The time now is 08:06 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration