LinuxQuestions.org
Latest LQ Deal: Latest LQ Deals
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices


Reply
  Search this Thread
Old 03-23-2005, 09:25 PM   #1
inaki
Member
 
Registered: Mar 2005
Posts: 94

Rep: Reputation: 15
hardware scsi error


How to interprate this message error taken from kern.log

Mar 21 09:16:12 kap kernel: Current sd08:03: sense key Hardware Error
Mar 21 09:16:12 kap kernel: Additional sense indicates Defect list error
Mar 21 09:16:12 kap kernel: I/O error: dev 08:03, sector 64243976
Mar 21 09:16:13 kap kernel: Info fld=0x3e4f77c, Current sd08:03: sense key Recovered Error
Mar 22 17:58:04 kap kernel: Info fld=0x24cab94 (nonstd), Current sd08:03: sense
key None
Mar 22 17:58:04 kap kernel: I/O error: dev 08:03, sector 37486672
Mar 22 17:58:59 kap kernel: SCSI disk error : host 0 channel 0 id 0 lun 0 return
code = 8000002
Mar 22 17:58:59 kap kernel: Info fld=0x24cab94 (nonstd), Current sd08:03: sense
key None


How to solve this error
 
Old 03-24-2005, 01:57 PM   #2
rnturn
Senior Member
 
Registered: Jan 2003
Location: Illinois (SW Chicago 'burbs)
Distribution: openSUSE, Raspbian, Slackware. Previous: MacOS, Red Hat, Coherent, Consensys SVR4.2, Tru64, Solaris
Posts: 2,803

Rep: Reputation: 550Reputation: 550Reputation: 550Reputation: 550Reputation: 550Reputation: 550
It looks as though the disk has developed (or developing) a bad block. From the messages, it appears to be somewhere in the sda3 partition (sd 08:03). That bit about the defect list error leads me to believe that the block in question might not have been found on the drives list of known defects, i.e., it's a newly developed bad block or one that's getting questionable. It does look like it eventually recovered from the first error. This is definitely something to keep an eye on. Caveat: I'm not getting this from reading the SCSI device drivers to see where the messages are coming from but rather just trying to interpret the messages.

If the problem persists, you might need to look into backing everything up and running "badblocks" on the disk after booting from a rescue CD. Space permitting, transferring everything to another disk could work as a backup (making a large tar archive from each filesystem on sdb, for example). Otherwise, tape is probably your best bet. To update the badblock table and remove the block that caused this error from use, you could either run "badblocks" followed by "mke2fs" on each partition to update the badblock table or, if the SCSI HBA allows it, you could do the media check on the entire drive via the HBA's firmware. (I'm assuming that this is even possible with your HBA.) Both of these will put your data at risk. I've done the firmware-based checks on Adaptec boards quite a lot and know that it's a destructive operation; the firmware makes darn sure you understand this as well. After you've found all the bad blocks and updated the badblock table on the disk, recreate the partitions (mandatory if the HBA does the badblock checking), and restore from your backup. Your boot loader might be toast following this (definitely so if the firmware check was done) so you might need to reinstall that as well while you're booted off your rescue CD. This is a messy process and one to avoid if this kind of error is not frequent. But if you're noticing it more and more and/or you're seeing files come up corrupted, the disk may be going seriously bad and should be replaced before you lose everything on it.

If all this sounds pretty gory it's because, well, it is. But if you're careful, you should not have to lose any files or even go through an entire reinstallation of the OS and your applications. (BTW, I've got a similar sort of operation planned in the near future. Not because of bad blocks but because of a spin-up problem. Can't say I'm looking forward to it.)

Good luck...
 
Old 03-24-2005, 08:46 PM   #3
inaki
Member
 
Registered: Mar 2005
Posts: 94

Original Poster
Rep: Reputation: 15
Thank You very much... i appreciate of your information given... Thanks buddy
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Scanning new scsi hardware with RH AS3 consty Linux - Hardware 0 11-16-2005 03:21 PM
SCSI host adapter hardware failure. DeamonSocar Linux - Hardware 0 10-19-2005 07:43 AM
Error editing /proc/scsi scsi file Thaidog Linux - Newbie 2 08-26-2004 08:19 AM
scsi error or hardware incompatible? Freaky Dave Linux - Hardware 4 07-05-2004 11:29 PM
scsi error in wine (i dont have scsi) evensen Linux - Games 3 05-11-2004 03:13 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware

All times are GMT -5. The time now is 12:48 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration