LinuxQuestions.org
Old 07-07-2009, 05:03 PM   #1
jsteel
Member
 
Registered: Mar 2007
Location: England
Distribution: Arch
Posts: 392

Rep: Reputation: 34
Hard drive Failing?


I installed some hard drives in a new computer (I haven't used these drives in a while). I created a new ext3 file system on them and tested them with smartctl. When I run a test with smartctl, it fails with "Completed: read failure" with 90% remaining.

I found some information regarding bad blocks and a way to tell the computer to not use various portions of the drive, but I don't know where the problem is as the "LBA_of_first_error" shown after the test is run is blank. After a bit of research, every case I have found shows some information here. Is it worth looking into this further or does it mean that the drive is dying and it's better off in the bin?

The overall-health self-assessment test result is passed, so it seems functional for now. It's a Samsung 250GB IDE HDD.
 
Old 07-07-2009, 05:30 PM   #2
xeleema
Member
 
Registered: Aug 2005
Location: D.i.t.h.o, Texas
Distribution: Slackware 13.x, rhel3/5, Solaris 8-10(sparc), HP-UX 11.x (pa-risc)
Posts: 988
Blog Entries: 4

Rep: Reputation: 254
Greetingz!

Bad blocks should be automatically detected and reallocated by the drive's integrated drive electronics (IDE) and the accompanying firmware. The best way to find out whether a disk is dying is to run "dd" against it and watch your syslog (/var/log/messages in most Linux distributions).

In one terminal window, run the following command;
time dd if=/dev/hda1 of=/dev/null
(NOTE: Don't get the "if=" and "of=" designations swapped around, or you will wipe out the contents of the drive.)

In another, run something similar to;
tail -f /var/log/messages

If you start to see "Drive Seek" errors, or something that looks like this;
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
ide: failed opcode was: unknown
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
ide: failed opcode was: unknown
hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hda: dma_intr: error=0x84 { DriveStatusError BadCRC }
ide: failed opcode was: unknown

Then you've got a dying drive on your hands.
Also, listening for the "Click of Death" is a good indicator.
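
A minimal sketch of automating the log check described above, for anyone who would rather count the errors than watch them scroll by. It assumes the kernel messages look like the sample lines quoted in this post; the function name count_drive_errors is made up for illustration, and the patterns are not an exhaustive list of failure messages.

```shell
#!/bin/sh
# Count log lines that look like the IDE error messages quoted above.
# Usage: count_drive_errors DEVICE < logfile
# DEVICE is the short kernel name (e.g. hda). The two patterns come from the
# sample messages in this post; other kernels or drivers may word them differently.
count_drive_errors() {
    dev="$1"
    # grep -c prints the number of matching lines; "|| true" keeps a zero
    # count from being treated as a failure by the calling shell.
    grep -c -E "(^|[[:space:]])${dev}: .*(DriveStatusError|SeekComplete Error)" || true
}

# Example (needs read access to the log):
#   count_drive_errors hda < /var/log/messages
```

A count that keeps climbing while the "dd" read is running points the same way as the messages above.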

G'luck in yer struggles, chummer!

Last edited by xeleema; 07-07-2009 at 07:42 PM. Reason: Added note regarding the "dd" command.
 
Old 07-08-2009, 04:17 AM   #3
H_TeXMeX_H
LQ Guru
 
Registered: Oct 2005
Location: $RANDOM
Distribution: slackware64
Posts: 12,928
Blog Entries: 2

Rep: Reputation: 1301
smartctl is used to test entire disks, not partitions, so I'm hoping you ran it on the whole disk. What command did you use? It should be something like:

Code:
smartctl -t long /dev/sda
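
Once the long test has finished, its result lands in the drive's self-test log, which `smartctl -l selftest` prints. The following is a rough sketch of filtering that log for failed entries; the function name failed_selftests is hypothetical, and it assumes the usual log layout where each entry starts with "# <number>" and failure statuses read "Completed: <reason>".

```shell
#!/bin/sh
# Print the entries of a SMART self-test log that ended in a failure status.
# Reads the text produced by `smartctl -l selftest DEVICE` on standard input.
failed_selftests() {
    # Keep only log entries (lines starting with "# <number>"), then keep the
    # ones whose status contains "Completed: ", e.g. "Completed: read failure".
    # A clean run reads "Completed without error" and is filtered out.
    grep -E '^# *[0-9]+' | grep -E 'Completed: ' || true
}

# Typical sequence (as root; the long test can take an hour or more):
#   smartctl -t long /dev/sda
#   smartctl -l selftest /dev/sda | failed_selftests
```

Empty output means no logged test ended with a failure status; it does not by itself prove the disk is healthy.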
 
Old 07-08-2009, 05:08 AM   #4
jsteel
Member
 
Registered: Mar 2007
Location: England
Distribution: Arch
Posts: 392

Original Poster
Rep: Reputation: 34
xeleema,
Thanks I will try that later.

H_TeXMeX_H,
Yes that is the command I ran (not sda1 for example).
 
Old 07-08-2009, 05:19 AM   #5
H_TeXMeX_H
LQ Guru
 
Registered: Oct 2005
Location: $RANDOM
Distribution: slackware64
Posts: 12,928
Blog Entries: 2

Rep: Reputation: 1301
Well, if the test failed, then there is a problem with the disk. Can you post the attributes and test results as they appear in the output of 'smartctl -a /dev/sda'?
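
If the full output is too long to read comfortably, a small helper can pull out single attributes. This is only a sketch: the name smart_raw_value is invented here, and it assumes the common ten-column attribute table printed by `smartctl -A`, with the attribute name in column 2 and RAW_VALUE in column 10.

```shell
#!/bin/sh
# Print the raw value of one SMART attribute from `smartctl -A DEVICE` output
# read on standard input. Prints nothing if the attribute is not listed.
smart_raw_value() {
    awk -v name="$1" '$2 == name { print $10 }'
}

# Example (as root):
#   smartctl -A /dev/sda | smart_raw_value Reallocated_Sector_Ct
#   smartctl -A /dev/sda | smart_raw_value Current_Pending_Sector
```

Non-zero raw values for reallocated or pending sectors are worth posting along with the self-test results.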
 
Old 07-08-2009, 04:07 PM   #6
jsteel
Member
 
Registered: Mar 2007
Location: England
Distribution: Arch
Posts: 392

Original Poster
Rep: Reputation: 34
xeleema,
Yes it came up with "media errors" after a minute or two of running.

H_TeXMeX_H,
I'm not near the computer right now, I'll paste the output as soon as I can.
 
Old 07-08-2009, 07:19 PM   #7
onebuck
Moderator
 
Registered: Jan 2005
Location: Central Florida 20 minutes from Disney World
Distribution: Slackware®
Posts: 13,925
Blog Entries: 44

Rep: Reputation: 3159
Hi,

I would get the HDD manufacturer's diagnostics and run those. 'smartctl' is great, but I would still run the manufacturer's own diagnostics.

'UBCD (Ultimate Boot CD)' allows users to run floppy-based diagnostic tools from most CDROM drives on Intel-compatible machines, no operating system required. The bootable cd includes many diagnostic utilities.

The above link and others are available from 'Slackware-Links'. More than just Slackware® links!
 
Old 07-09-2009, 01:40 PM   #8
xeleema
Member
 
Registered: Aug 2005
Location: D.i.t.h.o, Texas
Distribution: Slackware 13.x, rhel3/5, Solaris 8-10(sparc), HP-UX 11.x (pa-risc)
Posts: 988
Blog Entries: 4

Rep: Reputation: 254
Greetingz!

'onebuck' makes a good point about the manufacturer's diagnostics. However, if you ran the full "dd" test I mentioned previously, let me save you some trouble;

This: ..."media errors" after a minute or two of running.

Means: Toss the drive. Move along, nothing to see here.

If the disk isn't stone-dead now, it will be in short order. Besides, those "media errors" typically cause I/O-wait hangs, which usually run 5 to 20 seconds. I don't know about you, but I don't want my server/project_box/workstation/etc screeching to a halt every time the "special" hard disk is accessed.

(NOTE: My advice does not apply if you're trying to scrape usable data off of that drive. _IF_ that's the case, start the file-copy now while you still can!)
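
For the data-rescue case in the note above, one common approach is to image the disk with "dd" told to keep going past unreadable sectors. The sketch below is an illustration rather than a recommendation of exact settings: rescue_copy and the destination path are made-up names, and the 4096-byte block size is an arbitrary choice. Purpose-built tools such as GNU ddrescue handle retries and logging better if one is available.

```shell
#!/bin/sh
# Copy SRC to DST, continuing past read errors instead of stopping.
# conv=noerror keeps dd going after a failed read; conv=sync pads each short
# or failed block with zeros so the data after it stays at the right offset.
rescue_copy() {
    src="$1"
    dst="$2"
    dd if="$src" of="$dst" bs=4096 conv=noerror,sync
}

# Example (as root; /mnt/backup/hda.img is a hypothetical destination on a
# different, healthy disk with enough free space):
#   rescue_copy /dev/hda /mnt/backup/hda.img
```

As with the earlier "dd" command, double-check which argument is the source and which is the destination before pressing Enter.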

Last edited by xeleema; 07-09-2009 at 01:41 PM.
 
Old 07-13-2009, 09:19 AM   #9
jsteel
Member
 
Registered: Mar 2007
Location: England
Distribution: Arch
Posts: 392

Original Poster
Rep: Reputation: 34
Yes I thought that after I saw the output. There's nothing on it so it's binned now. Thanks for your help. That's the last Samsung disk I use.
 
Old 07-16-2009, 01:33 AM   #10
xeleema
Member
 
Registered: Aug 2005
Location: D.i.t.h.o, Texas
Distribution: Slackware 13.x, rhel3/5, Solaris 8-10(sparc), HP-UX 11.x (pa-risc)
Posts: 988
Blog Entries: 4

Rep: Reputation: 254
Hold on there, jsteel.

That disk might have been kaput, but there's something you have to remember about hard drive manufacturers;

Most of the aluminum chassis are cut in Korea, the integrated circuits burned in Malaysia, and everything's assembled in Taiwan.

After dealing with literally *thousands* of hard drive failures in a Production data center, I can tell you two things;

1) If, according to the manufacture date, a drive is six months old or newer, watch it for a year.

2) If, according to the manufacture date, a drive is 18 months old, or older (and has run continuously without issue), then it'll last you five more years.

Corollary; if the drive is five years old, watch it for a year.

Have a good one!
 
Old 08-23-2009, 09:27 AM   #11
H_TeXMeX_H
LQ Guru
 
Registered: Oct 2005
Location: $RANDOM
Distribution: slackware64
Posts: 12,928
Blog Entries: 2

Rep: Reputation: 1301
Quote:
Originally Posted by xeleema
Hold on there, jsteel.

That disk might have been kaput, but there's something you have to remember about hard drive manufacturers;

Most of the aluminum chassis are cut in Korea, the integrated circuits burned in Malaysia, and everything's assembled in Taiwan.

After dealing with literally *thousands* of hard drive failures in a Production data center, I can tell you two things;

1) If, according to the manufacture date, a drive is six months old or newer, watch it for a year.

2) If, according to the manufacture date, a drive is 18 months old, or older (and has run continuously without issue), then it'll last you five more years.

Corollary; if the drive is five years old, watch it for a year.

Have a good one!
I agree, see the bathtub curve:
http://en.wikipedia.org/wiki/Bathtub_curve
 
  


All times are GMT -5. The time now is 03:57 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration