LinuxQuestions.org
Share your knowledge at the LQ Wiki.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie
User Name
Password
Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question? If it is not in the man pages or the how-to's this is the place!

Notices


Reply
  Search this Thread
Old 09-20-2013, 12:47 AM   #1
hahnhahnhahn
LQ Newbie
 
Registered: Sep 2013
Posts: 6

Rep: Reputation: Disabled
how to check for hard disk failure?


Hi to all

I'm a newbie here. I need to check for hard disk failure in my linux. Suspect "intermittent" hard disk failure is causing some of my tables in oracle database to have missing records. May I know which log and how can I check for hard disk failure? Below are my details:

Linux:
Red Hat Enterprise Linux ES release 4 (Nahant Update 8)
Kernel \r on an \m
Linux version 2.6.9-89.ELsmp (mockbuild@hs20-bc1-2.build.redhat.com) (gcc version 3.4.6 20060404 (Red Hat 3.4.6-11)) #1 SMP Mon Apr 20 10:34:33 EDT 2009

Oracle Database
Oracle 9i SE(9.2.0.7) Red Hat Linux 4.8(32 bit)

Last edited by hahnhahnhahn; 09-20-2013 at 12:49 AM.
 
Old 09-20-2013, 12:59 AM   #2
spazticclown
Member
 
Registered: Sep 2010
Distribution: Fedora, Android, CentOS
Posts: 91
Blog Entries: 2

Rep: Reputation: 21
First off back up any important data.

Smartctl can be used to check the SMART status of the drive:
Code:
# smartctl -a /dev/sda
Will print off the smart status for sda

badblocks can scan for read errors on the drives
Code:
# badblocks -s /dev/sda
-s Provides a nice output and percent complete.

If you have a hardware RAID controller then SMART data should be available in the RAID management utility.

Hope that helps you out.
 
2 members found this post helpful.
Old 09-20-2013, 01:11 AM   #3
hahnhahnhahn
LQ Newbie
 
Registered: Sep 2013
Posts: 6

Original Poster
Rep: Reputation: Disabled
Quote:
Originally Posted by spazticclown View Post
First off back up any important data.

Smartctl can be used to check the SMART status of the drive:
Code:
# smartctl -a /dev/sda
Will print off the smart status for sda

badblocks can scan for read errors on the drives
Code:
# badblocks -s /dev/sda
-s Provides a nice output and percent complete.

If you have a hardware RAID controller then SMART data should be available in the RAID management utility.

Hope that helps you out.
Hi, how about logs? Which log will show me hard disk failure details?
 
Old 09-20-2013, 01:19 AM   #4
astrogeek
Moderator
 
Registered: Oct 2008
Distribution: Slackware [64]-X.{0|1|2|37|-current} ::12<=X<=15, FreeBSD_12{.0|.1}
Posts: 6,269
Blog Entries: 24

Rep: Reputation: 4196Reputation: 4196Reputation: 4196Reputation: 4196Reputation: 4196Reputation: 4196Reputation: 4196Reputation: 4196Reputation: 4196Reputation: 4196Reputation: 4196
You might see some related errors in /var/log/{syslog, messages}, but there is no log for hard disk failures - until it fails 100% and fails to mount.

Last edited by astrogeek; 09-20-2013 at 01:21 AM. Reason: Added messages thanks to spazticclown's note... failing memory
 
Old 09-20-2013, 01:19 AM   #5
spazticclown
Member
 
Registered: Sep 2010
Distribution: Fedora, Android, CentOS
Posts: 91
Blog Entries: 2

Rep: Reputation: 21
Good question, dmesg (/var/log/messages) may show you some info regarding the drive (sda, sdb etc), mdraid (md0, md1 etc).
Code:
# cat /var/log/messages | grep -i "sda"
Is a good starting point.

Good luck.
 
Old 09-20-2013, 02:00 AM   #6
hahnhahnhahn
LQ Newbie
 
Registered: Sep 2013
Posts: 6

Original Poster
Rep: Reputation: Disabled
Alright. thanks everyone for helping.
 
Old 09-20-2013, 02:28 AM   #7
John VV
LQ Muse
 
Registered: Aug 2005
Location: A2 area Mi.
Posts: 17,627

Rep: Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651Reputation: 2651
You might want to consider installing a supported OS
RHEL 4.8 is unsupported
you could upgrade to RHEL 4.9
it is now on EXTRA extended life support ( and you have to buy the extra support )
but that support will be ENDING in mid 2014 ( that is for 4.9 , 4.8 is ALREADY NOT supported )
 
Old 09-20-2013, 02:36 AM   #8
hahnhahnhahn
LQ Newbie
 
Registered: Sep 2013
Posts: 6

Original Poster
Rep: Reputation: Disabled
Quote:
Originally Posted by John VV View Post
You might want to consider installing a supported OS
RHEL 4.8 is unsupported
you could upgrade to RHEL 4.9
it is now on EXTRA extended life support ( and you have to buy the extra support )
but that support will be ENDING in mid 2014 ( that is for 4.9 , 4.8 is ALREADY NOT supported )
Hi John VV,

Greatly appreciate your sharing. However on another note, my company is reducing their budget. As such, we cant have upgrade anytime soon. Likewise for Oracle 9i which is already unsupported by Oracle.
 
Old 09-21-2013, 01:38 AM   #9
gdejonge
Member
 
Registered: Aug 2010
Location: Netherlands
Distribution: Kubuntu, Debian, Suse, Slackware
Posts: 317

Rep: Reputation: 73
Quote:
Originally Posted by hahnhahnhahn View Post
Hi John VV,

Greatly appreciate your sharing. However on another note, my company is reducing their budget. As such, we cant have upgrade anytime soon. Likewise for Oracle 9i which is already unsupported by Oracle.
You really should asked your boss what would happen to the department/company when this database system will fail.
I've worked for a company where every minute of down-time would costs them thousands of dollars of lost revenue.

This is why companies that really depend on their IT infrastructure have a DR plan. (DR=disaster recovery). And that every sysadmin worth his salt at least has thought about it.

Cheers
 
Old 09-21-2013, 07:51 AM   #10
zeebra
Senior Member
 
Registered: Dec 2011
Distribution: Slackware
Posts: 1,832
Blog Entries: 17

Rep: Reputation: 638Reputation: 638Reputation: 638Reputation: 638Reputation: 638Reputation: 638
Quote:
Originally Posted by gdejonge View Post
You really should asked your boss what would happen to the department/company when this database system will fail.
I've worked for a company where every minute of down-time would costs them thousands of dollars of lost revenue.

This is why companies that really depend on their IT infrastructure have a DR plan. (DR=disaster recovery). And that every sysadmin worth his salt at least has thought about it.

Cheers
There is no reason for old systems to fail. You never need to always have the newest version to be able to use a system successfully. Many people and companies run very old systems and they work perfectly well.

Depending on the company size and needs I would recommend moving over to a free and unsupported system that they can manage themselves instead, if their resources cannot support moving to the newest and latests and having all the best support packages.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
how do I check my hard disk for errors. possible hard disk failure lgtrean Linux - Hardware 6 06-22-2011 04:06 PM
Hard Disk Failure: How do I move data to a new hard disk drive? spyros Linux - Software 2 10-31-2008 03:01 PM
Hard Disk failure? bhert Linux - Hardware 7 04-08-2007 02:07 PM
hard DIsk Failure frankkky Linux - Hardware 5 11-12-2006 03:23 PM
Hard disk failure? GazzaJ Linux - Hardware 2 08-15-2006 07:28 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie

All times are GMT -5. The time now is 03:24 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration