how to check for hard disk failure?
Hi to all
I'm a newbie here. I need to check for hard disk failure in my linux. Suspect "intermittent" hard disk failure is causing some of my tables in oracle database to have missing records. May I know which log and how can I check for hard disk failure? Below are my details: Linux: Red Hat Enterprise Linux ES release 4 (Nahant Update 8) Kernel \r on an \m Linux version 2.6.9-89.ELsmp (mockbuild@hs20-bc1-2.build.redhat.com) (gcc version 3.4.6 20060404 (Red Hat 3.4.6-11)) #1 SMP Mon Apr 20 10:34:33 EDT 2009 Oracle Database Oracle 9i SE(9.2.0.7) Red Hat Linux 4.8(32 bit) |
First off back up any important data.
Smartctl can be used to check the SMART status of the drive: Code:
# smartctl -a /dev/sda badblocks can scan for read errors on the drives Code:
# badblocks -s /dev/sda If you have a hardware RAID controller then SMART data should be available in the RAID management utility. Hope that helps you out. |
Quote:
|
You might see some related errors in /var/log/{syslog, messages}, but there is no log for hard disk failures - until it fails 100% and fails to mount.
|
Good question, dmesg (/var/log/messages) may show you some info regarding the drive (sda, sdb etc), mdraid (md0, md1 etc).
Code:
# cat /var/log/messages | grep -i "sda" Good luck. |
Alright. thanks everyone for helping.
|
You might want to consider installing a supported OS
RHEL 4.8 is unsupported you could upgrade to RHEL 4.9 it is now on EXTRA extended life support ( and you have to buy the extra support ) but that support will be ENDING in mid 2014 ( that is for 4.9 , 4.8 is ALREADY NOT supported ) |
Quote:
Greatly appreciate your sharing. However on another note, my company is reducing their budget. As such, we cant have upgrade anytime soon. Likewise for Oracle 9i which is already unsupported by Oracle. |
Quote:
I've worked for a company where every minute of down-time would costs them thousands of dollars of lost revenue. This is why companies that really depend on their IT infrastructure have a DR plan. (DR=disaster recovery). And that every sysadmin worth his salt at least has thought about it. Cheers |
Quote:
Depending on the company size and needs I would recommend moving over to a free and unsupported system that they can manage themselves instead, if their resources cannot support moving to the newest and latests and having all the best support packages. |
All times are GMT -5. The time now is 03:06 PM. |