LinuxQuestions.org


Diubidone 10-30-2009 08:19 AM

Is My Hard Disk failing?
 
3 Attachment(s)
Hi,

OK, so we have this Red Hat machine at my work. It's been up since 2007 and has never given us any problems. All updates, including the latest ones, have installed correctly.

attached file yum.txt

I don't know exactly when, but around October 29th the system froze, and we had to restart the machine manually.

attached file dmesg.txt

What's with the "ext3_orphan_cleanup: deleting unreferenced inode" message?

Am I losing my disk or my filesystem? Should I prepare for the worst? (I have already backed up what needed to be saved.)

Output from the lspci command for hardware specs: attached lspci.txt

System kernel: 2.6.18-164.2.1.el5

Can anyone give me some direction on how to tell whether the system is about to fail?

TB0ne 10-30-2009 12:06 PM

Quote:

Originally Posted by Diubidone (Post 3737900)
Hi,

OK, so we have this Red Hat machine at my work. It's been up since 2007 and has never given us any problems. All updates, including the latest ones, have installed correctly.

I don't know exactly when, but around October 29th the system froze, and we had to restart the machine manually.

What's with the "ext3_orphan_cleanup: deleting unreferenced inode" message?

Am I losing my disk or my filesystem? Should I prepare for the worst? (I have already backed up what needed to be saved.)

System kernel: 2.6.18-164.2.1.el5

Can anyone give me some direction on how to tell whether the system is about to fail?

Well, two years of constant uptime is a good bit. The inode messages could be from the system not being shut down properly, but the best place to look for suspicious things is /var/log/messages. Look for anything related to the hard drive device name.

Two years isn't very long, but the disk could be failing. You've taken the right steps by making sure you've got a backup and that you're prepared for a crash. And that isn't an "if"...it's a "when"..:) All systems die eventually. If you see drive-related failures, I would try to schedule a time to replace the drives before an unplanned outage. And if you're not short on $$$, I'd suggest upgrading the entire server, too...hardware is cheap these days, and you can migrate the services/files over at your leisure.
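
For the /var/log/messages check suggested above, something like the following rough sketch might help narrow things down. The function name disk_log_lines and the device name sda in the usage line are only assumptions for illustration, and the error phrases are just a couple of common kernel messages, not a complete list.

```shell
# Print log lines (read from stdin) that mention a given disk device
# or one of a few common kernel disk-error phrases.
# Usage: disk_log_lines DEVICE < logfile
# DEVICE is whatever name your system uses for the disk; "sda" is only a guess.
disk_log_lines() {
    dev="$1"
    # -i: ignore case; each -e adds one pattern to look for
    grep -i -e "$dev" -e 'I/O error' -e 'medium error'
}
```

It would be run as root along the lines of: disk_log_lines sda < /var/log/messages. No output means none of the patterns matched, which is not proof that the disk is healthy.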

Diubidone 11-03-2009 03:46 AM

Checking if disk is failing...
 
What are the best steps in order to understand if my disk is going to fail?

I have a RAID 1 array (2 disks) with an LVM configuration.

amani 11-03-2009 05:37 AM

smartctl is very accurate
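
As a minimal sketch of how that could be scripted: smartctl -H prints the drive's overall health self-assessment, and the hypothetical helper below (smart_health_ok is a made-up name) just checks that output for the usual "healthy" wording. The device name /dev/sda in the usage line is an assumption, and drives behind a hardware RAID controller may need smartctl's -d option to be reachable at all.

```shell
# Succeed (exit status 0) if the smartctl -H output on stdin reports a
# healthy drive. ATA drives print "...test result: PASSED"; SCSI drives
# print "SMART Health Status: OK".
smart_health_ok() {
    grep -q -E 'test result: PASSED|SMART Health Status: OK'
}
```

Possible usage, as root: smartctl -H /dev/sda | smart_health_ok && echo "SMART health looks OK". A passing result only means the drive has not flagged itself as failing; the full attribute table from smartctl -a is still worth reading.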

amani 11-03-2009 05:42 AM

Your files do not suggest impending drive failure.

# man hdparm

Run hdparm with the -I option and post the output.


All times are GMT -5.