LinuxQuestions.org
Old 09-08-2016, 02:39 PM   #1
zpimp
Member
 
Registered: Oct 2014
Posts: 73

Rep: Reputation: Disabled
s.m.a.r.t. alternative for servers


for ide/sata we have smart, but not for servers

i know there is some proprietary stuff, which works sometimes

but for sas hdds and hardware raid i don't know

the reason i ask is that i see some poor bastards whose hdds are failing
and the only thing they do is pray

i read about predictive failure analysis
but i don't know much about it

i can't accept that you can prevent losing data on pcs but not on servers with raid
 
Old 09-08-2016, 03:46 PM   #2
thesnow
Member
 
Registered: Nov 2010
Location: Minneapolis, MN
Distribution: Ubuntu, Red Hat, Mint
Posts: 172

Rep: Reputation: 56
It is possible to monitor RAID members through smartctl, but it can be tricky to find the correct syntax and depends on your RAID card. For example,

Code:
smartctl -i -H -d megaraid,0 /dev/sda
smartctl -i -H -d megaraid,1 /dev/sda
smartctl -i -H -d megaraid,2 /dev/sda
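
The three commands above can be wrapped in a loop. This is only a minimal sketch: the function name check_megaraid_drives is made up for illustration, and it assumes the device IDs run contiguously from 0, which is not guaranteed on every controller. It relies on smartctl returning a non-zero exit status when the health check fails or the device cannot be opened.

```shell
#!/bin/sh
# Check the SMART health of each physical drive behind a MegaRAID controller.
# Assumptions (not from the thread): device IDs are contiguous from 0 and
# the given block device is a logical drive exposed by that controller.

check_megaraid_drives() {
    dev=$1      # block device exposed by the controller, e.g. /dev/sda
    count=$2    # number of device IDs to probe, starting at 0
    rc=0
    id=0
    while [ "$id" -lt "$count" ]; do
        if smartctl -H -d "megaraid,$id" "$dev" >/dev/null 2>&1; then
            echo "megaraid,$id: OK"
        else
            # $? here still holds the exit status of the smartctl call
            echo "megaraid,$id: CHECK (smartctl exit status $?)"
            rc=1
        fi
        id=$((id + 1))
    done
    return "$rc"
}

# Usage (as root, on a host with such a controller):
#   check_megaraid_drives /dev/sda 3
```

A non-zero status does not always mean a failing disk; it can also mean that no drive exists at that ID, so any CHECK line is worth following up by hand.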
 
Old 09-12-2016, 07:27 AM   #3
wildwizard
Member
 
Registered: Apr 2009
Location: Oz
Distribution: slackware64-14.0
Posts: 875

Rep: Reputation: 282
On real servers this is done at the hardware level, by the server's hardware management system; all the OS can see is the virtual disks presented by the RAID hardware.
 
Old 09-13-2016, 03:10 PM   #4
zpimp
Member
 
Registered: Oct 2014
Posts: 73

Original Poster
Rep: Reputation: Disabled
ok, so how do i check the server's hardware management system?

the crap i saw only logs when an hdd has failed and the spare kicks in
but in raids with the minimum number of hdds
when a second hdd fails after the first, the data is lost

maybe i don't know enough, but it's not normal to be unable to see the physical health of a drive in a server before it fails
 
Old 09-13-2016, 07:20 PM   #5
frankbell
LQ Guru
 
Registered: Jan 2006
Location: Virginia, USA
Distribution: Slackware, Ubuntu MATE, Mageia, and whatever VMs I happen to be playing with
Posts: 19,339
Blog Entries: 28

Rep: Reputation: 6144
Smartmontools has a page about support for RAID controllers. Frankly, I'm not sure I understand it, but I've never needed to use RAID. You may find it a good starting point:

https://www.smartmontools.org/wiki/S...ID-Controllers
 
Old 09-13-2016, 08:58 PM   #6
Ser Olmy
Senior Member
 
Registered: Jan 2012
Distribution: Slackware
Posts: 3,341

Rep: Reputation: Disabled
Quote:
Originally Posted by zpimp
for ide/sata we have smart, but not for servers
Yes, we do. All modern hard drives support S.M.A.R.T., but on larger storage systems the S.M.A.R.T. status is checked by the controller firmware, so you would use the RAID controller management software rather than smartd.

Quote:
Originally Posted by zpimp
i know there is some proprietary stuff, which works sometimes

but for sas hdds and hardware raid i don't know
If by "proprietary stuff" you're referring to management software from the likes of HP, Dell, Fujitsu, LSI, Areca and others, I'm happy to report that it all works very well.

Quote:
Originally Posted by zpimp
the reason i ask is that i see some poor bastards whose hdds are failing
and the only thing they do is pray
I've seen that too, and it's most unfortunate. Some individuals seem to believe that hard drives aren't subject to wear and tear, even though the manufacturers' MTBF data are readily available, and companies like Backblaze routinely publish reports with empirical data on the failure rates of hard drives of various types and sizes.

But you started off talking about servers. I hope you haven't seen a server going down because of faulty hard drives? Because that would indicate a seriously incompetent system administrator.

Quote:
Originally Posted by zpimp
i read about predictive failure analysis
but i don't know much about it
S.M.A.R.T. does a decent job when it comes to predicting failure, but not before a number of sectors have gone bad. In a non-RAID setup that means data has probably been lost.

Of course, if the drive suffers a sudden, catastrophic failure (head stopper coming loose, bearings seizing, electronics malfunctioning), S.M.A.R.T. will be no help at all.
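
To see how many sectors a drive has already given up on, you can pull the relevant counters out of the attribute table. A minimal sketch, assuming an ATA/SATA drive that reports the usual attribute names (they vary by vendor, and SAS drives report defects in a different format); the function name bad_sector_counts is made up for illustration.

```shell
#!/bin/sh
# Print the raw values of the SMART attributes most closely tied to bad sectors.
# Reads `smartctl -A` output for an ATA/SATA drive on stdin.
# In that table the attribute name is field 2 and the raw value is field 10.

bad_sector_counts() {
    awk '$2 == "Reallocated_Sector_Ct" ||
         $2 == "Current_Pending_Sector" ||
         $2 == "Offline_Uncorrectable" { print $2 "=" $10 }'
}

# Usage (as root, on a directly attached drive):
#   smartctl -A /dev/sda | bad_sector_counts
```

Raw values that are non-zero and still climbing are the usual cue to replace the drive before the overall health status flips to failing.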

Quote:
Originally Posted by zpimp
i can't accept that you can prevent losing data on pcs but not on servers with raid
You don't really believe that's the case, do you? Server manufacturers solved that problem many years ago with management and monitoring software.

Here's what you do:

Make sure the management software for your server/controller is configured to send notifications whenever a drive fails. If you use software RAID, mdadm in monitor mode and smartd will get the job done.

Make sure at least two different notification mechanisms are used to minimize the risks of silent failure.

Make sure the RAID set is verified/scrubbed regularly to catch growing defects ("bit rot") on rarely used areas of the disks. Most hardware RAID controllers can do either scheduled or continuous background/idle scrubbing.

Consider installing an online spare if there are no systems administrators on-site.

Consider using RAID 6 if the RAID set is large enough that there's a real risk of a second drive failing during rebuild.
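
For the software RAID case, the notification and scrubbing steps above could look roughly like this. It is a sketch for Linux md arrays only; hardware controllers need the vendor's own tools. The function names and the SYSFS_ROOT variable are made up for illustration (SYSFS_ROOT just lets you try the functions without a real array), and the array name md0 is an assumption.

```shell
#!/bin/sh
# Sketch for Linux software RAID (md) only.
# SYSFS_ROOT is a hypothetical override for dry runs; it defaults to /sys.

# Ask the kernel to scrub (read and compare) the whole md array.
start_md_scrub() {
    md=$1                                   # array name, e.g. md0
    echo check > "${SYSFS_ROOT:-/sys}/block/$md/md/sync_action"
}

# Report the mismatch count recorded by the last check.
md_mismatch_count() {
    md=$1
    cat "${SYSFS_ROOT:-/sys}/block/$md/md/mismatch_cnt"
}

# Send a test alert for every array found, to confirm that mail delivery works.
md_test_alert() {
    mdadm --monitor --scan --oneshot --test --mail="$1"
}

# Usage (as root, on a host with an md array named md0):
#   md_test_alert root@localhost
#   start_md_scrub md0
#   md_mismatch_count md0
```

Many distributions already ship a cron job or timer that runs the check periodically, so it is worth looking for that before scheduling your own.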
 
  

