Linux - SoftwareThis forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Is there a lightweight SMART exception reporting solution? Ideally something that will periodically gather SMART statistics and, when there is an exception indicating a problem worthy of investigation, send an email?
smartmontools provides the base, the ability to query drives and report statistics. It would, of course, be possible to run a script periodically that runs smartmontools commands, analyses the output and, on detecting an exception, sends an email. A sophisticated implementation would send emails reporting the same problem with decreasing frequency and would infrequently send a "green light" email to assure the recipient that is was still working.
This seems such an "obviously" useful function that it may have been developed already but so far I have only found graphical utilities such as munin, GSmartControl and maybe CrystalDiskInfo which would be fine tools to run constantly in an operations centre but are overkill for a single workstation.
infact, smartmontools does support this, see man smartd...
Thanks doc.nice Sweet as! I had been looking at smartctl but smartd is the bees knees. Some great information written by smartmontools creator, Bruce Allen, in Linux Journal. Reading that in conjunction with the smartmond man page and following its cross-references to the smartctl page, I came up with the following line in /etc/smartd.conf
Code:
/dev/sda -a -I 194 -M diminishing m root -o on -s (S/../.././17|L/../../6/15) -W 3,41,46
Is testing OK so far (was holding off posting until tested a little more) but I can't think of a way to test it properly without an (ideally simulated) failing HDD.
I use /dev/hda -a -I 194 -n standby -o on -S on -s S/../.././02 -m (myadminmailaddress) -M diminishing
but I must have missed the -W switch, will add it immediately
About the usefullness:
I already had a dying disc and must say, SMART has done its job. It told me about 3 mins before the disc finally gave up that something is bad...
I have another disc (used as swap in a private pc) that is SMART defect for more than 3 years now...
So I wouldn't give that much, but the thermal check ist quite a good indicator for real trouble in your case, I think...
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.