Daily server checklist for newbies
Hi guys. We have a variety of Linux servers, and I want to start getting my non-Linux folks into the habit of checking them every morning for like disk space, logs, at least the basics for now.
Does anyone have, or know a link to, a good basic/general daily Linux server checklist that says what to check and how? Or maybe we can make-up one in this thread, as i'm not having any luck finding one. Heck, maybe i'll even learn a few things from this :) Thanks in advance for everyone chiming in, this could be fun. |
Quote:
|
That's the kind of response I was looking for, kinda. We're trying to evaluate different free monitoring software now to see what works, although we haven't found one yet that does everything. All the servers I was thinking of having them monitor are very old, and do have problems from time to time, so thought maybe a daily morning check of certain things they could do on each one might give us an idea before something bad happens. Ideally we're hoping we'll find some monitoring software that can email us scheduled reports of disk space/usage, deamon errors, etc, but really not sure what would best do it. Not so concerned with the network anomalies as of yet. The next couple we're going to try are Nagios and OpenNMS, see how those work. but was just thinking of hopefully putting together a basic, until-we-get-monitoring-setup morning routine going. plus it would help familiarize these people with linux a bit more, as they have no experience now really.
but yeah, I do agree, a dashboard and email alerting is the end game for us, just looking for a good intermediate (manual labor) step till we get that squared away. |
Well, as far as manual labour goes, you could run 'top' to check cpu load, RAM, swap space.
Also 'df -h' for disk space. You should probably also check some key logfiles, but which ones would depend on the services each system runs. The generic/default logfile is /var/log/messages. Depending on the exact Distro, you could check /var/log/secure. HTH |
Thanks Chris, that does help, and those things you mentioned should probably be at the top of the list. Great start to the list, maybe some other people will chime in with other stuff as well.
|
Try cacti
It's free and it does a decent job displaying all kinds of stuff in neat graphics. You can find a lot of templates on the internet for various devices (such as Linux machines, Cisco Routers etc..) |
Quote:
|
I tried cacti, but all it did was graphs from what I could tell.
That was a funny reference. |
Generally I'd go Nagios for alerts and Cacti for graphs.
Theoretically Nagios has graphs ( sort of) but I can't recommend them. There are many alert and graphing tools, some of which purport to do both, but as with Prog Langs or Linux distros, there's no 'best'; its a subjective choice. As for my prev comment, it'd be pretty easy to write a simple script to do those basic checks once a day (eg 4am) and email the results. Obviously there's an extensive list of things you could check for, but I'd keep it pared down if I were you; you do want people to actually read them, right? ;) Actually, a good tool that does a lot for you already as far as log check+email goes it the logwatch tool. |
Thanks Chris. I think here it's going to come down to Nagios or OpenNMS, just not sure which one yet. Seems Nagios has been around longer so more people use that one.
|
All times are GMT -5. The time now is 03:34 PM. |