Linux - ServerThis forum is for the discussion of Linux Software used in a server related context.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
I am constantly receiving this below message from Nagios to my email about 10 or more times a day..
Is this something I need to worry about? anyway to reduce the amount of times i get the email?
Thanks
Code:
[10-18-2016 09:03:18] SERVICE ALERT: localhost;Total Processes;WARNING;SOFT;3;PROCS WARNING: 252 processes with STATE = RSZDT
Service Warning[10-18-2016 09:02:18] SERVICE ALERT: localhost;Total Processes;WARNING;SOFT;2;PROCS WARNING: 253 processes with STATE = RSZDT
Service Warning[10-18-2016 09:01:18] SERVICE ALERT: localhost;Total Processes;WARNING;SOFT;1;PROCS WARNING: 255 processes with STATE = RSZDT
I read that forum from nagios but couldnt understand what exactly this error indicates or what I should do to resolve it.. they said to increase the warning threshold but not sure how to do that.. any idea what this error means?
If you don't understand the alert or what you should do to resolve it, why bother configuring it?
That alert is the total count of processes that are running, and what's normal for your server can vary VASTLY. For example, there will be a number of worker processes that will increase depending on the number of CPUs/cores that the hardware has.
As you've provided no information on your server spec or what it's running there's no clue to what would be normal for it.
Just a few minutes' Google searching will show you what the various letters that make up "RSZDT" means, and likewise what this Nagios filter's settings mean.
(The letters correspond to specific bits in the wait-reason mask ...)
I don't find this particular setting very useful because, as others have said, the "total number of processes" can vary wildly from moment to moment, depending on exactly what your server is doing and for whom. Nagios installations customarily include it, but I customarily turn it off.
What is of more concern to me would be specific bottlenecks. A very large number of processes that are waiting on a particular thing. A large number of processes in a non-idle wait state, indicating that they are otherwise ready to proceed but cannot. Not just waiting for "any" thing, but waiting for "very specific" things.
I suggest that it is fairly useless (but, very common) to configure Nagios to "squawk" about things that might be an issue. Instead, you should formulate very clear notions of exactly what sort of thing would require your intervention or at least your attention, and configure Nagios to vigilantly look for that.
If Nagios is, in effect, "spamming your mailbox" about things that do not cause you to immediately take a specific action, then it is ... well ... just "spamming your mailbox," and you should stop it. You've got an electronic "Boy Who Cries 'Wolf!'"
Last edited by sundialsvcs; 10-18-2016 at 09:47 AM.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.