LinuxQuestions.org


amrit.singh 05-13-2008 10:51 PM

Nagios - Latency issue
 
Hi All,

Greetings.

I am new to Nagios and would like some suggestions on resolving a Nagios latency issue. We have two Nagios servers in our organization, Server 1 and Server 2, and we are seeing latency problems on Server 1. On the performance information page, I see...

<= 1 minute: 796 (11.3%)
<= 5 minutes: 5448 (77.0%)
<= 15 minutes: 7072 (100.0%)

However, it's normal on the second server. When I run top on Server 1, I see that status.cgi is consuming a lot of CPU. I have read many blogs and tested my configuration file, but with no luck. I even tried running ./status.cgi locally and didn't see any errors.

Please do let me know if I need to post any configuration files as well.

Please help !!!!!

unSpawn 05-14-2008 07:20 AM

Quote:

Originally Posted by amrit.singh (Post 3152493)
When we go to the performance information, I see...

<= 1 minute: 796 (11.3%)
<= 5 minutes: 5448 (77.0%)
<= 15 minutes: 7072 (100.0%)

Are you sure you're looking in the right place? Unless the Nagios interface has changed radically, these numbers are meaningless without the table heading. I think you're looking at the time frame / checks completed fields, and the info you posted shows that Nagios completed one hundred percent of its checks. If you want to see latency info, you have to look at the tables with the "Metric" headers.
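
As a quick sanity check on that reading, here is a minimal Python sketch that recomputes the percentages from the posted counts. It assumes the 7072 in the 100.0% row is the total number of checks; the names `completed` and `completion_share` are made up for illustration.

```python
# Counts copied from the post; the total is assumed to be the 100.0% row.
completed = {"<= 1 minute": 796, "<= 5 minutes": 5448, "<= 15 minutes": 7072}
total = completed["<= 15 minutes"]


def completion_share(count, total):
    """Return the percentage of checks completed, rounded to one decimal."""
    return round(100.0 * count / total, 1)


shares = {frame: completion_share(n, total) for frame, n in completed.items()}
for frame, pct in shares.items():
    print(f"{frame}: {pct}%")
```

The recomputed shares match the posted 11.3%, 77.0% and 100.0%, which is consistent with these rows being cumulative counts of completed checks rather than latency figures.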


Quote:

Originally Posted by amrit.singh (Post 3152493)
However, it's normal on the second server.

And what is "normal"?


Quote:

Originally Posted by amrit.singh (Post 3152493)
I couldn't see any error.

Since you have two Nagios servers, you could diff them to find differences in dependencies (Perl?), the Nagios version itself, the resource polling configuration, plugins, and network conditions, and also check the logs.
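
One way to do the configuration part of that comparison is sketched below in Python with the standard `difflib` module. This is only a sketch under assumptions: the helper names `normalize` and `diff_configs` are hypothetical, the sample settings are invented and not taken from either server, and it treats only lines starting with `#` as comments.

```python
import difflib


def normalize(config_text):
    """Drop blank lines and '#' comment lines so only settings are compared."""
    lines = []
    for raw in config_text.splitlines():
        line = raw.strip()
        if line and not line.startswith("#"):
            lines.append(line)
    return lines


def diff_configs(text_a, text_b, name_a="server1", name_b="server2"):
    """Return unified-diff lines between two normalized config texts."""
    return list(difflib.unified_diff(
        normalize(text_a), normalize(text_b),
        fromfile=name_a, tofile=name_b, lineterm=""))


# Hypothetical sample contents, not taken from the thread.
server1_cfg = "# main config\nmax_concurrent_checks=0\nservice_reaper_frequency=10\n"
server2_cfg = "max_concurrent_checks=0\n\nservice_reaper_frequency=5\n"

for line in diff_configs(server1_cfg, server2_cfg):
    print(line)
```

In practice you would read the real files from both servers into `text_a` and `text_b`; an empty result means the effective settings are identical.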

amrit.singh 05-14-2008 09:26 PM

Thank you for your reply.....

Following your suggestion about looking for latency in the right place, this is what we see under Metric:

Metric                 Min.       Max.       Average
Check Execution Time:  0.04 sec   26.29 sec  0.282 sec
Check Latency:         70.34 sec  91.16 sec  78.911 sec
Percent State Change:  0.00%      42.04%     0.05%

So there is clearly a problem somewhere. I have checked the configuration files and everything seems to be fine.

Nagios and the plugins are the same version on both servers. I would like to check the versions of the dependencies once again, but I am fairly sure they are also the same on both servers. I have also checked the Nagios and httpd logs, and they show no related entries.

I will still check what you suggested and post the results. In the meantime, please let me know if you want me to post any specific config files or logs.

Thanks again for your help.

unSpawn 05-15-2008 05:04 AM

You've only shown one set of metrics, and I don't know whether it refers to services or hosts. Your check latency is over one minute, which is kind of high. For some reason Nagios is running its service and host checks well behind schedule. See if you can trace it back to specific services or hosts, and maybe also see http://nagios.sourceforge.net/docs/$VERSION/tuning.html, where $VERSION is 2_0 or 3_0 depending on your version of Nagios.
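
To trace the latency back to specific services or hosts, something like the following Python sketch could rank them by check latency. It rests on assumptions: that the status data is made of `name { key=value ... }` blocks carrying `host_name`, `service_description` and `check_latency` fields, which may differ between Nagios versions. The function names and the sample data are hypothetical.

```python
def parse_latencies(status_text):
    """Collect (host, service, latency) from 'name { key=value ... }' blocks."""
    results = []
    block = None
    for raw in status_text.splitlines():
        line = raw.strip()
        if line.endswith("{"):
            block = {}  # start of a new status block
        elif line == "}" and block is not None:
            if "check_latency" in block:
                results.append((
                    block.get("host_name", "?"),
                    block.get("service_description", "(host check)"),
                    float(block["check_latency"]),
                ))
            block = None
        elif block is not None and "=" in line:
            key, _, value = line.partition("=")
            block[key.strip()] = value.strip()
    return results


def worst_latencies(status_text, top=5):
    """Return the `top` entries with the highest check latency."""
    ranked = sorted(parse_latencies(status_text), key=lambda r: r[2], reverse=True)
    return ranked[:top]


# Made-up sample in the assumed layout; hosts and values are illustrative only.
sample = """
hoststatus {
    host_name=web01
    check_latency=12.5
    }
servicestatus {
    host_name=db01
    service_description=Disk
    check_latency=88.0
    }
servicestatus {
    host_name=web01
    service_description=HTTP
    check_latency=40.25
    }
"""

for host, service, latency in worst_latencies(sample, top=2):
    print(f"{host} / {service}: {latency:.2f} sec")
```

If a handful of hosts or services dominate the top of the list, those are the ones to look at first; if the latency is spread evenly, the tuning document above is probably the better starting point.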


All times are GMT -5.