LinuxQuestions.org
Review your favorite Linux distribution.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Server
User Name
Password
Linux - Server This forum is for the discussion of Linux Software used in a server related context.

Notices


Reply
  Search this Thread
Old 08-06-2009, 04:53 AM   #1
lin*x
LQ Newbie
 
Registered: Aug 2009
Posts: 15

Rep: Reputation: 1
Weird Nagios server issue? - It seems to just stop running


Hi All,
I've got a weird nagios issue that has just re-occurred for a second time. The first time round was about a week ago now, and all that seemed to happen was everything stayed running ie... a 'ps -ef| grep nagios' showed that the nagios process was still running and the nagios web GUI was still running, and NDO looked to still be running, the backed database looked to be working fine. However in the GUI it show for most services the last check was several hours ago.

Now the services checks are set to be actively monitored so they should be getting checked. The nagios scheduling queue shows lots of checks awaiting in the queue all for around the same time the rest of the status of all other services and hosts was last checked, so it looks like its basically just been paused in a round about way.

Also nagios seems un-responsive when sending commands to it VIA the web GUI of such things as forcing a re-schedule. Restarting the nagios process and stop starting of active checks. It looks as though nagios take no notice of what I tell it to do from the GUI but doesn't show any errors in any logs anywhere what so ever.

I've seen in some areas it could be related to NDO problems, but i'm running a powerful box talking hp BL460 blade system, which should easily cover resource requirements, considering i'm not exactly monitoring as many things that most other people seem to be monitoring when they are talking about NDO issues.

Nagios 3.0.6 so not exactly over the hills old, OS SLES 10.2. Mysql 5.0.26 and NDO was the latest verison available which I think is only a beta from years ago...

Thanks for any help if anyone can.
Cheers,
M

EDIT:-
The only way it seems to make it recover is by stopping the nagios process and then also stopping the NDO process, if I just stop and start the nagios process manually this makes no difference so there must be an issue with NDO that i'm having.

Last edited by lin*x; 08-06-2009 at 05:23 AM.
 
Old 09-21-2009, 03:29 AM   #2
mrabris
LQ Newbie
 
Registered: Sep 2009
Location: Sweden
Posts: 1

Rep: Reputation: 0
Hi,

We have almost the same problem with our Nagios(ver 3.0a4), Mysql(4.1.22)

It stops updating the services/host status. In the Nagios GUI everything seams to be up and running but when you take a closer look at the service "last check" time it’s an old timestamp.

If you are using Smistat you can also see short gaps in the graphs before Nagios enters this weird mode.

As a QDS we have we added a script in cron which sends us a mail when the nagios process indicated not to be running.(/etc/init.d/nagios status). After a #/etc/init.d/nagios start
Everything workes fine again.

Please let me know if you have any solution/fix for this problem?

Thanks
/M
 
Old 09-21-2009, 06:29 AM   #3
lin*x
LQ Newbie
 
Registered: Aug 2009
Posts: 15

Original Poster
Rep: Reputation: 1
Hi There,

I never found a solution as such... the problems i was seeing were related to the NDO and stuff which has recently been under going new development. However i've updated my Nagios and NDO to the latest available versions and all has been fine. I suggest you guys certainly update as your currently running on a version thats an alpha release.. so non-stable. Its difficult to help out when your running a version that is not marked as stable so i suggest an upgrade will help you. Its a simple and easy process anyway.

I did the same as you though during the period i was having these problems... but it never got used because it hasn't failed for ages now which is great. I can't say what it is specifically thats sorted it out but latest NDO and latest nagios versions seem to be working well so far.

Hope this helps,
Regards,
Mark
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Nagios 3.1.2 + RHEL 5.3 You don't have permission to access /nagios/ on this server psix Linux - Server 13 08-04-2015 02:25 AM
Can't access Windows. Weird, weird grub issue. MightyHard Linux - General 2 12-31-2008 04:35 PM
start und stop windows service using nagios cccc Debian 1 03-11-2008 07:43 AM
installing first linux my PClinuxOS livecd stop running at fatal server error agussuwarso Linux - Newbie 2 03-09-2008 09:19 AM
weird server issue Nic-MDKman Linux - Security 1 04-26-2004 04:46 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Server

All times are GMT -5. The time now is 07:44 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration