Nagios/NRPE oddity on similar Centos machines
Hi there,
We have Nagios running on a server, and are installing NRPE as a daemon on web servers we need to monitor. I'm two servers in and have hit a small snag.
The two servers we're currently trying to monitor are, to the best of our knowledge, very similar (Centos 5.4, Apache, MySQL, PHP etc etc), and the installations of NRPE on them should be the same. One is being monitored just fine, however we can't check on the other one without including the -n switch (to disable SSL) in the check_nrpe call. If we don't, however, the error we get is, "CHECK_NRPE: Socket timeout after 10 seconds.". In /var/log/messages on the machine that fails we see:
May 24 17:23:49 ourserver xinetd[23583]: START: nrpe pid=27932 from=123.123.123.123
May 24 17:23:59 ourserver nrpe[27932]: Could not read request from client, bailing out...
May 24 17:23:59 ourserver xinetd[23583]: EXIT: nrpe status=0 pid=27932 duration=10(sec)
The only kind of related thing I can think of that's different between the two machines is that one (the one that works) has actually had an SSL certificate installed on it, for the site it hosts.
Has anyone run into this, or something similar? Any tips would be greatly appreciated.
Cheers,
Toby
|