Server is Pinging But Not Connecting

devUnix · 03-05-2013, 04:14 AM

Hi,

Let's say we have a Linux / UNIX server, example1.com, which responds to a ping request, but when we try to connect to it through ssh or telnet, it does not respond. Similarly, we have a Windows server, example2.com, which responds to a ping request successfully (0% packet loss), but we can't connect to it through RDP.

Does it mean the servers are in a hung state? Can a server still respond (successfully) to a ping request while it is dying?

If yes, then consider a production environment in which we have 100 UNIX and Windows servers and we want to make sure that they are alive. I usually write a shell script which pings each of them; if a host replies, I ignore that node, otherwise an email alert is sent to the admin / concerned group.

(This is critical because we have other services such as Databases running on the servers and if the servers are pinging but are not actually functioning properly then the services are likely to be impacted.)

acid_kewpie · 03-05-2013, 04:57 AM

A half dead server can certainly often respond to ICMP but not open TCP connections, but there's no guarantee that this is what is happening here. Maybe the ssh service itself has just frozen, or a firewall rule was changed.

pan64 · 03-05-2013, 06:20 AM

Imagine: a network card on its own, without a running OS, can answer a ping request, so ping is not reliable for this purpose. Also, a firewall can block any port or protocol. If you want to be sure, I suggest you create and install your own health check service on all your hosts and ask that service about the state.

TenTenths · 03-05-2013, 06:39 AM

Quote:

Originally Posted by devUnix

consider a production environment in which we have 100 UNIX and Windows servers and we want to make sure that they are alive. I usually write a shell script which pings each of them; if a host replies, I ignore that node, otherwise an email alert is sent to the admin / concerned group.

Just checking ping is nowhere near enough to determine whether a host is down or not. As previous posters have indicated, a host can reply to pings while other services are impacted. Conversely, firewall configurations could prevent the host from responding to pings while other services are unaffected.

You need to define what services you are expecting on each of your hosts and check them accordingly.

Rather than a single script, you might want to consider a monitoring suite. My personal preference is Nagios, but I'm sure others will have their own opinions.
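
To illustrate checking the expected services rather than ping, here is a minimal Python sketch that tries a TCP connection to each service port a host is supposed to offer. The `EXPECTED` table, the function names and the 3-second timeout are assumptions made up for this example; the host names and services (ssh on example1.com, RDP on example2.com) come from the first post.

```python
import socket

# Hypothetical inventory: the service ports each host is expected to serve.
EXPECTED = {
    "example1.com": [22],    # ssh
    "example2.com": [3389],  # RDP
}

def tcp_port_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port completes within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution failures.
        return False

def failed_checks(expected):
    """Return the (host, port) pairs that did not accept a connection."""
    return [
        (host, port)
        for host, ports in expected.items()
        for port in ports
        if not tcp_port_open(host, port)
    ]

def main():
    # Print one alert line per failed service; mailing the admin is left out.
    for host, port in failed_checks(EXPECTED):
        print(f"ALERT: {host}:{port} is not accepting connections")
```

Calling `main()` from cron would be one way to use it. Note that a completed TCP handshake only shows that something is listening on the port; a frozen service may still accept the connection, so this is a stronger signal than ping but not proof that the service works.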

devUnix · 03-05-2013, 06:40 AM

Quote:

Originally Posted by pan64

create and install your own health check service on all your hosts and ask that service about the state.

To make sure I got your point exactly, let me put it this way:

I put a script on server_1, server_2, and server_n, and the script on each of these servers writes a log to some common place, let's say logging_server:/var/log/myLogs/servers_health.log. Then I check whether I get logs/answers from all the named servers or not?

pan64 · 03-05-2013, 06:56 AM

No, not really. I mean a small daemon process that listens on a given port and replies to the central host, something like "are you ok?" "yes/no/whatever".
On each host the daemon knows how to check whether that host is working well, and it runs some tests periodically.

But also you can try to write a log on a common filesystem and check those logs...
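
A rough Python sketch of such a daemon, under stated assumptions: the line-based "are you ok?" / "yes" / "no" exchange follows the description above, but the concrete checks (free disk space on `/`, load average), their thresholds and all names are invented for illustration. It also runs the checks on each request instead of periodically, and `os.getloadavg()` exists only on Unix-like systems, so the Windows hosts would need different checks.

```python
import os
import shutil
import socketserver

PORT = 9563  # arbitrary choice; any port that is free on your hosts will do

def host_is_healthy() -> bool:
    """Hypothetical local checks; replace with whatever matters on this host."""
    usage = shutil.disk_usage("/")
    disk_ok = usage.free / usage.total > 0.05  # more than 5% free on /
    load_ok = os.getloadavg()[0] < 4 * (os.cpu_count() or 1)  # 1-minute load
    return disk_ok and load_ok

class HealthHandler(socketserver.StreamRequestHandler):
    timeout = 5  # seconds; drop clients that connect but never send a line

    def handle(self):
        try:
            self.rfile.readline()  # the question text itself is ignored
            self.wfile.write(b"yes\n" if host_is_healthy() else b"no\n")
        except OSError:
            pass  # client timed out or went away; nothing to answer

class HealthServer(socketserver.ThreadingTCPServer):
    allow_reuse_address = True
    daemon_threads = True

def main():
    # Serve until the process is killed.
    with HealthServer(("0.0.0.0", PORT), HealthHandler) as server:
        server.serve_forever()
```

Running `main()` under the init system of each host would keep the daemon up. The point of the design is that the answer comes from a user-space process on the host itself, which a bare ping reply does not tell you.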

devUnix · 03-05-2013, 07:03 AM

Quote:

Originally Posted by acid_kewpie

A half dead server can certainly often respond to ICMP but not open TCP connections

How do we make sure that a particular port is open only when the server is alive?

Quote:

Originally Posted by pan64

a small daemon process that listens on a given port and replies to the central host, something like "are you ok?" "yes/no/whatever".
On each host the daemon knows how to check whether that host is working well, and it runs some tests periodically.

What port number do you suggest?

pan64 · 03-05-2013, 07:34 AM

Quote:

Originally Posted by devUnix

How do we make sure that a particular port is open only when the server is alive?

Checking that all the other ports are closed is expensive, but nmap can do that for you.

Quote:

Originally Posted by devUnix

What port number do you suggest?

I suggest you select any port that does not conflict with your configuration; 9563 or 34712 would do, for example.
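
For the central host's side, a minimal Python sketch of asking a health daemon on the chosen port could look like this. The one-line "are you ok?" protocol, the function name `ask_health`, the 3-second timeout and the `HOSTS` list are all assumptions for illustration; port 9563 is just one of the numbers mentioned above.

```python
import socket

# Hypothetical list of hosts to poll.
HOSTS = ["example1.com", "example2.com"]

def ask_health(host: str, port: int = 9563, timeout: float = 3.0) -> str:
    """Return the daemon's one-line answer, or a marker string on failure."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.sendall(b"are you ok?\n")
            line = sock.makefile("rb").readline()
    except OSError:
        # Refused, timed out, or the name did not resolve.
        return "unreachable"
    answer = line.decode("ascii", "replace").strip()
    return answer or "no answer"  # connection closed without a reply

def main():
    # Anything other than "yes" is treated as a reason to alert.
    for host in HOSTS:
        status = ask_health(host)
        if status != "yes":
            print(f"ALERT: {host} reported {status!r}")
```

Treating everything except an explicit "yes" as a failure keeps the poller on the safe side: an unreachable host, a silent daemon and a "no" all end up in the same alert path.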

TenTenths · 03-05-2013, 07:43 AM

Quote:

Originally Posted by pan64

No, not really. I mean a small daemon process that listens on a given port and replies to the central host, something like "are you ok?" "yes/no/whatever".

Why re-invent something that already exists? Most distros now come with an extendable SNMP daemon.

acid_kewpie · 03-05-2013, 07:48 AM

Quote:

Originally Posted by TenTenths

Why re-invent something that already exists? Most distros now come with an extendable SNMP daemon.

Yeah, I'm baffled at the discussion too. There are loads of monitoring solutions that already exist in all sorts of forms.

chrism01 · 03-05-2013, 07:29 PM

As the others said: if this is a serious production question, get a monitoring tool, e.g. Nagios, Zabbix, Zenoss, OpenNMS, etc.
I wouldn't write my own for 100 systems.

Re ping: that just tests the network connection to the remote host's network stack.
It tells you nothing about the state of the rest of the system or its services.

devUnix · 03-05-2013, 11:57 PM

Thanks to all of you for your inputs!