I have a Dell PowerEdge server running Red Hat 9 that serves a fair amount of traffic. It ran well for several months, but recently it has started occasionally losing its network connection.
Actually, as far as I can tell, the machine doesn't recognise any problem with the connection; it just stops working and can no longer ping the gateway (although the gateway is up, running fine and pingable from the outside). It is on a persistent 10 Mbps Ethernet connection, and the NIC continues to show lights, but the packets just seem to stop getting through.
Oddly enough, doing a 'service network restart' doesn't fix the problem; only a full system restart does. I've tried switching cables and using the other NIC (there are two on the machine), but the problem persists. So I've got an ugly hack in place right now: a cron job runs once per hour and restarts the whole machine if it can't ping the gateway. Obviously far from ideal.
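For anyone who wants to picture the stopgap, a watchdog of this kind could look roughly like the sketch below. It is not the exact script on the server: the gateway address, log path, and variable names are placeholders, and it assumes a Linux ping that accepts -c and -w. It also saves some interface and kernel output before rebooting, so there is something to inspect afterwards.

```shell
#!/bin/sh
# Hypothetical watchdog sketch: reboot only if the gateway cannot be pinged.
# All values below are placeholders, not the real settings from the server.
GATEWAY="${GATEWAY:-192.0.2.1}"                    # placeholder gateway address
LOGFILE="${LOGFILE:-/var/log/net-watchdog.log}"    # placeholder log path
PING_CMD="${PING_CMD:-ping -c 3 -w 10}"            # 3 probes, 10 s deadline (assumed flags)
REBOOT_CMD="${REBOOT_CMD:-/sbin/shutdown -r now}"  # action taken on failure

# Succeeds (exit status 0) if the gateway answers.
gateway_reachable() {
    $PING_CMD "$GATEWAY" >/dev/null 2>&1
}

# Does nothing while the gateway answers; otherwise saves some state
# to the log file and then runs the reboot command.
watchdog() {
    if gateway_reachable; then
        return 0
    fi
    {
        echo "=== $(date): gateway $GATEWAY unreachable ==="
        /sbin/ifconfig -a 2>&1 || true   # interface state and error counters
        dmesg 2>&1 | tail -n 50          # most recent kernel messages
    } >>"$LOGFILE"
    $REBOOT_CMD
}
```

To use it, add a final line that calls `watchdog` and schedule the script from root's crontab, for example with an entry such as `0 * * * * /usr/local/sbin/net-watchdog.sh` (the path is made up). Keeping the ping and reboot commands in variables makes it possible to dry-run the script with `REBOOT_CMD="echo would reboot"` before trusting it.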
Anyway, I'm primarily a programmer, so my system administration skills are pretty thin. I suppose what I need to do is start scouring the logs, but I'm not really sure which ones I should look at. Any guidance would be greatly appreciated.
Thanks for your time,