LinuxQuestions.org
Latest LQ Deal: Latest LQ Deals
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - General
User Name
Password
Linux - General This Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.

Notices


Reply
  Search this Thread
Old 09-25-2008, 04:57 AM   #1
Kandahar
LQ Newbie
 
Registered: Sep 2008
Posts: 3

Rep: Reputation: 0
Server crashing once a day. CentOS 4.4


Hello,
I am having problems on a server which is constantly crashing once a day. (Some times it lasts two but rarely)

Its been going on for 2 weeks now and its the first problems I've had with this box for 2 years.
I run a network of Windows boxes because of the software we use here, so when it comes to Linux I completely fall over and don't know what to do to help track down this issue.

I am running top currently but nothing looks out of the ordinary.
Is there a way to monitor so I know when something spikes?
The load is ALWAYS below 1. So if anything goes higher then it should be that process.


When it crashes... It basically stays connected (I have a KVM/IP connected so I could post screenshots next time it happens - Probably tomorrow) - However you can't connect via SSH and all websites stop responding on it.
I think it responds to ping.

Any help will be very helpful. This is a web box so having it go down once a day is terribly dammaging.

Thanks
 
Old 09-25-2008, 05:45 AM   #2
dyasny
Member
 
Registered: Dec 2007
Location: Canada
Distribution: RHEL,Fedora
Posts: 995

Rep: Reputation: 115Reputation: 115
/var/log/messages should have something

do you see any kernel panics or ooopses when the server crashes?
 
Old 09-25-2008, 06:32 AM   #3
Kandahar
LQ Newbie
 
Registered: Sep 2008
Posts: 3

Original Poster
Rep: Reputation: 0
Here are things which jump out at me as being suspicious.

Code:
Sep 21 09:42:57 www sshd(pam_unix)[7295]: authentication failure; logname= uid=0 euid=0 tty=ssh ruser= rhost=196-209-58-100-rrba-esr-3.dynamic.isadsl.co.za 
Sep 21 09:42:58 www sshd(pam_unix)[7336]: check pass; user unknown
Sep 21 09:42:58 www sshd(pam_unix)[7336]: authentication failure; logname= uid=0 euid=0 tty=ssh ruser= rhost=196-209-58-100-rrba-esr-3.dynamic.isadsl.co.za 
Sep 21 09:43:01 www sshd(pam_unix)[7352]: check pass; user unknown
Sep 21 09:43:01 www sshd(pam_unix)[7352]: authentication failure; logname= uid=0 euid=0 tty=ssh ruser= rhost=196-209-58-100-rrba-esr-3.dynamic.isadsl.co.za 
Sep 21 09:43:02 www sshd(pam_unix)[7366]: check pass; user unknown
Lots and lots of them..



And then this happens:

Code:
Sep 21 09:43:39 www psa: Stopping Plesk:  succeeded
Sep 21 09:43:39 www named:  succeeded
Sep 21 10:43:39 www named[3214]: shutting down: flushing changes
Sep 21 10:43:39 www named[3214]: stopping command channel on 127.0.0.1#953
Sep 21 10:43:40 www named[3214]: no longer listening on 127.0.0.1#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: no longer listening on 77.74.1XX.XXX#53
Sep 21 10:43:40 www named[3214]: exiting
More things like this:

Code:
Sep 22 12:57:18 www sshd(pam_unix)[718]: authentication failure; logname= uid=0 euid=0 tty=ssh ruser= rhost=s030.silver.fastwebserver.de 
Sep 22 12:57:18 www sshd(pam_unix)[715]: check pass; user unknown
Sep 22 12:57:18 www sshd(pam_unix)[715]: authentication failure; logname= uid=0 euid=0 tty=ssh ruser= rhost=s030.silver.fastwebserver.de 
Sep 22 12:57:19 www sshd(pam_unix)[723]: check pass; user unknown

I have the messages file hosted if you would like to look at it.
I can PM you the link.
 
Old 09-25-2008, 06:47 AM   #4
Kandahar
LQ Newbie
 
Registered: Sep 2008
Posts: 3

Original Poster
Rep: Reputation: 0
This also:

Quote:
"Sep 25 08:39:54 www kernel: Out of Memory: Killed process 32254 (httpd)."
 
Old 09-25-2008, 07:06 AM   #5
salasi
Senior Member
 
Registered: Jul 2007
Location: Directly above centre of the earth, UK
Distribution: SuSE, plus some hopping
Posts: 4,070

Rep: Reputation: 897Reputation: 897Reputation: 897Reputation: 897Reputation: 897Reputation: 897Reputation: 897
Once you've got to this point:
Quote:
Sep 21 10:43:39 www named[3214]: stopping command channel on 127.0.0.1#953
(i.e., you are shutting down loopback#953), you can expect things to go wrong. And they do...

It looks as if this stops named working (surprise, surprise) and things get worse from there on in. I'm guessing the 77.74.1XX.XXX#53 is a legitimate name server for you to be trying to use for external name resolution, but when you do try it you get confused about what to do with the response, because the data normally flows along loopback and that isn't now working.

I'm sure I have known what Plesk is (but my feeble memory needs an extra stick of ram), but is it possible that:
Quote:
Sep 21 09:43:39 www psa: Stopping Plesk: succeeded
is the root cause?

Eventually
Quote:
"Sep 25 08:39:54 www kernel: Out of Memory: Killed process 32254 (httpd)."
it looks as if all memory becomes exhausted and the box has no option but to start shutting things down.
 
Old 09-25-2008, 09:16 PM   #6
chrism01
LQ Guru
 
Registered: Aug 2004
Location: Sydney
Distribution: Rocky 9.2
Posts: 18,358

Rep: Reputation: 2751Reputation: 2751Reputation: 2751Reputation: 2751Reputation: 2751Reputation: 2751Reputation: 2751Reputation: 2751Reputation: 2751Reputation: 2751Reputation: 2751
Re those attempted ssh logins:
uid=0 euid=0
means an attempt as root user. I hope you've disabled root login via ssh (see sshd config). There are thousands of bots on the net trying to break in that way. Always login as normal user and su or sudo up as needed.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
FMS Interactive 3.0 Linux Crashing on CentOS 5 Kernel 2.6.18-53.1.14.el5PAE oldscratch Linux - Software 2 01-12-2010 06:45 AM
LXer: CentOS Directory Server On CentOS 5.2 LXer Syndicated Linux News 0 08-06-2008 09:20 PM
Server keeps crashing yepp Linux - Enterprise 7 11-08-2005 07:39 AM
My system is crashing 5 times a day! Is linux really more stable than windows? little_penguin Linux - Software 25 07-29-2005 04:15 AM
MY server keeps crashing and I don't know why... Electrode Linux - General 6 07-06-2003 10:53 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - General

All times are GMT -5. The time now is 02:56 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration