LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Server (https://www.linuxquestions.org/questions/linux-server-73/)
-   -   Server restarting again and again (https://www.linuxquestions.org/questions/linux-server-73/server-restarting-again-and-again-926495/)

raunaq 01-30-2012 08:28 AM

Server restarting again and again
 
Hi all,

I am facing a very peculiar problem from the last two days.
Actually my server is restarting again and again.Here are the specs and the things i already tried.

Centos 5.2 32 bit > OS
/var/log/messages just says system going to HALT.nothing more than this
load average is 1.2 approx
no heating issue
no result in chrootkit
changed smps, ram and applied terminal solution over processor,
No fixed time of shut down.
no cron set for any user.
removed every known host from .ssh/known_hosts.

Really stuck .. Please help..

corp769 01-30-2012 08:44 AM

Could you kindly post the said output directly from your messages file for us?

hamzar.pm 01-31-2012 12:32 AM

Pls check that your server restarting problem is related to any hardware fails..... most probably that is the chance....i had faced so much situations like this this it may be bue to memmory, hdd , or board is failing.....pls check with mcelog....or install that package ... and see the log....

raunaq 01-31-2012 04:13 AM

/var/log/messages
 
shutdown[20224]: shutting down for system halt
shutdown[20253]: shutting down for system halt
avahi-daemon[4370]: Got SIGTERM, quitting.
avahi-daemon[4370]: Leaving mDNS multicast group on interface eth1.IPv6 with address fe80::221:5eff:fec2:854a.
xinetd[4204]: Exiting...
rpc.statd[3883]: Caught signal 15, un-registering and exiting.
auditd[3752]: The audit daemon is exiting.
kernel: audit(1327837590.501:285): audit_pid=0 old=3752 by auid=4294967295
pcscd: pcscdaemon.c:572:signal_trap() Preparing for suicide
pcscd: hotplug_libusb.c:376:HPRescanUsbBus() Hotplug stopped
pcscd: readerfactory.c:1379:RFCleanupReaders() entering cleaning function
pcscd: pcscdaemon.c:532:at_exit() cleaning /var/run
kernel: Kernel logging (proc) stopped.
kernel: Kernel log daemon terminating.
exiting on signal 15
syslogd 1.4.1: restart.
kernel: klogd 1.4.1, log source = /proc/kmsg started.
kernel: Linux version 2.6.18-92.el5PAE (mockbuild@builder16.centos.org) (gcc version 4.1.2 20071124 (Red Hat 4.1.2-42)) #1 SMP Tue

kernel: BIOS-provided physical RAM map:
kernel: BIOS-e820: 0000000000000000 - 000000000009dc00 (usable)
kernel: BIOS-e820: 000000000009dc00 - 00000000000a0000 (reserved)
kernel: BIOS-e820: 00000000000ce000 - 00000000000d0000 (reserved)
kernel: BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
kernel: BIOS-e820: 00000000dfe60000 - 00000000dfe6d000 (ACPI data)
kernel: BIOS-e820: 00000000dfe6d000 - 00000000dfe6e000 (ACPI NVS)
kernel: BIOS-e820: 00000000dff00000 - 00000000e0000000 (reserved)
kernel: BIOS-e820: 00000000f0000000 - 00000000f8000000 (reserved)
kernel: BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved)
kernel: BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
kernel: BIOS-e820: 00000000ff000000 - 0000000100000000 (reserved)
kernel: BIOS-e820: 0000000100000000 - 0000000120000000 (usable)
kernel: 3712MB HIGHMEM available.
kernel: 896MB LOWMEM available.
kernel: found SMP MP-table at 000f6770
kernel: Memory for crash kernel (0x0 to 0x0) notwithin permissible range
kernel: disabling kdump


top>>>>>>>

[root@xyz~]# top

top - 15:48:19 up 50 min, 1 user, load average: 0.00, 0.01, 0.00
Tasks: 160 total, 1 running, 159 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.0%us, 0.2%sy, 0.0%ni, 99.8%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 4150660k total, 747164k used, 3403496k free, 56700k buffers
Swap: 8385920k total, 0k used, 8385920k free, 450424k cached


As mentioned earlier there is no visible hardware issue.No heating disk failure else..

cbtshare 01-31-2012 04:16 AM

try a memory test from bios, or swap out the memory you currently have.

raunaq 01-31-2012 04:21 AM

Thanks for all your replies
 
Quote:

Originally Posted by cbtshare (Post 4589195)
try a memory test from bios, or swap out the memory you currently have.

Will try this out. Thanks guyz for helping me out ..

Reuti 01-31-2012 04:16 PM

So the system is even shutting down in a proper manner before starting up again. A loose connection of a broken power button could do it too. And as it’s a loose connection it’s never “pressed” for 4 seconds for a hard shutdown.


All times are GMT -5. The time now is 05:57 AM.