LinuxQuestions.org - Server restarting again and again

Server restarting again and again

Hi all,

I have been facing a very peculiar problem for the last two days.
My server keeps restarting again and again. Here are the specs and the things I have already tried.

OS: CentOS 5.2, 32-bit
/var/log/messages just says the system is going to halt, nothing more than this
Load average is approx. 1.2
No heating issue
No results from chkrootkit
Changed the SMPS (power supply) and RAM, and applied thermal paste to the processor
No fixed time of shutdown
No cron jobs set for any user
Removed every known host from .ssh/known_hosts

Really stuck. Please help.

Could you kindly post the said output directly from your messages file for us?

Please check whether your server's restarting problem is related to a hardware failure; that is the most likely cause. I have faced many situations like this, and it may be due to failing memory, HDD, or motherboard. Please check with mcelog (install the package if it is missing) and see the log.

/var/log/messages

shutdown[20224]: shutting down for system halt
shutdown[20253]: shutting down for system halt
avahi-daemon[4370]: Got SIGTERM, quitting.
avahi-daemon[4370]: Leaving mDNS multicast group on interface eth1.IPv6 with address fe80::221:5eff:fec2:854a.
xinetd[4204]: Exiting...
rpc.statd[3883]: Caught signal 15, un-registering and exiting.
auditd[3752]: The audit daemon is exiting.
kernel: audit(1327837590.501:285): audit_pid=0 old=3752 by auid=4294967295
pcscd: pcscdaemon.c:572:signal_trap() Preparing for suicide
pcscd: hotplug_libusb.c:376:HPRescanUsbBus() Hotplug stopped
pcscd: readerfactory.c:1379:RFCleanupReaders() entering cleaning function
pcscd: pcscdaemon.c:532:at_exit() cleaning /var/run
kernel: Kernel logging (proc) stopped.
kernel: Kernel log daemon terminating.
exiting on signal 15
syslogd 1.4.1: restart.
kernel: klogd 1.4.1, log source = /proc/kmsg started.
kernel: Linux version 2.6.18-92.el5PAE (mockbuild@builder16.centos.org) (gcc version 4.1.2 20071124 (Red Hat 4.1.2-42)) #1 SMP Tue

kernel: BIOS-provided physical RAM map:
kernel: BIOS-e820: 0000000000000000 - 000000000009dc00 (usable)
kernel: BIOS-e820: 000000000009dc00 - 00000000000a0000 (reserved)
kernel: BIOS-e820: 00000000000ce000 - 00000000000d0000 (reserved)
kernel: BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
kernel: BIOS-e820: 00000000dfe60000 - 00000000dfe6d000 (ACPI data)
kernel: BIOS-e820: 00000000dfe6d000 - 00000000dfe6e000 (ACPI NVS)
kernel: BIOS-e820: 00000000dff00000 - 00000000e0000000 (reserved)
kernel: BIOS-e820: 00000000f0000000 - 00000000f8000000 (reserved)
kernel: BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved)
kernel: BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
kernel: BIOS-e820: 00000000ff000000 - 0000000100000000 (reserved)
kernel: BIOS-e820: 0000000100000000 - 0000000120000000 (usable)
kernel: 3712MB HIGHMEM available.
kernel: 896MB LOWMEM available.
kernel: found SMP MP-table at 000f6770
kernel: Memory for crash kernel (0x0 to 0x0) notwithin permissible range
kernel: disabling kdump
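
The excerpt above shows an orderly stop (shutdown announces a halt, daemons receive SIGTERM) followed by a fresh syslogd start. A minimal Python sketch for pulling those markers out of a longer log is below. It assumes the shutdown line always has the form seen in this excerpt and that a syslogd "restart" line marks the next boot; the function name find_shutdown_events is made up for illustration.

```python
import re

# Matches the shutdown line seen in the excerpt, e.g.
# "shutdown[20224]: shutting down for system halt"
SHUTDOWN_RE = re.compile(r"shutdown\[(\d+)\]: shutting down for system (\w+)")


def find_shutdown_events(lines):
    """Return (line_number, kind, detail) tuples for shutdown and boot markers.

    Assumption: a line containing both "syslogd" and "restart" marks the
    point where logging started again after the machine came back up.
    """
    events = []
    for number, line in enumerate(lines, start=1):
        match = SHUTDOWN_RE.search(line)
        if match:
            events.append((number, "shutdown", match.group(2)))
        elif "syslogd" in line and "restart" in line:
            events.append((number, "boot", ""))
    return events


def scan_log(path="/var/log/messages"):
    """Scan a log file on disk; the default path is the one used in this thread."""
    with open(path, errors="replace") as handle:
        return find_shutdown_events(handle)
```

Calling scan_log() as root would list every clean shutdown and every later logging restart by line number. If each restart is preceded by a shutdown line, something is requesting the halt, which points away from a sudden power loss or a kernel panic.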

top output:

[root@xyz~]# top

top - 15:48:19 up 50 min, 1 user, load average: 0.00, 0.01, 0.00
Tasks: 160 total, 1 running, 159 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.0%us, 0.2%sy, 0.0%ni, 99.8%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 4150660k total, 747164k used, 3403496k free, 56700k buffers
Swap: 8385920k total, 0k used, 8385920k free, 450424k cached

As mentioned earlier, there is no visible hardware issue: no overheating, no disk failure, or anything else.

Try a memory test from the BIOS, or swap out the memory you currently have.

Thanks for all your replies

Quote:

Originally Posted by cbtshare (Post 4589195)

Try a memory test from the BIOS, or swap out the memory you currently have.

Will try this out. Thanks, guys, for helping me out.

So the system is even shutting down in a proper manner before starting up again. A loose connection or a broken power button could do it too. And as it’s a loose connection, the button is never “pressed” for the 4 seconds needed for a hard shutdown.
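
One way to test the power-button theory is to look for button events in the ACPI daemon's log. The Python sketch below assumes that acpid is running, that it logs to a file such as /var/log/acpid, and that power-button events contain the string "button/power". All of this is an assumption about this particular machine, and the sample line in the code is only illustrative.

```python
def power_button_events(lines):
    """Return the log lines that mention an ACPI power-button event.

    Assumption: acpid records such events with the text "button/power".
    """
    return [line.rstrip("\n") for line in lines if "button/power" in line]


def scan_acpid_log(path="/var/log/acpid"):
    """Scan the acpid log; the default path is an assumption, adjust as needed."""
    with open(path, errors="replace") as handle:
        return power_button_events(handle)


# Hypothetical sample lines, not taken from the poster's server.
sample = [
    'received event "button/power PWRF 00000080 00000001"\n',
    "client connected from 4370[0:0]\n",
]
hits = power_button_events(sample)
```

If the timestamps of such events line up with the "shutting down for system halt" entries in /var/log/messages, a flaky power button or its cable becomes a likely suspect. Unplugging the front-panel power switch lead from the motherboard for a day would be a simple cross-check.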