Server restarting again and again
Hi all,
I am facing a very peculiar problem from the last two days. Actually my server is restarting again and again.Here are the specs and the things i already tried. Centos 5.2 32 bit > OS /var/log/messages just says system going to HALT.nothing more than this load average is 1.2 approx no heating issue no result in chrootkit changed smps, ram and applied terminal solution over processor, No fixed time of shut down. no cron set for any user. removed every known host from .ssh/known_hosts. Really stuck .. Please help.. |
Could you kindly post the said output directly from your messages file for us?
|
Pls check that your server restarting problem is related to any hardware fails..... most probably that is the chance....i had faced so much situations like this this it may be bue to memmory, hdd , or board is failing.....pls check with mcelog....or install that package ... and see the log....
|
/var/log/messages
shutdown[20224]: shutting down for system halt
shutdown[20253]: shutting down for system halt avahi-daemon[4370]: Got SIGTERM, quitting. avahi-daemon[4370]: Leaving mDNS multicast group on interface eth1.IPv6 with address fe80::221:5eff:fec2:854a. xinetd[4204]: Exiting... rpc.statd[3883]: Caught signal 15, un-registering and exiting. auditd[3752]: The audit daemon is exiting. kernel: audit(1327837590.501:285): audit_pid=0 old=3752 by auid=4294967295 pcscd: pcscdaemon.c:572:signal_trap() Preparing for suicide pcscd: hotplug_libusb.c:376:HPRescanUsbBus() Hotplug stopped pcscd: readerfactory.c:1379:RFCleanupReaders() entering cleaning function pcscd: pcscdaemon.c:532:at_exit() cleaning /var/run kernel: Kernel logging (proc) stopped. kernel: Kernel log daemon terminating. exiting on signal 15 syslogd 1.4.1: restart. kernel: klogd 1.4.1, log source = /proc/kmsg started. kernel: Linux version 2.6.18-92.el5PAE (mockbuild@builder16.centos.org) (gcc version 4.1.2 20071124 (Red Hat 4.1.2-42)) #1 SMP Tue kernel: BIOS-provided physical RAM map: kernel: BIOS-e820: 0000000000000000 - 000000000009dc00 (usable) kernel: BIOS-e820: 000000000009dc00 - 00000000000a0000 (reserved) kernel: BIOS-e820: 00000000000ce000 - 00000000000d0000 (reserved) kernel: BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved) kernel: BIOS-e820: 00000000dfe60000 - 00000000dfe6d000 (ACPI data) kernel: BIOS-e820: 00000000dfe6d000 - 00000000dfe6e000 (ACPI NVS) kernel: BIOS-e820: 00000000dff00000 - 00000000e0000000 (reserved) kernel: BIOS-e820: 00000000f0000000 - 00000000f8000000 (reserved) kernel: BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved) kernel: BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved) kernel: BIOS-e820: 00000000ff000000 - 0000000100000000 (reserved) kernel: BIOS-e820: 0000000100000000 - 0000000120000000 (usable) kernel: 3712MB HIGHMEM available. kernel: 896MB LOWMEM available. kernel: found SMP MP-table at 000f6770 kernel: Memory for crash kernel (0x0 to 0x0) notwithin permissible range kernel: disabling kdump top>>>>>>> [root@xyz~]# top top - 15:48:19 up 50 min, 1 user, load average: 0.00, 0.01, 0.00 Tasks: 160 total, 1 running, 159 sleeping, 0 stopped, 0 zombie Cpu(s): 0.0%us, 0.2%sy, 0.0%ni, 99.8%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Mem: 4150660k total, 747164k used, 3403496k free, 56700k buffers Swap: 8385920k total, 0k used, 8385920k free, 450424k cached As mentioned earlier there is no visible hardware issue.No heating disk failure else.. |
try a memory test from bios, or swap out the memory you currently have.
|
Thanks for all your replies
Quote:
|
So the system is even shutting down in a proper manner before starting up again. A loose connection of a broken power button could do it too. And as it’s a loose connection it’s never “pressed” for 4 seconds for a hard shutdown.
|
All times are GMT -5. The time now is 05:57 AM. |