LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Newbie (https://www.linuxquestions.org/questions/linux-newbie-8/)
-   -   kernel panic- not syncing : fatal exception on RHEL 5 (https://www.linuxquestions.org/questions/linux-newbie-8/kernel-panic-not-syncing-fatal-exception-on-rhel-5-a-4175464332/)

patelra 06-01-2013 04:53 AM

kernel panic- not syncing : fatal exception on RHEL 5
 
Hi All,

One of the RHEL 5 server suddenly went in to kernel panic with error "kernel panic- not syncing : fatal exception on RHEL 5".

In order to revive the system we hard rest the system using ILO. No core dump was generated due to kdump mistakenly not operational due to which we are not able to figure out the root cause of the kernel panic.

I am sharing the logs before server was restarted, Could the trap divide error be the root cause as it could be seen before restart.

Note: This server is a part of Oracle cluster.

May 28 03:33:34 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:59803
May 28 03:33:49 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:57607
May 28 03:33:49 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:57607
May 28 03:33:49 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:37767
May 28 03:33:49 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:10862
May 28 03:34:03 nxdbsd05 kernel: nmupm[8938] trap divide error rip:804a36a rsp:fffed274 error:0
May 28 03:34:04 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:62322
May 28 03:34:04 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:62322
May 28 03:34:04 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:63851
May 28 03:34:04 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:43416
May 28 03:34:18 nxdbsd05 kernel: nmupm[9015] trap divide error rip:804a36a rsp:ffc40334 error:0
May 28 03:34:19 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:62934
May 28 03:34:19 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:62934
May 28 03:34:19 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:62302
May 28 03:34:19 nxdbsd05 snmpd[11718]: Connection from UDP: [127.0.0.1]:10581
May 28 03:34:23 nxdbsd05 snmpd[11718]: Connection from UDP: [10.40.198.110]:55385
May 28 03:34:23 nxdbsd05 last message repeated 2 times
May 28 03:34:23 nxdbsd05 xinetd[18803]: START: nrpe pid=9043 from=10.40.111.152
May 28 03:34:23 nxdbsd05 nrpe[9043]: INFO: SSL/TLS initialized. All network traffic will be encrypted.
May 28 03:34:23 nxdbsd05 xinetd[18803]: EXIT: nrpe status=0 pid=9043 duration=0(sec)
May 28 03:56:30 nxdbsd05 syslogd 1.4.1: restart.
May 28 03:56:30 nxdbsd05 kernel: klogd 1.4.1, log source = /proc/kmsg started.

business_kid 06-04-2013 10:30 AM

If you've nothing else, it's a good place to start. Your choice of colour made the message illegible, btw.

My guess is: Something went badly wrong; the kernel trapped it and logged it; then there was a crash. Is a hack attempt a possibility?

patelra 06-07-2013 09:31 AM

This was the error logged before the kernel panic and reset. No possibility of hacking.

May 28 03:34:18 nxdbsd05 kernel: nmupm[9015] trap divide error rip:804a36a rsp:ffc40334 error:0

patelra 06-10-2013 03:35 PM

the server rebooted again (This time no panic) with the same error logged before reboot. Please let me know if anybody has seen this error before


kernel: nmupm[30394] trap divide error rip:804a36a rsp:ffd09094 error:0

business_kid 06-11-2013 03:52 AM

This thread started 1st June. Haven't you updated the kernel since then?


All times are GMT -5. The time now is 05:18 PM.