NMI received for unkown reason
I had a server that was hung so had to power cycle it. Looking in the messages file, these are the last three lines it has prior to the reboot. Anyone know what this means? Dell server with an external array, RHEL.
Jun 3 05:18:38 ServerName kernel: Uhhuh. NMI received for unknown reason 21 on CPU 0. Jun 3 05:18:38 ServerName kernel: Dazed and confused, but trying to continue Jun 3 05:18:38 ServerName kernel: Do you have a strange power saving mode enabled? |
This is some kind of hardware related issue, you can boot maybe with unknown_nmi_panic=1 and see what happens.
|
Thanks for the reply. I restarted the system without that and it appears to be working ok now. not sure how to tell what caused it though.
|
Make sure to have your firmware and drivers up to date. Another thing that I have noticed is some erratic behavior with cstates turned on in the bios on Dell servers.
|
There are a lot of issues with NMI due to hardware, so listen to grim76 and keep your hardware up to date.
|
Firmware and drivers are up to date as far as I know. Not sure about the cstate settings though. Any way to check them when the system is already powered up?
|
I don't think you can check them while the system is online as they are BIOS settings. I would just confirm that all your firmware is where it should be and on your next reboot check your cstates.
|
Thanks, i'll give it a shot.
|
All times are GMT -5. The time now is 12:17 PM. |