LinuxQuestions.org
Visit Jeremy's Blog.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - General
User Name
Password
Linux - General This Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.

Notices


Reply
  Search this Thread
Old 06-02-2014, 04:17 AM   #1
kaito.7
LQ Newbie
 
Registered: Jun 2014
Posts: 6

Rep: Reputation: Disabled
BL460c G8 host unexpectedly reset


Hi all,

We have a HP BL460c G8 that 6 days ago did an unexpectedly reset.

The system runs SLES 11 with sp2. I checked all the logs (/var/log/messages, crash, etc.) and i did not find something usefull to understand the reason of the fail.

The only errors that i found was the above in OnBoard Administrator --> IML Log

System error ---> An Unrecoverable System Error (NMI) has occurred (System error code 0x0000002B, 0x00000000)

ASR ---> ASR Detected by System ROM

Any help will be welcome.

BRs,
Spyros
 
Old 06-02-2014, 06:00 AM   #2
Ser Olmy
Senior Member
 
Registered: Jan 2012
Distribution: Slackware
Posts: 3,348

Rep: Reputation: Disabled
The chipset detected an unrecoverable error condition and caused an "Automatic Server Recovery" (basically a hard reset) by triggering a Non-Maskable Interrupt (NMI). There's an excellent chance there's something physically wrong with the server, and you should probably run a diagnostic check as soon as possible.

If the OS locks up hard, watchdog timers (if configured) would eventually trigger an NMI. Non-correctable RAM errors, severe bus errors and over-/undervoltage are among the hardware errors that could trigger an NMI.

Do you have all the HP management agents installed and running? If not, some conditions that would normally result in a graceful shutdown (typically overheating) could progress to the point where a forced reboot would be considered necessary.

Edit: Have you seen this thread? It seems a buggy iLO driver can cause NMI ASRs under some conditions.

Last edited by Ser Olmy; 06-02-2014 at 06:03 AM. Reason: Added link to the HP forum
 
Old 06-02-2014, 07:33 AM   #3
kaito.7
LQ Newbie
 
Registered: Jun 2014
Posts: 6

Original Poster
Rep: Reputation: Disabled
Hi Olmy and thanks for your quick response.

First of all I have checked the thread at HP forum. That's why I wrote that our systems runs SLES 11 with sp2. So I beleive that works now and it is not a problem.

I ran HP diagnostic tools and all seem normal. Since then I monitor the hardware from Onboard Administrator and there is no something strange.
 
Old 06-02-2014, 08:16 AM   #4
Ser Olmy
Senior Member
 
Registered: Jan 2012
Distribution: Slackware
Posts: 3,348

Rep: Reputation: Disabled
Just because you're running SLES 11 SP2 doesn't mean all HP-specific drivers are updated. Have you installed a recent ProLiant Support Pack (PSP)?

Does HP System Management Homepage show any errors or warnings?
 
Old 06-02-2014, 08:51 AM   #5
kaito.7
LQ Newbie
 
Registered: Jun 2014
Posts: 6

Original Poster
Rep: Reputation: Disabled
The two servers have the 2013.09.0(C) which is not the latest. The latest version is 2014.02.0(B). The reason that I have not installed the new version is that when I installed in one of them the system them did not recognize the Emulex cards.

Furthermore the HP System Management Homepage do not show any errors or warnings.
 
Old 06-03-2014, 10:59 AM   #6
Soadyheid
Senior Member
 
Registered: Aug 2010
Location: Near Edinburgh, Scotland
Distribution: Cinnamon Mint 20.1 (Laptop) and 20.2 (Desktop)
Posts: 1,704

Rep: Reputation: 490Reputation: 490Reputation: 490Reputation: 490Reputation: 490
I don't think the SPP (Support Pack for Proliants, Yup! They replaced the PSP with another acronym for G7s and above) supports Emulex cards, That's Emulex rather than the HP branded variety.

I believe you should be able to boot directly into hardware diagnostics via the Intelligent Provisioning Utility accessed via the iLO on a Gen 8 server (or would that be within the POST?) Have you tried that to check for hardware problems? It's sort of like built in "SmartStart" Diagnostics.


Play Bonny!

 
Old 09-22-2014, 05:48 AM   #7
kaito.7
LQ Newbie
 
Registered: Jun 2014
Posts: 6

Original Poster
Rep: Reputation: Disabled
Thanks all for your support,

The problem was that there was an upgrade in iLo. For the HP was a known problem. I installed the patch one month ago and still now everything works properly.

BRs,

Spyros
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Descripton and impact of SCSI device,target,bus,host reset brianmcgee Linux - Hardware 1 02-20-2012 08:33 PM
Machine shutdown unexpectedly unexpectedly pinga123 Linux - Newbie 1 02-25-2011 04:48 PM
How to reset/decrease scsi host number neo3matrix Linux - Kernel 2 02-18-2011 03:24 PM
error for aacraid: Host adapter reset request. SCSI hang ? pankajd Linux - Newbie 1 01-03-2010 03:49 AM
Weird Host Unreachable and Connection Reset errors mcdown75 Linux - Newbie 4 07-09-2009 04:52 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - General

All times are GMT -5. The time now is 11:18 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration