LinuxQuestions.org
07-02-2014, 04:26 PM   #1
rookee
LQ Newbie
 
Registered: Jun 2014
Posts: 28

Rep: Reputation: Disabled
Hardware error


Hi, I'm trying to understand what hardware errors these alerts correspond to. Could someone please help? Thanks in advance.

Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 229
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: APEI generic hardware error status
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: severity: 2, corrected
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: section: 0, severity: 2, corrected
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: flags: 0x01
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: primary
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: section_type: memory error
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: error_status: 0x0000000000000004
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: physical_address: 0x0000000a28d18bc0
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: node: 3
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: card: 5
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: module: 1
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: bank: 3
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: device: 3
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: row: 4525
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: column: 524
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: error_type: 2, single-bit ECC
Jul 2 15:22:01 host462 snmpd[9012]: refused smux peer: oid SNMPv2-SMI::enterprises.674.10892.1, descr Systems Management SNMP MIB Plug-in Manager
Jul 2 15:23:04 host462 snmpd[9012]: last message repeated 21 times
Jul 2 15:23:23 host462 snmpd[9012]: last message repeated 6 times
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 229
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: APEI generic hardware error status
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: severity: 2, corrected
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: section: 0, severity: 2, corrected
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: flags: 0x01
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: primary
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: section_type: memory error
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: error_status: 0x0000000000000004
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: physical_address: 0x0000000a28d18bc0
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: node: 3
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: card: 5
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: module: 1
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: bank: 3
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: device: 3
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: row: 4525
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: column: 524
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: error_type: 2, single-bit ECC

I'm using Red Hat 6.5
 
07-02-2014, 04:34 PM   #2
metaschima
Senior Member
 
Registered: Dec 2013
Distribution: Slackware
Posts: 1,982

Rep: Reputation: 491
Looks like one of the RAM modules may be failing. You appear to have ECC memory, so the error was corrected (the log says "severity: 2, corrected" and "single-bit ECC"), but you may want to replace the module at some point. The log gives some information about where the error occurred (node, card, module, bank), but that may not be enough to pinpoint the exact module without some trial and error.
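
To see at a glance whether the repeated alerts point at the same place, the kernel lines from the first post can be grouped by the record number in braces ({110}, {111}) and tallied by location. This is only a rough sketch in Python 3, not an official tool: the names `parse_apei_records` and `count_by_location` are made up for illustration, and SAMPLE is a shortened copy of the log posted above.

```python
import re
from collections import Counter

# Matches lines such as:
#   Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: node: 3
LINE_RE = re.compile(r"kernel: \{(\d+)\}\[Hardware Error\]:\s*(.*)$")

# Shortened copy of the log from the first post.
SAMPLE = """\
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: severity: 2, corrected
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: physical_address: 0x0000000a28d18bc0
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: node: 3
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: card: 5
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: module: 1
Jul 2 15:22:01 host462 kernel: {110}[Hardware Error]: error_type: 2, single-bit ECC
Jul 2 15:23:04 host462 snmpd[9012]: last message repeated 21 times
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: physical_address: 0x0000000a28d18bc0
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: node: 3
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: card: 5
Jul 2 15:23:23 host462 kernel: {111}[Hardware Error]: module: 1
"""


def parse_apei_records(lines):
    """Group '[Hardware Error]' lines into {record_id: {field: value}}."""
    records = {}
    for line in lines:
        match = LINE_RE.search(line)
        if not match:
            continue  # not a hardware error line (e.g. the snmpd messages)
        record_id = int(match.group(1))
        key, sep, value = match.group(2).partition(":")
        if sep:  # keep only 'key: value' lines; bare lines like 'primary' are skipped
            records.setdefault(record_id, {})[key.strip()] = value.strip()
    return records


def count_by_location(records):
    """Count records per (node, card, module, physical_address)."""
    counts = Counter()
    for fields in records.values():
        key = tuple(
            fields.get(name)
            for name in ("node", "card", "module", "physical_address")
        )
        counts[key] += 1
    return counts


records = parse_apei_records(SAMPLE.splitlines())
for location, hits in count_by_location(records).items():
    print(location, hits)
```

With the real log you would pass the lines of the syslog file instead of SAMPLE, assuming you are allowed to read it. If every record lands on the same node/card/module and physical address, as the two posted here do, that suggests one weak spot on a single module rather than scattered errors.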
 
1 members found this post helpful.
07-02-2014, 05:13 PM   #3
rookee
LQ Newbie
 
Registered: Jun 2014
Posts: 28

Original Poster
Rep: Reputation: Disabled
Is there a way that I can figure out which module is failing?
 
07-02-2014, 05:22 PM   #4
metaschima
Senior Member
 
Registered: Dec 2013
Distribution: Slackware
Posts: 1,982

Rep: Reputation: 491
How many RAM sticks are there?

You can try running memtest86+; it may provide more detailed info.

Once you decide which one to try, turn off and unplug the system, then remove the module you think is bad. Keep running the system afterwards to see whether the error appears again.
 
07-03-2014, 07:00 PM   #5
rookee
LQ Newbie
 
Registered: Jun 2014
Posts: 28

Original Poster
Rep: Reputation: Disabled
Unfortunately I don't have the privileges to install other software. Is there any other way?
 
07-03-2014, 07:47 PM   #6
metaschima
Senior Member
 
Registered: Dec 2013
Distribution: Slackware
Posts: 1,982

Rep: Reputation: 491
The only hints here are the data you posted; maybe you can figure it out using the number of RAM sticks and the location fields above (node, card, module, bank, etc.).
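
One more place to look that needs no extra software: if the kernel's EDAC driver is loaded, it keeps corrected-error counters under /sys/devices/system/edac/mc. Whether that directory exists on this box is an assumption; with errors reported by the firmware through APEI it may well be absent, in which case the sketch below simply returns nothing. It is written for Python 3, and the names `read_ce_counts` and `nonzero` are illustrative only.

```python
from pathlib import Path

# Standard EDAC sysfs location; it is only present when an EDAC driver is loaded.
EDAC_ROOT = Path("/sys/devices/system/edac/mc")


def read_ce_counts(root=EDAC_ROOT):
    """Return {relative_path: corrected_error_count} for EDAC ce_count files."""
    root = Path(root)
    counts = {}
    if not root.is_dir():
        return counts  # no EDAC driver loaded, or not a Linux system
    # Per-controller totals first, then per-csrow counters.
    for pattern in ("mc*/ce_count", "mc*/csrow*/ce_count"):
        for path in sorted(root.glob(pattern)):
            try:
                value = int(path.read_text().strip())
            except (OSError, ValueError):
                continue  # unreadable or unexpected content; skip it
            counts[path.relative_to(root).as_posix()] = value
    return counts


def nonzero(counts):
    """Keep only the counters that have recorded at least one corrected error."""
    return {name: value for name, value in counts.items() if value > 0}


for name, value in nonzero(read_ce_counts()).items():
    print(name, value)
```

A counter that keeps climbing on one csrow while the others stay at zero would narrow the search before you start pulling sticks. How csrow numbers map to physical slots depends on the board, so the hardware manual is still needed for the last step.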
 
1 members found this post helpful.
07-03-2014, 08:02 PM   #7
EDDY1
LQ Addict
 
Registered: Mar 2010
Location: Oakland,Ca
Distribution: wins7, Debian wheezy
Posts: 6,838

Rep: Reputation: 649
Why not just remove one stick at a time between reboots until you find the culprit?
 
1 members found this post helpful.
07-03-2014, 09:49 PM   #8
rookee
LQ Newbie
 
Registered: Jun 2014
Posts: 28

Original Poster
Rep: Reputation: Disabled
Thanks, guys! I'll see if I can figure it out that way.

Last edited by rookee; 07-03-2014 at 09:55 PM.
 
07-04-2014, 12:48 AM   #9
pan64
LQ Guru
 
Registered: Mar 2012
Location: Hungary
Distribution: debian/ubuntu/suse ...
Posts: 8,993

Rep: Reputation: 2623
Quote:
Originally Posted by rookee
Unfortunately I don't have the privileges to install other software. Is there any other way?
You do not need to install anything; you only need to boot into memtest86+. Usually there is a menu entry for memtest86+ during boot...
 
1 members found this post helpful.
  


All times are GMT -5.
