LinuxQuestions.org
LinuxAnswers - the LQ Linux tutorial section.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices

Reply
 
Search this Thread
Old 10-27-2009, 02:14 PM   #1
catkin
LQ 5k Club
 
Registered: Dec 2008
Location: Tamil Nadu, India
Distribution: Servers: Debian Squeeze and Wheezy. Desktop: Slackware64 14.0. Netbook: Slackware 13.37
Posts: 8,563
Blog Entries: 29

Rep: Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179
ECC memory reporting possible with these components?


Hello

I'm trying to set up EDAC to report on ECC memory and it's not working. AFAIK the components are all suitable but I'm a hardware dilettante and this is the first time I've built a kernel.

Here are the components.The motherboard documentation says "(Note 3) Use of a CPU that supports ECC is required if you wish to install ECC memory". AFAIK the Athlon X2 range of CPUs do support ECC but ... ?

The memory description is "4GB 800MHz DDR2 ECC CL5 DIMM (Kit of 2)".

The new kernel does have EDAC support as evidenced by
Code:
root:~# zcat /proc/config.gz | grep EDAC | grep -v '^#'
CONFIG_EDAC=y
CONFIG_EDAC_MM_EDAC=y
CONFIG_EDAC_AMD76X=y
So far, so good but the user-space utility doesn't find the drivers
Code:
root:~# edac-ctl --status
edac-ctl: drivers not loaded.
What to do?

Best

Charles
 
Old 10-28-2009, 12:04 AM   #2
Electro
Guru
 
Registered: Jan 2002
Posts: 6,042

Rep: Reputation: Disabled
Gentoo states it as mask which means it is not stable enough. If you insist of using it, you should make sure sysfsutils and dmidecode are installed. Since it is a user space program, make sure the kernel has support for user space programs.

You do not need EDAC to make ECC memory function. It happens behind the scenes.
 
Old 10-28-2009, 01:41 PM   #3
catkin
LQ 5k Club
 
Registered: Dec 2008
Location: Tamil Nadu, India
Distribution: Servers: Debian Squeeze and Wheezy. Desktop: Slackware64 14.0. Netbook: Slackware 13.37
Posts: 8,563
Blog Entries: 29

Original Poster
Rep: Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179
Quote:
Originally Posted by Electro View Post
Gentoo states it as mask which means it is not stable enough. If you insist of using it, you should make sure sysfsutils and dmidecode are installed. Since it is a user space program, make sure the kernel has support for user space programs.

You do not need EDAC to make ECC memory function. It happens behind the scenes.
Thanks Electro

The information about EDAC not being stable enough is illuminating. I'm aiming to improve system availability, not decrease it! I understand that parity errors should be automatically corrected but was hoping that EDAC could be used to report on error corrections, giving advance warning of a degrading chip if corrections were being done in the same locality.

Given that EDAC is not stable enough, do you happen to know if any other reporting tools are available?
 
Old 10-28-2009, 05:28 PM   #4
Electro
Guru
 
Registered: Jan 2002
Posts: 6,042

Rep: Reputation: Disabled
No I do not know anything else. The EDAC is new to me. Also Linux-ECC project is new. IMHO, it is for those people that are perfectionist that want to know if their computer is working perfectly.

No offense if you are paranoid of having a perfect computer that handles data perfectly, then you should not use computers. Also the word perfect is not actually a good word to use in the computer industry or any engineering field. A good word to use is doing the best you can to minimize errors. In any engineering field there is always errors. There is an old saying "there is always room for error."

You already did the best you can by buying ECC memory. Make sure you stay within temperature and voltage limits or else your setup will be unstable and unreliable.
 
Old 10-29-2009, 07:46 AM   #5
catkin
LQ 5k Club
 
Registered: Dec 2008
Location: Tamil Nadu, India
Distribution: Servers: Debian Squeeze and Wheezy. Desktop: Slackware64 14.0. Netbook: Slackware 13.37
Posts: 8,563
Blog Entries: 29

Original Poster
Rep: Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179Reputation: 1179
Quote:
Originally Posted by Electro View Post
IMHO, it is for those people that are perfectionist that want to know if their computer is working perfectly.
If only!

I know only too well that computers do not and cannot work perfectly; if nothing else, extra-terrestrial gamma radiation causes all semi-conductor devices to malfunction at random times. More manageably, devices degrade with time and their systematic error-rates increase.

Disk monitoring, once only for top-end servers, is becoming commoner in workstations, at best giving us early warning of disk failure and allowing us to replace at a convenient time before catastrophic failure interrupts day-to-day work.

The same is happening with memory (and PCI devices, ...). EDAC can deliver similar warning of impending failure on workstations as has been available on high-end servers. Apparently EDAC is not mature yet but something like it is likely to become common and in a few years hardware monitoring will be taken for granted.
 
  


Reply

Tags
ecc, kernel


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Memory usage reporting? ivanatora *BSD 3 07-06-2008 06:32 AM
uncorrected ECC memory errors on ASUS motherboards wpshooter Linux - Hardware 3 08-31-2007 08:00 PM
How does a server report ECC memory errors to Linux? Reggy Linux - Hardware 1 02-10-2007 02:01 PM
ECC Memory Ztyx Linux - Hardware 2 01-29-2004 06:43 PM
Selection of ECC DDR 266 1024 MB memory tungsten Linux - Hardware 6 08-02-2003 03:02 PM


All times are GMT -5. The time now is 08:40 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration