LinuxQuestions.org


narayanapalla 04-15-2010 02:28 PM

inside /var/log/messages reporting these errors constantly
 
Hi,
The following error messages are being generated continuously in /var/log/messages:


Apr 15 10:24:04 cnlx100 kernel: EDAC k8 MC0: general bus error: participating processor(local node origin), time-out(no timeout) memory transaction type(generic read), mem or i/o(mem access), cache level(generic)
Apr 15 10:24:04 cnlx100 kernel: MC0: CE page 0x3c66, offset 0x1a0, grain 8, syndrome 0xd6b4, row 0, channel 0, label "": k8_edac
Apr 15 10:24:04 cnlx100 kernel: MC0: CE - no information available: k8_edac Error Overflow set
Apr 15 10:24:04 cnlx100 kernel: EDAC k8 MC0: extended error code: ECC chipkill x4 error



Why are these kinds of error messages being generated continuously?
Please kindly give me a solution for this.

Thnks in advance

AlucardZero 04-15-2010 03:15 PM

You probably have bad memory. Run memtest on your machine.
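
For reference, "CE" in those log lines stands for a corrected error: ECC caught and fixed a memory error, and the line records where it happened. Below is a minimal Python sketch that pulls the fields out of the line quoted above. The regex is written against that one sample only, `decode_ce` is a made-up helper name, and the 4 KiB page size used to rebuild the address is an assumption for this x86 box.

```python
import re

# Pattern for the "CE" (corrected error) line format shown in the post.
# Assumption: other kernels or EDAC drivers may print different fields.
CE_LINE = re.compile(
    r"MC(?P<mc>\d+): CE page 0x(?P<page>[0-9a-fA-F]+), "
    r"offset 0x(?P<offset>[0-9a-fA-F]+), grain (?P<grain>\d+), "
    r"syndrome 0x(?P<syndrome>[0-9a-fA-F]+), "
    r"row (?P<row>\d+), channel (?P<channel>\d+)"
)

PAGE_SHIFT = 12  # assumption: 4 KiB pages


def decode_ce(line):
    """Return the decoded fields of a CE line, or None if it is not one."""
    m = CE_LINE.search(line)
    if m is None:
        return None
    page = int(m.group("page"), 16)
    offset = int(m.group("offset"), 16)
    return {
        "mc": int(m.group("mc")),
        "row": int(m.group("row")),
        "channel": int(m.group("channel")),
        "syndrome": int(m.group("syndrome"), 16),
        # page frame number and offset combined into one address
        "phys_addr": (page << PAGE_SHIFT) | offset,
    }


sample = ('Apr 15 10:24:04 cnlx100 kernel: MC0: CE page 0x3c66, offset 0x1a0, '
          'grain 8, syndrome 0xd6b4, row 0, channel 0, label "": k8_edac')
info = decode_ce(sample)
print(info["mc"], info["row"], info["channel"], hex(info["phys_addr"]))
```

For the sample line this reports memory controller 0, row 0, channel 0 and address 0x3c661a0.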

TB0ne 04-15-2010 03:47 PM

Please spell your words out. As AlucardZero said, this is probably bad memory. A quick Google search could have given you the solution. What version/distro of Linux are you using, and what have you tried??

unSpawn 04-15-2010 03:59 PM

Also please post the machine brand and model, hardware specs, how the RAM banks are populated (evenly or not), and whether the errors also appear when running a previous kernel.

narayanapalla 04-16-2010 04:33 AM

Hi,
Actually, we r working on the servers remotely.
My task is to troubleshoot problems, so I need to investigate why these error messages are being generated continuously.

The following error messages keep appearing in /var/log/messages:
[root@cnlx100 ~]# tail /var/log/messages
Apr 15 10:24:04 cnlx100 kernel: EDAC k8 MC0: general bus error: participating processor(local node origin), time-out(no timeout) memory transaction type(generic read), mem or i/o(mem access), cache level(generic)
Apr 15 10:24:04 cnlx100 kernel: MC0: CE page 0x3c66, offset 0x1a0, grain 8, syndrome 0xd6b4, row 0, channel 0, label "": k8_edac


And the version is:

[root@cnlx100 ~]# cat /etc/issue
Red Hat Enterprise Linux ES release 4 (Nahant Update 7)
Kernel \r on an \m


AND WE R ONLY SUPPOSED TO WORK THROUGH REMOTE COMMANDS

Please kindly give me a solution: how can I find out why these error messages are being generated continuously, and how can I solve this problem?

AlucardZero 04-16-2010 08:30 AM

You probably have bad memory. Run memtest on your machine. You need physical access.

TB0ne 04-16-2010 09:07 AM

Again, SPELL YOUR WORDS OUT, and we have "kindly given" you a solution, but you don't want to hear it. Sorry, the only thing you can do is to replace the bad memory on your machine....that's it. So either replace it all with brand new memory, or wait until it totally fails.

narayanapalla 04-21-2010 09:27 AM

Quote:

Originally Posted by AlucardZero (Post 3936844)
You probably have bad memory. Run memtest on your machine.

Hi,
Yes, correct, this may be a memory module problem.
How do I resolve these bad memory modules?

And is it possible to reconfigure these bad memory modules? Please let me know bcoz it will help me a lot.

TB0ne 04-21-2010 09:40 AM

Quote:

Originally Posted by narayanapalla (Post 3942755)
Hi,
Yes, correct, this may be a memory module problem.
How do I resolve these bad memory modules?

You obviously REPLACE THE BAD MEMORY. Open the box...take out the old one...put in a new one.
Quote:

And is it possible to reconfigure these bad memory modules? Please let me know bcoz it will help me a lot.
Again, SPELL OUT YOUR WORDS. And no, it isn't...that's like asking "Gee, my car is broken down...if I put new air in the tires, will it run?"

fbsduser 04-21-2010 10:34 AM

You'll need to tell the local IT guy that his server's RAM is broken so he can go in and replace it.
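
To give the local admin something concrete, the CE lines can be counted per memory controller, row and channel; if nearly all of them land on the same row and channel, that narrows down which module to pull. Below is a rough Python sketch, assuming the log format posted earlier in this thread. `tally_ce` and `report` are hypothetical helper names. How a row/channel pair maps to a physical DIMM slot depends on the motherboard (the `label` field in these logs is empty), so that mapping has to come from the board documentation.

```python
import re
from collections import Counter

# Matches only the detailed "CE page ..." lines, not the summary lines.
# Assumption: format as posted in this thread (k8_edac on this kernel).
CE_LINE = re.compile(
    r"MC(\d+): CE page 0x[0-9a-fA-F]+,.*?row (\d+), channel (\d+)"
)


def tally_ce(lines):
    """Count corrected-error lines per (mc, row, channel)."""
    counts = Counter()
    for line in lines:
        m = CE_LINE.search(line)
        if m:
            counts[tuple(int(g) for g in m.groups())] += 1
    return counts


def report(counts):
    """Format the tally as one text line per (mc, row, channel)."""
    return [
        "MC%d row %d channel %d: %d corrected error(s)" % (mc, row, ch, n)
        for (mc, row, ch), n in sorted(counts.items())
    ]


# The four lines from the first post, used here as sample input.
sample_lines = [
    'Apr 15 10:24:04 cnlx100 kernel: EDAC k8 MC0: general bus error: '
    'participating processor(local node origin), time-out(no timeout) '
    'memory transaction type(generic read), mem or i/o(mem access), '
    'cache level(generic)',
    'Apr 15 10:24:04 cnlx100 kernel: MC0: CE page 0x3c66, offset 0x1a0, '
    'grain 8, syndrome 0xd6b4, row 0, channel 0, label "": k8_edac',
    'Apr 15 10:24:04 cnlx100 kernel: MC0: CE - no information available: '
    'k8_edac Error Overflow set',
    'Apr 15 10:24:04 cnlx100 kernel: EDAC k8 MC0: extended error code: '
    'ECC chipkill x4 error',
]

for text in report(tally_ce(sample_lines)):
    print(text)
```

On the server itself, passing an open file, e.g. `tally_ce(open("/var/log/messages"))`, would run the same count over the live log (reading it normally needs root). RHEL 4 ships a much older Python than this sketch assumes, so it is probably easier to copy the log off and run it elsewhere.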

