LinuxQuestions.org
05-26-2005, 07:25 PM   #1
Toadman
Member
 
Registered: Aug 2002
Location: Copperas Cove, Texas
Distribution: Ubuntu 20.04 LTS
Posts: 304

Rep: Reputation: 21
kernel: CPU 0: Machine Check Exception: 0000000000000004


I wasn't sure which forum to post this in, so I'll try here. Over the past few days, about 18 hours apart, the box has locked up tight. There's no way out except the reset switch. Checking my logs, I see the following:

May 26 11:15:02 cpollock kernel: CPU 0: Machine Check Exception: 0000000000000004
May 26 11:15:02 cpollock kernel: Bank 2: f60020000000017a at 0000000000364000
May 26 11:15:02 cpollock kernel: Kernel panic: CPU context corrupt

This is on an AMD 1.2GHz T-Bird processor running Mandriva 10.1. I've googled the error and gone through about 10 pages of results; some say it's the CPU, some say memory. The CPU temp had been running about 118-120F. I blew out the system last night and now it's running about 112-114F. I thought heat might have been a factor, but it happened again today at the time above. I ran dmesg, but I don't know what I'm looking for in the output. Any advice on this would be appreciated.

Thanks
Chris
 
05-27-2005, 09:16 PM   #2
btmiller
Senior Member
 
Registered: May 2004
Location: In the DC 'burbs
Distribution: Arch, Scientific Linux, Debian, Ubuntu
Posts: 4,290

Rep: Reputation: 378
Really, it could be either the CPU or the memory. I see this happen on machines at work that have been running hot, and the temps you give might be a bit high for a processor of that clock speed (I've seen Xeons that regularly hit 140 Fahrenheit under load, though, so what do I know). You might try running the system under load with the cover off and see if the problem recurs.
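
CPU temperature limits are usually quoted in Celsius, so it can help to convert the Fahrenheit readings mentioned in this thread before comparing them with a datasheet. A minimal sketch; the labels in `readings_f` are just illustrative names for the figures quoted above, and no particular limit for this processor is assumed:

```python
def fahrenheit_to_celsius(deg_f: float) -> float:
    """Convert a temperature from degrees Fahrenheit to degrees Celsius."""
    return (deg_f - 32.0) * 5.0 / 9.0


# Readings quoted in the thread (low, high), in degrees Fahrenheit.
readings_f = {
    "before cleaning": (118, 120),
    "after cleaning": (112, 114),
    "Xeon under load": (140, 140),
}

for label, (low, high) in readings_f.items():
    low_c = fahrenheit_to_celsius(low)
    high_c = fahrenheit_to_celsius(high)
    print(f"{label}: {low_c:.1f} to {high_c:.1f} C")
```

The readings before cleaning work out to roughly 48-49 C, which is the figure to hold against whatever the processor's documentation specifies.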
 
05-27-2005, 09:27 PM   #3
Toadman
Member
 
Registered: Aug 2002
Location: Copperas Cove, Texas
Distribution: Ubuntu 20.04 LTS
Posts: 304

Original Poster
Rep: Reputation: 21
Don't know if this is related or another problem. I came home from work today to find that the system had tried to reboot but had stopped with this:

uncompressing kernel
crc error
system halted

To an unknowing newbie this sounds like the kernel burped or something. I got the same error about a week ago. I ran an upgrade from Mandriva's 1st CD, which reinstalled the kernel. I booted from the Ultimate Boot CD and checked the drives; both were OK. When I ran memtest86, though, it got to about 30% and rebooted. Does that possibly signify bad RAM? It's odd that the system seems to run OK for about 18 hours and then crashes.

Thanks for the reply
Chris
 
05-27-2005, 10:39 PM   #4
btmiller
Senior Member
 
Registered: May 2004
Location: In the DC 'burbs
Distribution: Arch, Scientific Linux, Debian, Ubuntu
Posts: 4,290

Rep: Reputation: 378
Bad RAM is definitely a possibility. The kernel image is stored compressed on disk, and it looks like you had an error uncompressing it, which means either the image itself is bad (and thus the drive is bad) or the RAM it was loaded into was flaking out. If you have multiple DIMMs, try pulling one stick at a time to narrow the error to a particular DIMM and verify that is indeed the problem.
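
To illustrate why a "crc error" points at either the stored image or the memory it was loaded into, here is a rough sketch of the check itself. A checksum is recorded when the image is compressed, and the decompressor recomputes it over what it actually produced. The `image` bytes and the `flip_bit` helper are hypothetical stand-ins (not a real kernel image or any boot loader's code); the point is only that a single flipped bit, whether it happened on disk or in RAM, makes the comparison fail:

```python
import zlib


def flip_bit(data: bytes, bit_index: int) -> bytes:
    """Return a copy of data with one bit inverted (simulates a bad cell)."""
    buf = bytearray(data)
    buf[bit_index // 8] ^= 1 << (bit_index % 8)
    return bytes(buf)


# Stand-in for the uncompressed kernel image (hypothetical content).
image = bytes(range(256)) * 64

# Checksum recorded when the image was built.
expected_crc = zlib.crc32(image)

# The same image after one bit went wrong somewhere between disk and RAM.
loaded = flip_bit(image, 12345)

crc_ok = zlib.crc32(loaded) == expected_crc
print("image intact" if crc_ok else "crc error")
```

The check cannot tell where the corruption happened, which is why swapping DIMMs one at a time is still needed to narrow it down.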
 
05-27-2005, 10:52 PM   #5
Toadman
Member
 
Registered: Aug 2002
Location: Copperas Cove, Texas
Distribution: Ubuntu 20.04 LTS
Posts: 304

Original Poster
Rep: Reputation: 21
Thanks, I'll try that with the RAM. In my original message:
May 26 11:15:02 cpollock kernel: Bank 2: f60020000000017a at 0000000000364000
I assume that bank 2 is the 2nd RAM module? If so, I swapped the 1st and 2nd modules today when I installed a 256MB module. If the 2nd module was bad, then I would think that if I get the same crash again it will report bank 1. Guess I'll see what happens in about 18 hours or so. I'll let you know.

Thanks
Chris
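
On the "Bank 2" question: in x86 machine-check reporting, the bank number refers to one of the CPU's internal error-reporting register banks, not to a DIMM slot, so swapping modules would not be expected to change it. The two hex values from the log can be picked apart with a short script. This is a sketch that assumes the generic x86 machine-check register layout (flag bits 63 down to 57 of the bank status, the error code in the low 16 bits, and the 0000 0001 RRRR TTLL "memory hierarchy" code form); the function names are made up for illustration, and the processor's own manual is the authority on the model-specific bits:

```python
# Flag bit positions, assuming the generic x86 machine-check layout.
MCG_STATUS_BITS = {0: "RIPV", 1: "EIPV", 2: "MCIP"}
MCI_STATUS_BITS = {
    63: "VAL",    # register contains a valid error
    62: "OVER",   # an earlier error was overwritten
    61: "UC",     # error was not corrected by hardware
    60: "EN",     # reporting was enabled for this error
    59: "MISCV",  # the MISC register holds extra data
    58: "ADDRV",  # the address in the log line is valid
    57: "PCC",    # processor context corrupt
}


def set_flags(value: int, names: dict) -> list:
    """List the names of the flag bits that are set, highest bit first."""
    return [name for bit, name in sorted(names.items(), reverse=True)
            if (value >> bit) & 1]


def decode_bank_status(status: int) -> dict:
    """Split a bank status value into flags and its two error-code fields."""
    return {
        "flags": set_flags(status, MCI_STATUS_BITS),
        "model_specific_code": (status >> 16) & 0xFFFF,
        "mca_error_code": status & 0xFFFF,
    }


def decode_memory_hierarchy(code: int):
    """Split a 0000 0001 RRRR TTLL error code; None if it has another form."""
    if code >> 8 != 0x01:
        return None
    return {
        "request": (code >> 4) & 0xF,      # RRRR
        "transaction": (code >> 2) & 0x3,  # TT
        "level": code & 0x3,               # LL
    }


mcg_status = int("0000000000000004", 16)   # from "Machine Check Exception:"
bank2_status = int("f60020000000017a", 16)  # from "Bank 2:"

print(set_flags(mcg_status, MCG_STATUS_BITS))
decoded = decode_bank_status(bank2_status)
print(decoded["flags"], hex(decoded["mca_error_code"]))
print(decode_memory_hierarchy(decoded["mca_error_code"]))
```

Under those assumptions, the PCC flag lines up with the "CPU context corrupt" panic message, and the low code has the memory-hierarchy form with level 2 and request 7, which in the generic encoding would read as an L2 cache eviction error. That would not by itself separate a failing CPU from heat or other causes, so the memtest86 and DIMM-swapping steps are still worth doing.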
 
  


All times are GMT -5.