SlackwareThis Forum is for the discussion of Slackware Linux.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
yesterday when my PC was starting up, before it had completed the booting process (i.e. before I got the login prompt), it rebooted autonomously. I not sure how far it went but the second time the startup was completed successfully and it has been working normal since then. When I was checking the syslogs, found these error messages:
Jan 26 08:46:10 epg-hp kernel: [ 3.839827] [Hardware Error]: System Fatal error.
Jan 26 08:46:10 epg-hp kernel: [ 3.839938] [Hardware Error]: CPU:0 (15:60:1) MC4_STATUS[Over|UE|MiscV|PCC|AddrV|-|-]: 0xfe00000000070f0f
Jan 26 08:46:10 epg-hp kernel: [ 3.840214] [Hardware Error]: MC4 Error Address: 0x00000000d0d00e50
Jan 26 08:46:10 epg-hp kernel: [ 3.840314] [Hardware Error]: MC4 Error (node 0): Watchdog timeout due to lack of progress.
Jan 26 08:46:10 epg-hp kernel: [ 3.840510] [Hardware Error]: cache level: L3/GEN, mem/io: GEN, mem-tx: GEN, part-proc: GEN (timed out)
Googled a bit and found some people saying this could be RAM errors, however after ~8 hours running memtest didn find any errors.
So... What next? Any ideas of what could have caused this error??
it might be one of them things that only take place when you're not looking.
I'd keep an eye on it and perhaps get a new set of RAM chips just in case it's going down. Or at least stash some just in case money away for it.
I'm no expert on the subject, but aren't these errors related to the CPU cache and not the RAM? I agree with BW-userx that it could be a one-time thing.
Thks for the feedback... Yeah, you could be right. Anyway, I tried to run mcelog to capture proper logs if this issue happens again, but unfortunately AMD cpus are not supported. :-(
Same error message, same address... Is it fair to say it's a hardware issue? Any suggestions on how to troubleshoot this further??
Thank you
go to a store that takes returns, buy some hardware, swap it out, then see if the problem persists if yes, replace with the old, then swap out another piece of hardware with a new one then do the same.
repeat until the problem is no longer there.
take back everything that did not fix the problem and get your money back.
And the idea of changing HW until the problem disappears, I'm afraid it's not going to work for me. First, this is a company-owned laptop so I can't/shouldn't change the parts myself. And second, it's still under warranty so I'm gonna void it if I open the laptop.
I could just call warranty and see what they're gonna say, but I wanted to be sure this is indeed a HW issue...
And the idea of changing HW until the problem disappears, I'm afraid it's not going to work for me. First, this is a company-owned laptop so I can't/shouldn't change the parts myself. And second, it's still under warranty so I'm gonna void it if I open the laptop.
I could just call warranty and see what they're gonna say, but I wanted to be sure this is indeed a HW issue...
Are you are using a AMD CPU?
Is input–output memory management unit (IOMMU) available in the BIOS, and is it on?
How long you have company-owned laptop? How long laptop worked before this error started happening? Did you do something with the system since it was installed?
Check out this LINK and the link in the answer, seems to be cpu problem
It's a brand new PC, got it just a couple of months ago. First time I noticed this error was around two weeks ago, when I started this thread. Yesterday it happened again... And no, no changes were done since I installed slackware.
And I had seen that link you shared, but unfortunately I couldn't run mcelog, it seems (correct me if I'm wrong) that it doesn't support AMD cpus.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.