Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux? |
Notices |
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
Are you new to LinuxQuestions.org? Visit the following links:
Site Howto |
Site FAQ |
Sitemap |
Register Now
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
|
 |
|
02-18-2021, 05:45 AM
|
#1
|
Member
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83
Rep: 
|
System completely freezes CPU Stuck / Hardware Error
Hello. So recently I got the issue of regular freezes on my ryzen 5 computer, especially when suspending to ram. But I also already experienced this issue turning the computer off.
I don't know when the issue occured, as what hardware change might have been responsible for it.
I have already tried resetting the cmos to no avail.
This is what such a freeze looks like:
- computer completely freezes when suspending computer to ram (systemctl suspend)
- gets completely unresponsive
- after a while computer softreboots
upon reboot picture related error message appears and the pc proceeds to boot normally:
Any idea what this might be?
Is there some driver I am missing here or might this be a hardware related defect?
best regards
tl;dr this is roughly the message I'm getting after reboot [Hardware Error]: CPU 5: Machine Check: 0 Bank 5: bea0000000
Last edited by dosensuppe; 03-01-2021 at 03:37 AM.
|
|
|
02-18-2021, 06:12 AM
|
#2
|
LQ Guru
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 17,542
|
Sounds like a memory error when it's talking about banks.
The approach I would take is: Let it crash, power off; reboot on a live usb key(important). Then you can search the latest entries in /var/log and dmesg.
If we're overclocking here, cease & desist. A long memory test is essential. Re-seating and/or swapping ram might be a good idea, because if the error (bank 5) changed, you're pretty sure it's the DIMM. If it doesn'[t change, your suspicions turn to the motherboard. Do that much, & post results.
Before I put much more, It's good to know your own opinion of your abilities. This might be better handled by an experienced guy.
|
|
|
02-18-2021, 06:50 AM
|
#3
|
Member
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83
Original Poster
Rep: 
|
I already tested the ram with memtest86 and it didn't give any errors.
Quote:
Originally Posted by business_kid
Let it crash, power off; reboot on a live usb key(important). Then you can search the latest entries in /var/log and dmesg.
|
The weird thing is I didn't get any errors when I did a runtrough with memtest86 and still it looks like it is a memory issue.
Maybe a more elaborate memory test? I let it run the standard test.
I will do it again and if that doesn't give any errors I will do what you proposed.
so force downshutting upon freeze, looking at the logs from a live environment, switching memory banks and if it still says bank 5, it's the mainboards fault.
thank you
(btw I'm not posting from windows 10 it's a user agent)
Last edited by dosensuppe; 02-18-2021 at 07:13 AM.
|
|
|
02-18-2021, 09:15 AM
|
#4
|
LQ Guru
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 17,542
|
On all the Linux stuff - YES. As you say. Sorry if I wasn't clear.
On the windows 10 user agent - a guy's gotta do what a guy's gotta do. In my last job it was wall to wall windows.
|
|
|
02-20-2021, 10:18 PM
|
#5
|
LQ Guru
Registered: Aug 2016
Location: SE USA
Distribution: openSUSE 24/7; Debian, Knoppix, Mageia, Fedora, OS/2, others
Posts: 6,498
|
Quote:
Originally Posted by dosensuppe
I did a runtrough with memtest86
|
Memtest86? Or Memtest86+?
Only one? If yes, do at least 4 non-stop. More is better.
|
|
1 members found this post helpful.
|
02-23-2021, 01:59 PM
|
#6
|
Member
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83
Original Poster
Rep: 
|
Quote:
Originally Posted by mrmazda
Memtest86? Or Memtest86+?
Only one? If yes, do at least 4 non-stop. More is better.
|
I think 86+.
I did the combined test. It runs through all tests at once for several hours. Didn't return any errors.
But I will try with test 4 only for an extended time also then. Thanks.
Which log should I take a look at?
|
|
|
02-23-2021, 06:06 PM
|
#7
|
Senior Member
Registered: Aug 2016
Posts: 3,345
|
I think mrmazda meant to let memtest86+ run for an extended time, and do at least 4 full passes through the tests.
Without knowing how much memory you have, I would suggest that you start the memtest at night and let it run all night so you get a thorough test and analysis. The longer it runs the better the overall test results are at identifying potential errors.
That screenshot you posted clearly shows hardware error machine checks, both errors on CPU 2 with one on bank 22 and one on bank 5.
You might try rebooting the same way more than once and take a pic each time so you can compare the data reported and see if it is always the same or if the error moves around with different cpu, memory addresses, etc each time.
|
|
|
02-23-2021, 06:29 PM
|
#8
|
LQ Guru
Registered: Aug 2016
Location: SE USA
Distribution: openSUSE 24/7; Debian, Knoppix, Mageia, Fedora, OS/2, others
Posts: 6,498
|
Quote:
Originally Posted by computersavvy
I think mrmazda meant to let memtest86+ run for an extended time, and do at least 4 full passes through the tests.
|
At least 4 passes, yes, but not +. Non-FOSS memtest86 by default will automatically stop when #4 completes. FOSS memtest86+ hasn't changed in years. Memtest86 has had multiple version increments since the last one for +.
|
|
1 members found this post helpful.
|
02-25-2021, 11:34 AM
|
#9
|
Member
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83
Original Poster
Rep: 
|
Quote:
Originally Posted by mrmazda
At least 4 passes, yes, but not +. Non-FOSS memtest86 by default will automatically stop when #4 completes. FOSS memtest86+ hasn't changed in years. Memtest86 has had multiple version increments since the last one for +.
|
Ok. I didn't know there was a FOSS version.
Just downloaded it and will report back when I did the test.
Last edited by dosensuppe; 02-25-2021 at 11:43 AM.
|
|
|
02-25-2021, 11:35 AM
|
#10
|
Member
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83
Original Poster
Rep: 
|
Quote:
Originally Posted by computersavvy
Without knowing how much memory you have, I would suggest that you start the memtest at night and let it run all night
|
I have 32gb
|
|
|
02-25-2021, 02:27 PM
|
#11
|
Member
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83
Original Poster
Rep: 
|
Oh well. Memtest86+ doesn't work with DDR4
how easy would it be to implement some kind of virus into a boot image?
Last edited by dosensuppe; 02-25-2021 at 02:42 PM.
|
|
|
02-27-2021, 01:32 PM
|
#12
|
LQ Guru
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 17,542
|
Quote:
Originally Posted by dosensuppe
Oh well. Memtest86+ doesn't work with DDR4
how easy would it be to implement some kind of virus into a boot image?
|
This seems to have gone quiet. I'd be surprised if Memtest wasn't updated to cope with DDR4 shortly. FOSS moves quickly. Report the bug. If anyone here has the appropriate memory, they might confirm it for you.
On the picture on post one, I had to open my insecure browser to view it but it looks hardware/timing related, Are we overclocking or under volting here? The Ryzen core voltage can be very low indeed. What is the base clock frequency? Have you tried lowering it?
I posted some ideas in post #4 How did they work out for you?
|
|
1 members found this post helpful.
|
03-01-2021, 03:35 AM
|
#13
|
Member
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83
Original Poster
Rep: 
|
Quote:
Originally Posted by business_kid
This seems to have gone quiet. I'd be surprised if Memtest wasn't updated to cope with DDR4 shortly.
|
No. I tested it. It also explicitly says on the website it doesn't support DDR4.
Quote:
Originally Posted by business_kid
Are we overclocking or under volting here? The Ryzen core voltage can be very low indeed. What is the base clock frequency? Have you tried lowering it?
|
I left everything as it is. I didn't change anything. No overclocking.
Quote:
Originally Posted by business_kid
I posted some ideas in post #4 How did they work out for you?
|
I booted into a live environment and saved the files inside /var/log
I don't really know which files to look at...
thanks for the replies
|
|
|
03-01-2021, 04:41 AM
|
#14
|
LQ Guru
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 17,542
|
The files you want to look at are - /var/log/messages - the last restart forward
- /var/log/syslog - BTW both of these may have been moved under systemd
- the output of the 'dmesg' command. This IS moved and renamed in a systemd environment. To find it in Linux Mint, I had to mount the Mint /, chroot to it, mount /proc (mount -t proc proc /proc) and run it's dmesg program. That's a forensic record of the last kernel's life, disk mounts, etc.
|
|
|
03-01-2021, 11:12 PM
|
#15
|
LQ Guru
Registered: Aug 2016
Location: SE USA
Distribution: openSUSE 24/7; Debian, Knoppix, Mageia, Fedora, OS/2, others
Posts: 6,498
|
Quote:
Originally Posted by dosensuppe
No. I tested it. It also explicitly says on the website it doesn't support DDR4.
|
Wrong website. https://www.memtest86.com/memtest86.html has the one that supports DDR4. It definitely works here with DDR4.
|
|
2 members found this post helpful.
|
All times are GMT -5. The time now is 01:59 AM.
|
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.
|
Latest Threads
LQ News
|
|