LinuxQuestions.org
Help answer threads with 0 replies.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices


Reply
  Search this Thread
Old 02-18-2021, 05:45 AM   #1
dosensuppe
Member
 
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83

Rep: Reputation: Disabled
System completely freezes CPU Stuck / Hardware Error


Hello. So recently I got the issue of regular freezes on my ryzen 5 computer, especially when suspending to ram. But I also already experienced this issue turning the computer off.

I don't know when the issue occured, as what hardware change might have been responsible for it.
I have already tried resetting the cmos to no avail.

This is what such a freeze looks like:
  • computer completely freezes when suspending computer to ram (systemctl suspend)
  • gets completely unresponsive
  • after a while computer softreboots
upon reboot picture related error message appears and the pc proceeds to boot normally:

Any idea what this might be?
Is there some driver I am missing here or might this be a hardware related defect?
best regards

tl;dr this is roughly the message I'm getting after reboot [Hardware Error]: CPU 5: Machine Check: 0 Bank 5: bea0000000
Attached Thumbnails
Click image for larger version

Name:	9CwfPgl.jpg
Views:	32
Size:	222.9 KB
ID:	35755  

Last edited by dosensuppe; 03-01-2021 at 03:37 AM.
 
Old 02-18-2021, 06:12 AM   #2
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 17,542

Rep: Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608
Sounds like a memory error when it's talking about banks.

The approach I would take is: Let it crash, power off; reboot on a live usb key(important). Then you can search the latest entries in /var/log and dmesg.

If we're overclocking here, cease & desist. A long memory test is essential. Re-seating and/or swapping ram might be a good idea, because if the error (bank 5) changed, you're pretty sure it's the DIMM. If it doesn'[t change, your suspicions turn to the motherboard. Do that much, & post results.

Before I put much more, It's good to know your own opinion of your abilities. This might be better handled by an experienced guy.
 
Old 02-18-2021, 06:50 AM   #3
dosensuppe
Member
 
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83

Original Poster
Rep: Reputation: Disabled
I already tested the ram with memtest86 and it didn't give any errors.
Quote:
Originally Posted by business_kid View Post
Let it crash, power off; reboot on a live usb key(important). Then you can search the latest entries in /var/log and dmesg.
The weird thing is I didn't get any errors when I did a runtrough with memtest86 and still it looks like it is a memory issue.
Maybe a more elaborate memory test? I let it run the standard test.
I will do it again and if that doesn't give any errors I will do what you proposed.

so force downshutting upon freeze, looking at the logs from a live environment, switching memory banks and if it still says bank 5, it's the mainboards fault.

thank you

(btw I'm not posting from windows 10 it's a user agent)

Last edited by dosensuppe; 02-18-2021 at 07:13 AM.
 
Old 02-18-2021, 09:15 AM   #4
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 17,542

Rep: Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608
On all the Linux stuff - YES. As you say. Sorry if I wasn't clear.

On the windows 10 user agent - a guy's gotta do what a guy's gotta do. In my last job it was wall to wall windows.
 
Old 02-20-2021, 10:18 PM   #5
mrmazda
LQ Guru
 
Registered: Aug 2016
Location: SE USA
Distribution: openSUSE 24/7; Debian, Knoppix, Mageia, Fedora, OS/2, others
Posts: 6,498
Blog Entries: 1

Rep: Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231
Quote:
Originally Posted by dosensuppe View Post
I did a runtrough with memtest86
Memtest86? Or Memtest86+?

Only one? If yes, do at least 4 non-stop. More is better.
 
1 members found this post helpful.
Old 02-23-2021, 01:59 PM   #6
dosensuppe
Member
 
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83

Original Poster
Rep: Reputation: Disabled
Quote:
Originally Posted by mrmazda View Post
Memtest86? Or Memtest86+?

Only one? If yes, do at least 4 non-stop. More is better.
I think 86+.
I did the combined test. It runs through all tests at once for several hours. Didn't return any errors.
But I will try with test 4 only for an extended time also then. Thanks.

Which log should I take a look at?
Attached Thumbnails
Click image for larger version

Name:	logs.png
Views:	17
Size:	38.0 KB
ID:	35703  
 
Old 02-23-2021, 06:06 PM   #7
computersavvy
Senior Member
 
Registered: Aug 2016
Posts: 3,345

Rep: Reputation: 1486Reputation: 1486Reputation: 1486Reputation: 1486Reputation: 1486Reputation: 1486Reputation: 1486Reputation: 1486Reputation: 1486Reputation: 1486
I think mrmazda meant to let memtest86+ run for an extended time, and do at least 4 full passes through the tests.

Without knowing how much memory you have, I would suggest that you start the memtest at night and let it run all night so you get a thorough test and analysis. The longer it runs the better the overall test results are at identifying potential errors.

That screenshot you posted clearly shows hardware error machine checks, both errors on CPU 2 with one on bank 22 and one on bank 5.

You might try rebooting the same way more than once and take a pic each time so you can compare the data reported and see if it is always the same or if the error moves around with different cpu, memory addresses, etc each time.
 
Old 02-23-2021, 06:29 PM   #8
mrmazda
LQ Guru
 
Registered: Aug 2016
Location: SE USA
Distribution: openSUSE 24/7; Debian, Knoppix, Mageia, Fedora, OS/2, others
Posts: 6,498
Blog Entries: 1

Rep: Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231
Quote:
Originally Posted by computersavvy View Post
I think mrmazda meant to let memtest86+ run for an extended time, and do at least 4 full passes through the tests.
At least 4 passes, yes, but not +. Non-FOSS memtest86 by default will automatically stop when #4 completes. FOSS memtest86+ hasn't changed in years. Memtest86 has had multiple version increments since the last one for +.
 
1 members found this post helpful.
Old 02-25-2021, 11:34 AM   #9
dosensuppe
Member
 
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83

Original Poster
Rep: Reputation: Disabled
Quote:
Originally Posted by mrmazda View Post
At least 4 passes, yes, but not +. Non-FOSS memtest86 by default will automatically stop when #4 completes. FOSS memtest86+ hasn't changed in years. Memtest86 has had multiple version increments since the last one for +.
Ok. I didn't know there was a FOSS version.
Just downloaded it and will report back when I did the test.

Last edited by dosensuppe; 02-25-2021 at 11:43 AM.
 
Old 02-25-2021, 11:35 AM   #10
dosensuppe
Member
 
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83

Original Poster
Rep: Reputation: Disabled
Quote:
Originally Posted by computersavvy View Post
Without knowing how much memory you have, I would suggest that you start the memtest at night and let it run all night
I have 32gb
 
Old 02-25-2021, 02:27 PM   #11
dosensuppe
Member
 
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83

Original Poster
Rep: Reputation: Disabled
Oh well. Memtest86+ doesn't work with DDR4

how easy would it be to implement some kind of virus into a boot image?
Attached Thumbnails
Click image for larger version

Name:	Screenshot (5).png
Views:	7
Size:	3.2 KB
ID:	35715  

Last edited by dosensuppe; 02-25-2021 at 02:42 PM.
 
Old 02-27-2021, 01:32 PM   #12
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 17,542

Rep: Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608
Quote:
Originally Posted by dosensuppe View Post
Oh well. Memtest86+ doesn't work with DDR4

how easy would it be to implement some kind of virus into a boot image?
This seems to have gone quiet. I'd be surprised if Memtest wasn't updated to cope with DDR4 shortly. FOSS moves quickly. Report the bug. If anyone here has the appropriate memory, they might confirm it for you.

On the picture on post one, I had to open my insecure browser to view it but it looks hardware/timing related, Are we overclocking or under volting here? The Ryzen core voltage can be very low indeed. What is the base clock frequency? Have you tried lowering it?

I posted some ideas in post #4 How did they work out for you?
 
1 members found this post helpful.
Old 03-01-2021, 03:35 AM   #13
dosensuppe
Member
 
Registered: Feb 2021
Location: Germany
Distribution: Artix Linux, Slackware, Gentoo
Posts: 83

Original Poster
Rep: Reputation: Disabled
Quote:
Originally Posted by business_kid View Post
This seems to have gone quiet. I'd be surprised if Memtest wasn't updated to cope with DDR4 shortly.
No. I tested it. It also explicitly says on the website it doesn't support DDR4.
Quote:
Originally Posted by business_kid View Post
Are we overclocking or under volting here? The Ryzen core voltage can be very low indeed. What is the base clock frequency? Have you tried lowering it?
I left everything as it is. I didn't change anything. No overclocking.
Quote:
Originally Posted by business_kid View Post
I posted some ideas in post #4 How did they work out for you?
I booted into a live environment and saved the files inside /var/log
I don't really know which files to look at...
thanks for the replies
Attached Thumbnails
Click image for larger version

Name:	logs.png
Views:	10
Size:	38.0 KB
ID:	35754  
 
Old 03-01-2021, 04:41 AM   #14
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 17,542

Rep: Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608Reputation: 2608
The files you want to look at are
  • /var/log/messages - the last restart forward
  • /var/log/syslog - BTW both of these may have been moved under systemd
  • the output of the 'dmesg' command. This IS moved and renamed in a systemd environment. To find it in Linux Mint, I had to mount the Mint /, chroot to it, mount /proc (mount -t proc proc /proc) and run it's dmesg program. That's a forensic record of the last kernel's life, disk mounts, etc.
 
Old 03-01-2021, 11:12 PM   #15
mrmazda
LQ Guru
 
Registered: Aug 2016
Location: SE USA
Distribution: openSUSE 24/7; Debian, Knoppix, Mageia, Fedora, OS/2, others
Posts: 6,498
Blog Entries: 1

Rep: Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231Reputation: 2231
Quote:
Originally Posted by dosensuppe View Post
No. I tested it. It also explicitly says on the website it doesn't support DDR4.
Wrong website. https://www.memtest86.com/memtest86.html has the one that supports DDR4. It definitely works here with DDR4.
 
2 members found this post helpful.
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
[SOLVED] System completely freezes, what is broken and needs replacing? AndrewAmmerlaan Linux - Hardware 7 01-27-2021 02:46 PM
Cpu fan stuck to the cpu, how do I get the cpu out? abefroman Linux - Hardware 16 09-04-2009 12:47 AM
xorg / startx completely freezes system every time granth Slackware 17 11-06-2008 10:42 PM
Startx freezes system completely, mysterious white dots Dymitry Slackware 5 02-19-2007 04:41 PM
system freezes completely c-- Ubuntu 2 03-29-2005 04:23 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware

All times are GMT -5. The time now is 01:59 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration