LinuxQuestions.org
Visit Jeremy's Blog.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices


Reply
  Search this Thread
Old 06-23-2019, 03:52 PM   #1
Stefan42
LQ Newbie
 
Registered: Mar 2019
Posts: 8

Rep: Reputation: Disabled
Unhappy Ryzen 2400G iGPU randomly hangs, only SSH works


Background:
Hi, I build a new computer with the Ryzen 2400G CPU and the DeskMini A300 Motherboard/Case. I use Linux Mint 19.1 (Cinnamon).
I had to use a newer mainline kernel because the Linux Mint one (4.15) was too old for the iGPU of the Ryzen processor. I use 5.1.8 from the Ubuntu mainline ppa.

Problem:
At some random point (mostly while scrolling through websites), the computer gets stuck. The screen/mouse/keyboard does not respond to anything anymore. However, I can still SSH into the machine and e.g. reboot it. It is similar to the bug described here:
https://bugs.freedesktop.org/show_bug.cgi?id=105733

Tried solutions:
- I tried setting "amdgpu.dc=1" as kernel command line but the bug still appears.
- I tried other kernels, in particular I tried the 5.2.0 kernel from drm-next (compiled myself) and drm-tip (ubuntu) but Linux Mint did not boot then
- Same with 5.2 rc6, when booting I just get a black screen at the point where the graphics driver turns on. If I use nomodeset it boots but is slow.

Logs:
I used SSH to print the dmesg log:
Code:
[    0.000000] Linux version 5.1.8-050108-generic (kernel@kathleen) (gcc version 8.3.0 (Ubuntu 8.3.0-13ubuntu1)) #201906090832 SMP Sun Jun 9 08:35:40 UTC 2019
[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-5.1.8-050108-generic root=UUID=08a24e70-06ba-446c-944e-37138ac0f03f ro verbose amdgpu.dc=1
<<... Forum Limitations ...>>
[    5.840712] r8169 0000:02:00.0 enp2s0: Link is Up - 100Mbps/Full - flow control rx/tx
[    5.840723] IPv6: ADDRCONF(NETDEV_CHANGE): enp2s0: link becomes ready
[10318.481590] gmc_v9_0_process_interrupt: 11 callbacks suppressed
[10318.481597] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481602] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102002000 from 27
[10318.481605] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00101031
[10318.481614] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481616] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102000000 from 27
[10318.481618] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10318.481626] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481628] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102002000 from 27
[10318.481630] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10318.481637] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481640] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102000000 from 27
[10318.481641] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10318.481649] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481651] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102006000 from 27
[10318.481652] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10318.481659] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481662] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102002000 from 27
[10318.481663] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10318.481670] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481672] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102004000 from 27
[10318.481674] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10318.481681] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481683] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102006000 from 27
[10318.481685] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10318.481691] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481693] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102000000 from 27
[10318.481695] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10318.481702] amdgpu 0000:03:00.0: [gfxhub] no-retry page fault (src_id:0 ring:24 vmid:1 pasid:32768, for process Xorg pid 981 thread Xorg:cs0 pid 986)
[10318.481704] amdgpu 0000:03:00.0:   in page starting at address 0x0000800102002000 from 27
[10318.481706] amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
[10323.523168] [drm:amdgpu_dm_commit_planes.isra.43 [amdgpu]] *ERROR* Waiting for fences timed out.
[10328.653181] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=83716, emitted seq=83718
[10328.653268] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 981 thread Xorg:cs0 pid 986
[10328.653271] [drm] GPU recovery disabled.
[10338.893177] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=83716, emitted seq=83718
[10338.893263] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 981 thread Xorg:cs0 pid 986
[10338.893266] [drm] GPU recovery disabled.
[10349.133164] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=83716, emitted seq=83718
[10349.133251] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 981 thread Xorg:cs0 pid 986
[10349.133254] [drm] GPU recovery disabled.
[10359.373650] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=83716, emitted seq=83718
[10359.373737] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 981 thread Xorg:cs0 pid 986
[10359.373740] [drm] GPU recovery disabled.
<< ... repeats itself hundreds of times ... >>
[13093.458615] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=83716, emitted seq=83718
[13093.458702] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 981 thread Xorg:cs0 pid 986
[13093.458705] [drm] GPU recovery disabled.
[Full dmesg], [xorg log]

It would be great if someone could help me fix this problem

Edit: Even a way to reset the driver via SSH would be helpful. rmmod -f amdgpu does not work (busy).

Last edited by Stefan42; 06-23-2019 at 03:54 PM.
 
Old 06-24-2019, 03:44 AM   #2
mrmazda
Senior Member
 
Registered: Aug 2016
Location: USA
Distribution: openSUSE, Debian, Knoppix, Mageia, Fedora, others
Posts: 2,780
Blog Entries: 1

Rep: Reputation: 895Reputation: 895Reputation: 895Reputation: 895Reputation: 895Reputation: 895Reputation: 895
Just an educated guess based on the last changelog entries 8 or more months ago for the server and amdgpu DDX, and "AMDGPU(0): Chipset: "Unknown AMD Radeon GPU" (ChipID = 0x15dd)" in your log, you probably should have one or both of them as well as kernel from PPA repo(s). The 1.19.6 server is rather old. I'd try the newer xserver-xorg-video-amdgpu first. If neither help, one of the bug comments I read suggests it may be that a newer libdrm and/or Mesa is needed. Another possible issue mentioned in the bug is incompatible RAM.

I don't know anything about resetting any driver via ssh, but you should be able to restart LightDM/GDM/*DM using systemctl, heavy handed, but less so than rebooting.
 
Old 06-24-2019, 05:15 PM   #3
Stefan42
LQ Newbie
 
Registered: Mar 2019
Posts: 8

Original Poster
Rep: Reputation: Disabled
Thank you for the tip, I am currently trying out
https://launchpad.net/~paulo-miguel-...e/ubuntu/mesa/
and
https://www.ubuntuupdates.org/ppa/xorg-edgers
to see if the issue no longer occurs.

If that does not work I will try using Manjaro as it should have a up to date package base.

Quote:
I don't know anything about resetting any driver via ssh, but you should be able to restart LightDM/GDM/*DM using systemctl, heavy handed, but less so than rebooting.
Thanks, but my main objective is not to loos the programs that are currently open (i.e. LibreOffice text I was writing, web browser session etc.) so that does not help.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Video editor that works with Intel iGPU & VAAPI - does it exist? thender Linux - Software 9 06-26-2014 07:53 AM
Distro that allows use of IGPU and AMD graphics card for dual monitor puma pops Linux - Distributions 1 02-25-2013 04:39 PM
mtrr allocation failed. Graphics performance may suffer. Intel hd2000 iGPU Z0K4 Slackware 11 10-08-2012 11:42 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware

All times are GMT -5. The time now is 03:37 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration