LinuxQuestions.org
Help answer threads with 0 replies.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software > Linux - Kernel
User Name
Password
Linux - Kernel This forum is for all discussion relating to the Linux kernel.

Notices


Reply
  Search this Thread
Old 09-10-2020, 03:26 AM   #1
MTCoster
LQ Newbie
 
Registered: Sep 2020
Distribution: Ubuntu
Posts: 2

Rep: Reputation: Disabled
Kexec on MT8173 platform causing high volume of IOMMU faults


I’ve been attempting to use kexec to write a network bootloader for an Acer R13 Chromebook. I initially tried on a 5.8.7 kernel, then moved on to 5.9-rc4 to see if that fixed the problem (spoiler: it didn’t).

When booted from kexec (instead of the native u-boot), the console is flooded with IOMMU fault messages (fortunately rate limited!) like these:

Code:
[  +0.001401] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xffb39000 pa=0xb6d39500 larb=0 port=0 layer=0 read
[  +0.001402] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xffbe4000 pa=0xb6de4400 larb=0 port=0 layer=0 read
[  +0.001377] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xffc90000 pa=0xb6e90400 larb=0 port=0 layer=0 read
[  +0.001357] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xffd3a000 pa=0xb6f3a500 larb=0 port=0 layer=0 read
[  +0.001342] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xffde0000 pa=0xb6fe0e00 larb=0 port=0 layer=0 read
[  +0.001336] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xffe86000 pa=0xb6886b00 larb=0 port=0 layer=0 read
[  +0.001316] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xfff2b001 pa=0xb7d5b700 larb=0 port=0 layer=1 read
[  +0.001323] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xfffcf001 pa=0xb7dff200 larb=0 port=0 layer=1 read
[  +0.001325] mtk-iommu 10205000.iommu: fault type=0x4 iova=0xff861000 pa=0xb7e61380 larb=0 port=0 layer=0 read
[  +4.991802] mtk_iommu_isr: 1209385 callbacks suppressed
That’s a lot of suppressed messages, and a similar block is printed every 5 seconds.

The system appears to be perfectly functional otherwise, but the kernel is using ~70% of a CPU core at idle as opposed to ~10% after a non-kexec boot (as reported by htop).

So far I’ve identified mtk_iommu_isr as the source of the message, and a call to domain->handler(...) in report_iommu_fault as responsible for the status code triggering the message. I’m still struggling to figure out where that handler is registered though.

My current thinking is that the actual issue stems from mtk_iommu_hw_init where the IOMMU hardware on the SoC is initialized. I suspect the hardware isn’t expecting to be initialized again, but without any documentation publicly available from Mediatek this is difficult to prove.

Does anyone have any suggestions as to how I could go about getting to the bottom of this?

PS: I hope this is the right place to post this. If not, please let me know!

Last edited by MTCoster; 09-10-2020 at 03:27 AM. Reason: Fixed typo
 
Old 09-10-2020, 11:45 AM   #2
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Slarm64 & Android
Posts: 16,297

Rep: Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322Reputation: 2322
Not really a software guy here - more hardware myself.

That said, Mediatek? They're also selling Ralink, and I wish you luck. They seem to be Linux Lazy, to put it at it's mildest.

My experience was that I was looking for a wifi driver in 2020, and eventually found an out-of-tree 2012 wifi 'driver for everything' written on a RH 2.6 kernel :-/. I'm not expecting any updates. They don't put software on their site much.

If you're trying older kernels, RH/Centos 2.6.1.-2 mightn't be a bad one to start with. They seemed to contract a linux guy back in 2012. His stuff seemed OK -for 2012. It will take you less than an hour to try the 2.6 kernel. Yes, it will break everything, but you may get an answer on your IOMMU issue. If the old kernel sorts it, I'd see about bending the Mediatek code straight. If my wifi experience is anything to go by, the hardware is ok. But the problem is, it's so cheap, people trying to save a few cents buy it. That leaves the problem to others.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Kexec on MT8173 platform causing high volume of IOMMU faults MTCoster Linux - Kernel 1 09-14-2020 08:59 AM
kexec issue on arm platform manjunath1990 Linux - Kernel 4 11-16-2017 01:23 PM
[SOLVED] virt-manager PCI passthrough gives error "No IOMMU Found" even w/ Intel IOMMU enabled jessetaylor84 Linux - Virtualization and Cloud 6 09-26-2014 08:14 PM
:high address but no IOMMU error ??? your_shadow03 Linux - Newbie 5 09-30-2008 09:44 AM
PCI-DMA not syncing: high address but no IOMMU joe_merchant Linux - Hardware 4 09-30-2008 05:16 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software > Linux - Kernel

All times are GMT -5. The time now is 06:30 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration