LinuxQuestions.org
12-27-2019, 06:45 AM   #1
dnp
LQ Newbie
 
Registered: Jan 2018
Posts: 3

Rep: Reputation: Disabled
possible to configure kernel to completely avoid using a core of multicore processor?


i just got the following,

Code:
syslog:Dec 27 07:09:46 carpdiem kernel: [33152702.203117] [Hardware Error]: Corrected error, no action required.
syslog:Dec 27 07:09:46 carpdiem kernel: [33152702.203277] [Hardware Error]: CPU:2 (15:2:0) MC2_STATUS[-|CE|MiscV|-|-|-|-|CECC]: 0x98144000010c0176
syslog:Dec 27 07:09:46 carpdiem kernel: [33152702.203430] [Hardware Error]: MC2 Error: VB Data ECC or parity error.
syslog:Dec 27 07:09:46 carpdiem kernel: [33152702.203792] [Hardware Error]: cache level: L2, tx: DATA, mem-tx: EV
if things become worse or repetitive, is it possible to configure my kernel/system to just not use this one core?

i'm on Slackware 14.2
and uname -a shows
Code:
Linux carpdiem 4.4.14 #2 SMP Fri Jun 24 13:38:27 CDT 2016 x86_64 AMD FX(tm)-8320 Eight-Core Processor AuthenticAMD GNU/Linux
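
For reference, the flags the kernel printed on the MC2_STATUS line can be reproduced from the raw hex value. What follows is a minimal Python sketch, not the kernel's own decoder. The bit positions are assumptions taken from the common x86 machine-check status layout (bits 63 to 57) plus AMD's CECC/UECC bits (46 and 45), and the names `FLAGS` and `decode_status` are made up for illustration.

```python
# Sketch: decode a few flag bits of an x86 MCi_STATUS value.
# ASSUMPTION: bit positions follow the usual machine-check status layout
# (bits 63..57) and AMD's CECC/UECC bits (46/45). Not the kernel's code.
FLAGS = {
    63: "Val",    # register holds a valid error
    62: "Over",   # an earlier error was overwritten
    61: "UC",     # error was NOT corrected
    60: "En",     # reporting was enabled for this error
    59: "MiscV",  # MISC register holds extra information
    58: "AddrV",  # ADDR register holds an address
    57: "PCC",    # processor context may be corrupt
    46: "CECC",   # corrected by ECC (AMD)
    45: "UECC",   # uncorrectable ECC error (AMD)
}


def decode_status(status: int) -> list[str]:
    """Return the names of the known flag bits set in `status`, highest bit first."""
    return [name for bit, name in sorted(FLAGS.items(), reverse=True)
            if (status >> bit) & 1]


if __name__ == "__main__":
    # Value copied from the log line above.
    print(decode_status(0x98144000010c0176))
```

With the value from the log, UC and PCC are clear and CECC is set, which agrees with the "Corrected error, no action required" message.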
 
12-27-2019, 11:09 AM   #2
pan64
LQ Addict
 
Registered: Mar 2012
Location: Hungary
Distribution: debian/ubuntu/suse ...
Posts: 24,316

Rep: Reputation: 7985
I'm not really sure if it was a core (or something else).
Quote:
Now what does that really mean?
*shrug*, could be firmware/drivers/overheating/poor-CPU-seating/DIMM-seating/faulty-motherboard/faulty-CPU/faulty-DIMM.
https://centosfaq.org/centos/kernelhardware-error/

additionally:
https://www.linuxquestions.org/quest....php?p=5742874
 
12-27-2019, 11:48 AM   #3
rnturn
Senior Member
 
Registered: Jan 2003
Location: Illinois (SW Chicago 'burbs)
Distribution: openSUSE, Raspbian, Slackware. Previous: MacOS, Red Hat, Coherent, Consensys SVR4.2, Tru64, Solaris
Posts: 2,850

Rep: Reputation: 553
Quote:
Originally Posted by dnp
i just got the following,

Code:
syslog:Dec 27 07:09:46 carpdiem kernel: [33152702.203117] [Hardware Error]: Corrected error, no action required.
syslog:Dec 27 07:09:46 carpdiem kernel: [33152702.203277] [Hardware Error]: CPU:2 (15:2:0) MC2_STATUS[-|CE|MiscV|-|-|-|-|CECC]: 0x98144000010c0176
syslog:Dec 27 07:09:46 carpdiem kernel: [33152702.203430] [Hardware Error]: MC2 Error: VB Data ECC or parity error.
syslog:Dec 27 07:09:46 carpdiem kernel: [33152702.203792] [Hardware Error]: cache level: L2, tx: DATA, mem-tx: EV
if things become worse or repetitive, is it possible to configure my kernel/system to just not use this one core?
Is the same CPU involved in every occurrence of these error messages?

There are tools that let you define which CPUs an individual process runs on but, AFAIK, they don't set up a means to disable a specific CPU globally. (Probably because the kernel folks don't recommend it.)
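
As an illustration of the per-process approach, here is a minimal Python sketch using `os.sched_getaffinity` and `os.sched_setaffinity` (Linux only). The helper names `cpus_excluding` and `avoid_cpu` are hypothetical, and CPU 2 is just the number from the log. This only restricts the calling process and whatever it starts afterwards; it does nothing for the kernel or for processes that are already running.

```python
import os


def cpus_excluding(available: set[int], bad_cpu: int) -> set[int]:
    """Return `available` without `bad_cpu`; refuse to hand back an empty set."""
    remaining = set(available) - {bad_cpu}
    if not remaining:
        raise ValueError("no CPU left after excluding CPU %d" % bad_cpu)
    return remaining


def avoid_cpu(bad_cpu: int, pid: int = 0) -> set[int]:
    """Restrict `pid` (0 means the calling process) to its allowed CPUs minus `bad_cpu`."""
    allowed = cpus_excluding(os.sched_getaffinity(pid), bad_cpu)
    os.sched_setaffinity(pid, allowed)
    return allowed


if __name__ == "__main__":
    # Keep this process, and children it forks later, off CPU 2.
    print(sorted(avoid_cpu(2)))
```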

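You could try the "isolcpus=2" boot-time kernel command-line parameter to keep CPU2 from being used by the scheduler. I think I'd use this as a last resort, though. See the file "/usr/src/linux/Documentation/admin-guide/kernel-parameters.txt" for more (on kernels older than 4.10, such as your 4.4.14, the file is "/usr/src/linux/Documentation/kernel-parameters.txt").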
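
After rebooting with that parameter, one way to confirm it took effect is to read the kernel's list of isolated CPUs. A minimal Python sketch follows. It assumes the kernel exposes "/sys/devices/system/cpu/isolated" (not every kernel version does), and the helper names `parse_cpu_list` and `isolated_cpus` are hypothetical.

```python
from pathlib import Path


def parse_cpu_list(text: str) -> set[int]:
    """Parse a kernel CPU list such as '2' or '0-3,5' into a set of CPU numbers."""
    cpus: set[int] = set()
    for part in text.strip().split(","):
        if not part:
            continue  # an empty file means no CPUs are listed
        lo, _, hi = part.partition("-")
        cpus.update(range(int(lo), int(hi or lo) + 1))
    return cpus


def isolated_cpus(path: str = "/sys/devices/system/cpu/isolated"):
    """Return the set of isolated CPUs, or None if the file cannot be read."""
    try:
        return parse_cpu_list(Path(path).read_text())
    except OSError:
        return None


if __name__ == "__main__":
    cpus = isolated_cpus()
    if cpus is None:
        print("this kernel does not expose the isolated CPU list")
    else:
        print("isolated CPUs:", sorted(cpus))
```

Note that isolcpus takes the CPU out of general scheduler balancing. It is not the same as powering the core off, so some per-CPU kernel work may still run there.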

BUT... if this is really infrequent and it's non-fatal -- it looks like the system successfully recovered from the error, after all -- I would just keep an eye on it for now. If it begins happening so frequently that your logs are growing like crazy, then try the "isolcpus" option.
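
To keep an eye on it, something like the following Python sketch could tally the "[Hardware Error]: CPU:n" lines per CPU, which would also show whether the same CPU is involved every time. The regular expression is an assumption based only on the log format quoted above, `count_errors_by_cpu` is a hypothetical helper, and the log path in the comment is a guess that may differ on your system.

```python
import re
from collections import Counter

# ASSUMPTION: matches the "[Hardware Error]: CPU:<n> " format seen in this thread.
CPU_RE = re.compile(r"\[Hardware Error\]: CPU:(\d+) ")

# Sample lines copied from the original post.
SAMPLE = [
    "Dec 27 07:09:46 carpdiem kernel: [33152702.203117] [Hardware Error]: Corrected error, no action required.",
    "Dec 27 07:09:46 carpdiem kernel: [33152702.203277] [Hardware Error]: CPU:2 (15:2:0) MC2_STATUS[-|CE|MiscV|-|-|-|-|CECC]: 0x98144000010c0176",
    "Dec 27 07:09:46 carpdiem kernel: [33152702.203430] [Hardware Error]: MC2 Error: VB Data ECC or parity error.",
    "Dec 27 07:09:46 carpdiem kernel: [33152702.203792] [Hardware Error]: cache level: L2, tx: DATA, mem-tx: EV",
]


def count_errors_by_cpu(lines) -> Counter:
    """Count machine-check reports per CPU number in an iterable of log lines."""
    counts = Counter()
    for line in lines:
        match = CPU_RE.search(line)
        if match:
            counts[int(match.group(1))] += 1
    return counts


if __name__ == "__main__":
    # For a real log, pass an open file instead, e.g. open("/var/log/syslog").
    for cpu, n in count_errors_by_cpu(SAMPLE).most_common():
        print(f"CPU {cpu}: {n} report(s)")
```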

HTH...
 
12-27-2019, 12:08 PM   #4
dnp
LQ Newbie
 
Registered: Jan 2018
Posts: 3

Original Poster
Rep: Reputation: Disabled
Quote:
Originally Posted by rnturn

You could try using the "isolcpus=2" boot time command line to keep CPU2 from being used by the scheduler. I think I'd use this as a last resort, though. See the file "/usr/src/linux/Documentation/admin-guide/kernel-parameters.txt" for more.

BUT... if this is really infrequent and it's non-fatal -- it looks like the system successfully recovered from the error, after all -- I would just keep an eye on it for now. If it begins happening so frequently that your logs are growing like crazy, then try the "isolcpus" option.
thank you, this sounds pretty much like what i was hoping existed.
and, yes, i hope it's infrequent or a random sunspot occurrence
 
  

