LinuxQuestions.org
Download your favorite Linux distribution at LQ ISO.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Fedora
User Name
Password
Fedora This forum is for the discussion of the Fedora Project.

Notices


Reply
  Search this Thread
Old 02-02-2015, 03:12 AM   #1
terry-duell
Member
 
Registered: Jan 2007
Location: Melbourne, Australia
Distribution: Fedora 38 x86_64
Posts: 539

Rep: Reputation: 59
Non reproducible errors compiling source


Hello All,
I am running Fedora 21 x86_64, on a Haswell refresh i7-4790, Asus Z97M-plus MB.
I often build a number of packages from source(regular test builds from repos), and have been getting what appear to be random errors, since moving to this hardware. I never saw this problem on my old hardware running Fedora 20, building the same packages.
The errors are always of the form...

"internal compiler error: Segmentation fault
Please submit a full bug report,
with preprocessed source if appropriate.
The bug is not reproducible, so it is likely a hardware or OS problem."

and usually (but not always) show up when using multiple CPUs.
When this error occurs, I can start from scratch (ie clean build dir) and the project may succeed or may fail again, at a different place. If I persist I can always get a successful build.
Generally, if I run the build process using only one CPU, it will succeed.
I suspected a memory problem, and ran a memory test which had 100% success.
All of the projects use cmake.
I am using cmake-3.0.2-2.fc21.x86_64, make-4.0-3.fc21.x86_64, and gcc-4.9.2-1.fc21.x86_64.

Submitting a bug report is a bit problematic as these are large projects, and due to the random nature of the error and the lack of information, does seem like a bit of wild goose chase.

I would really like to get a better feel for the possible causes before going any further.

Any suggestions?

Cheers,
Terry
 
Old 02-02-2015, 03:55 AM   #2
evo2
LQ Guru
 
Registered: Jan 2009
Location: Japan
Distribution: Mostly Debian and CentOS
Posts: 6,724

Rep: Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705
Hi,

sounds like bad ram to me too. Which memory test did you run?

Evo2.
 
Old 02-02-2015, 03:31 PM   #3
terry-duell
Member
 
Registered: Jan 2007
Location: Melbourne, Australia
Distribution: Fedora 38 x86_64
Posts: 539

Original Poster
Rep: Reputation: 59
Quote:
Originally Posted by evo2 View Post
Hi,

sounds like bad ram to me too. Which memory test did you run?

Evo2.
I ran memtest86+ 5.1. I think it ran all the test suite.

Cheers,
Terry
 
Old 02-02-2015, 08:41 PM   #4
terry-duell
Member
 
Registered: Jan 2007
Location: Melbourne, Australia
Distribution: Fedora 38 x86_64
Posts: 539

Original Poster
Rep: Reputation: 59
I ran the memory tests again.
I ran all tests, parallel (all CPUs).
The tests took about 2.5 h (16GB), and the results are not at all clear.
On my system memtest86+ presents with a dreadful interface with some of it's display off screen, and I see no way of altering this.
When the tests were completed there were errors, but due to the screen display, it is unclear where they had occurred.
I have 2 x 8GB memory sticks, and it would be useful if memtest86+ could tell me which of those had errors.
Does memtest86+ also test CPU cache memory? From what I saw of the report it isn't clear to me whether the errors occurred in cache or RAM.
Any advice on how to configure the memtest86+ display, or other methods of testing that are less problematic?

Cheers,
Terry
 
Old 02-02-2015, 10:39 PM   #5
evo2
LQ Guru
 
Registered: Jan 2009
Location: Japan
Distribution: Mostly Debian and CentOS
Posts: 6,724

Rep: Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705
Hi,

sorry I'm no expert on memtest86+, but a sure fire way to figure out if it is one stick or the other is to just put only one in and then run the test etc.

Evo2.
 
Old 02-02-2015, 11:32 PM   #6
veerain
Senior Member
 
Registered: Mar 2005
Location: Earth bound to Helios
Distribution: Custom
Posts: 2,524

Rep: Reputation: 319Reputation: 319Reputation: 319Reputation: 319
Errors in cpu or cpu cache is very unlikely. There are to ram testers. One memtest86 and other memtest86+. Try both.
 
Old 02-03-2015, 12:03 AM   #7
terry-duell
Member
 
Registered: Jan 2007
Location: Melbourne, Australia
Distribution: Fedora 38 x86_64
Posts: 539

Original Poster
Rep: Reputation: 59
Quote:
Originally Posted by evo2 View Post
Hi,

sorry I'm no expert on memtest86+, but a sure fire way to figure out if it is one stick or the other is to just put only one in and then run the test etc.

Evo2.
Yes, that is a possible approach, although I may fall foul of the rotten display...ie the results I need to see being off screen.
I'll try this approach and see how I go.

Cheers,
Terry
 
Old 02-03-2015, 12:04 AM   #8
terry-duell
Member
 
Registered: Jan 2007
Location: Melbourne, Australia
Distribution: Fedora 38 x86_64
Posts: 539

Original Poster
Rep: Reputation: 59
Quote:
Originally Posted by veerain View Post
Errors in cpu or cpu cache is very unlikely. There are to ram testers. One memtest86 and other memtest86+. Try both.
I was under the impression that memtest86 was proprietary.
I'll have another look.

Cheers,
Terry
 
Old 02-03-2015, 02:12 AM   #9
veerain
Senior Member
 
Registered: Mar 2005
Location: Earth bound to Helios
Distribution: Custom
Posts: 2,524

Rep: Reputation: 319Reputation: 319Reputation: 319Reputation: 319
Well memtest86 was initially open source. Then it changed closed source with version 5 as I remember. Though source code for version 4.3 or perhaps 4.7 is available.

Though they provide a free binary of memtest86.
 
Old 02-03-2015, 03:26 PM   #10
terry-duell
Member
 
Registered: Jan 2007
Location: Melbourne, Australia
Distribution: Fedora 38 x86_64
Posts: 539

Original Poster
Rep: Reputation: 59
The plot thickens.
The tests I have done to date have actually been with memtest86, not sure how I got that mixed up.
I got hold of memtest86+ and tried that. It gives a much better display, everything on screen and readable.
My MB has four slots, L to R they are labelled A1, A2, B1, B2.
The advice is for one memory stick, use slot A2. For two sticks, use A2, B2.
I have tested each of my two 8GB sticks individually in slot A2 and neither give any errors.
When I have both sticks in slots A2, B2, I get about 260 errors.
So, what might be going on here? Could the problem be related to slot B2 in the MB?
I need some means of determining if the problem is memory or MB.
What consequences, if any, of fitting the sticks in slots A1, B1, to see if any errors detected with that setup?

Cheers,
Terry
 
Old 02-03-2015, 09:08 PM   #11
terry-duell
Member
 
Registered: Jan 2007
Location: Melbourne, Australia
Distribution: Fedora 38 x86_64
Posts: 539

Original Poster
Rep: Reputation: 59
...and thickens again!
Pondering on the results I have reported, I thought it wise to have a look at the BIOS settings.
The memory sticks were purchased as as "16 GB kit 2400 (2x8GB)", and I recall that whilst the BIOS detected the ram as 16GB 2400, it also reported the ram as 1333 MHz.
In the BIOS the memory frequency was set at 2400, so I reset to "auto".
Another run through with memtest86+ with both sticks installed reports no errors. probably should let it run multiple runs but I need the system for work.
So, the problem may be solved, I'll have to see if the unreproducable errors crop again when compiling. Here's hoping.

Cheers,
Terry
 
Old 02-03-2015, 09:50 PM   #12
evo2
LQ Guru
 
Registered: Jan 2009
Location: Japan
Distribution: Mostly Debian and CentOS
Posts: 6,724

Rep: Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705Reputation: 1705
Hi,

excellent, sounds like you may have solved it. Let us know if you see any more problems.

Evo2.
 
Old 02-03-2015, 10:16 PM   #13
veerain
Senior Member
 
Registered: Mar 2005
Location: Earth bound to Helios
Distribution: Custom
Posts: 2,524

Rep: Reputation: 319Reputation: 319Reputation: 319Reputation: 319
So it's probably a overclocking issue. Setting it to auto in BIOS solved it.

Can you see at what speed it's running in memtest.

Overclocking can damage cpus or ram.

Overclocking also affects stability of rams. So errors in operation of ram may be produced.
 
Old 02-04-2015, 03:26 PM   #14
terry-duell
Member
 
Registered: Jan 2007
Location: Melbourne, Australia
Distribution: Fedora 38 x86_64
Posts: 539

Original Poster
Rep: Reputation: 59
Quote:
Originally Posted by veerain View Post
So it's probably a overclocking issue. Setting it to auto in BIOS solved it.

Can you see at what speed it's running in memtest.

Overclocking can damage cpus or ram.

Overclocking also affects stability of rams. So errors in operation of ram may be produced.
If I recall correctly memtest reported the ram to be running at 2400 when the BIOS was set for 2400, and at 1333 when set to 'auto".
It hadn't been set to be overclocked as the ram was specified, as I said, as 2400, not 1333, but I just dredged up Kingston's specs for this ram, and whilst tested to run at 2400, 1.65v, are "...programmed to run at 1333 1.5v", so suspect that the MB doesn't like running the ram at 1.65v.

Cheers,
Terry
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
LXer: If you want reproducible science, the software needs to be open source LXer Syndicated Linux News 0 02-27-2012 09:40 AM
Open Suse 10.1 Compiling source - gcc errors jmgleason SUSE / openSUSE 4 08-05-2006 01:01 PM
Getting lots of errors when compiling sip from source. What do they mean? BillWho? Linux - Newbie 6 05-04-2006 07:13 AM
Errors compiling Highpoint drivers from source under Fedora... The Belgain Linux - Hardware 0 12-01-2004 05:53 PM
Various Compiling Errors (GCC compiling Openal, GUIlib, xmms-wma) gregorya Linux - Software 2 08-27-2004 05:03 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Fedora

All times are GMT -5. The time now is 11:08 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration