LinuxQuestions.org
Share your knowledge at the LQ Wiki.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 03-26-2006, 12:43 PM   #1
ScottReed
Member
 
Registered: Dec 2005
Location: Montana
Distribution: Debian "squeeze"
Posts: 157

Rep: Reputation: 30
Question Complete system freeze-up


Slackware 10.2 - Kernel 2.6.14.6

I have been experiencing intermmitent freeze-ups of my system as of late.

At first they were spread far apart, weeks basically could go by and my system would be fine, then one day i'd come to the PC and the screensaver would be hung, and system would be un-responsive. I could NOT force-quit X, switch to a virtual console, reboot cleanly, etc.

So I would reboot, force a file system check and reboot again and then go back about my business. Weeks would pass and it would crash again.

Now the crashes are within days, and just yesterday it crashed twice.

No information in /var/log/syslog or /var/log/messages

Does anyone know how I can go about finding or activating some form of memory dump when my system does crash so I can trouble shoot the cause?

Steps i've taken so far:
- full memory test with MemTest 86+
- full file system checks with e2fsck
- full hard disk check with MaxBlast software
- closed every running application in X before initilizing screensaver

Steps i'm going to take today:
- recompile kernel with newer sources
- close X windows and leave machine running at console (maybe the problem is with X)

I'm leaning towards a kernel issue though, based on the fact that the entire system locks.

Thanks,
Scott

Last edited by ScottReed; 03-26-2006 at 12:44 PM.
 
Old 03-26-2006, 03:31 PM   #2
J.W.
LQ Veteran
 
Registered: Mar 2003
Location: Boise, ID
Distribution: Mint
Posts: 6,642

Rep: Reputation: 87
One other thing to check - what are your system temperatures? Your symptoms could be the result of increased dust accumulation and/or a failing fan. I'd recommend checking that the CPU fan in particular is spinning normally, and the the heat sink is not obstructed with dust. Obviously this may not be the true cause, but it's something to check. Good luck with it
 
Old 03-26-2006, 04:07 PM   #3
Electro
LQ Guru
 
Registered: Jan 2002
Posts: 6,042

Rep: Reputation: Disabled
Buy a new power supply. Everybody that I know and read on this forum use cheap ass power supplies. I am using Seasonic S12-430 (430 watts).

X Window System crashes a lot and it does not matter what X11 (XFree86 or Xorg), so setup ssh to terminate X11 instead of rebooting. Use kill -15 [pid number] instead of kill -9 [pid number] because the KILL can actually corrupt the program.

I do not use screen savers because the display goes blank within 10 to 15 minutes after no user activity. I think I have it set by the BIOS.
 
Old 04-01-2006, 07:18 PM   #4
ScottReed
Member
 
Registered: Dec 2005
Location: Montana
Distribution: Debian "squeeze"
Posts: 157

Original Poster
Rep: Reputation: 30
Thumbs up Figured out the problem

Leaving my machine for the night out of X windows and at the console allowed the kernel to dump to the screen. I gathered the information it spit out and come to find out the crashes were related to SMP.

This all makes sense now. For the past few months I was randomly seeing the following message in /var/log/messages:

Code:
MCE: The hardware reports a non fatal, correctable incident occurred on CPU 0
I didn't think much about it, but as the freezes became more prolific I noticed that the amount of these errors I was recieving increased.

I disabled SMP in my kernel, not sure why it was on in the first place, recompiled and everything has been good for about a week now. No more MCE errors.
 
Old 04-02-2006, 03:06 AM   #5
J.W.
LQ Veteran
 
Registered: Mar 2003
Location: Boise, ID
Distribution: Mint
Posts: 6,642

Rep: Reputation: 87
Excellent - congrats on solving the problem and thanks for posting back with the update ScottReed
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Complete Freeze jakkals Linux - Software 2 11-18-2005 11:26 PM
Looking for a complete Linux System AndeAnderson Linux - General 5 07-21-2005 02:21 PM
please help - complete system loss caraphor Linux - Newbie 4 02-02-2004 03:00 PM
Complete system failure twinkers Mandriva 6 09-25-2003 04:27 PM
Complete System Backup top111 Linux - Newbie 1 08-20-2001 06:31 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 07:45 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration