LinuxQuestions.org
Welcome to the most active Linux Forum on the web.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie
User Name
Password
Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question? If it is not in the man pages or the how-to's this is the place!

Notices


Reply
  Search this Thread
Old 04-02-2010, 02:57 PM   #1
nicklaszlo
LQ Newbie
 
Registered: Jan 2010
Posts: 14

Rep: Reputation: 0
Lenny Freezes overnight - need diagnosis help


I recently switched from Windows XP to stable Debian w/KDE on my work dell B120 laptop. I have been trying to diagnose freezes. In the evenings I have been leaving an SSH connection to my home computer. I leave a command running that writes the current time to a file every ten minutes. The past two nights it has stopped writing the time after a couple hours (not at the exact same time), and I am no longer able to ssh from my home computer to my work computer.

When I get back to work, the work laptop is frozen. CTRL+Alt+Backspace does not help, neither does CTRL+ALT+F1. Ctrl+Alt+SysRq works sometimes. It has not yet frozen while I have been present.

I tried disabling the screen saver and monitor power controls in KDE, thinking that those only come on when I am away, so they might be the cause. But it did not help.

I took the last time recorded by my ssh connection and looked through the logs trying to find something that happened at that time, but the only event occuring around the time the SSH connection dropped was a chron job that runs every hour. As far as I can tell, the chron job does not actually do anything but write to the log each hour.

So obviously I need to post more information. What would be helpful? Which log or configuration files might contain clues?

I see these lines in my logs:
kernel: Kernel logging (proc) stopped.
imklog 3.18.6, log source = /proc/kmsg started.
It appears at 6:25 AM each day between when the SSH connection drops and when I arrive in the morning. Do I need to restart logging?

I don't have a serial console.

Thanks for your help! My laptop runs much faster with Linux and the sound card (which probably had a hardware problem) works more frequently than it did with Windows XP.
 
Old 04-02-2010, 04:46 PM   #2
jsteel
Member
 
Registered: Mar 2007
Location: England
Distribution: Arch
Posts: 392

Rep: Reputation: 34
I would test your RAM with memtest86+, run a self-test on the hard drive with smartmontools and also read its SMART attributes. This will just rule out some possible hardware problems. If that looks OK, I would try another Linux distribution (maybe just a live CD if you don't want to wipe your current installation) at least to see if it is distribution specific.
 
Old 04-04-2010, 08:31 PM   #3
nicklaszlo
LQ Newbie
 
Registered: Jan 2010
Posts: 14

Original Poster
Rep: Reputation: 0
Replacement disks?

Long story short, the disk looks like it is failing.


sudo smartctl -a /dev/hda
(snip)
Error 6672 occurred at disk power-on lifetime: 17066 hours (711 days + 2 hours)
When the command that caused the error occurred, the device was active or idle.

After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 08 ca e9 30 e1 Error: UNC 8 sectors at LBA = 0x0130e9ca = 19982794

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 ca e9 30 e1 00 00:22:33.125 READ DMA
ca 00 08 4a a9 f0 e0 00 00:22:33.125 WRITE DMA
ca 00 08 fa 39 e9 e0 00 00:22:33.125 WRITE DMA
ca 00 08 c2 a7 e0 e0 00 00:22:33.125 WRITE DMA
ca 00 80 1f fd 19 e0 00 00:22:33.125 WRITE DMA

(snip)
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed: read failure 00% 17193 46039111


I have this hard disk. Anyone want to suggest a new replacement? Dell will only sell me used disks, even if I wanted to pay their inflated prices. Would this one be good?

Is there any chance this is a motherboard failure or failure of some other non-replaceable component?

Note: Can this thread be moved to hardware?
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
[SOLVED] memtester freezes lenny ericdanc Debian 4 02-02-2010 05:01 AM
[SOLVED] lenny freezes with certain apps ericdanc Debian 6 01-27-2010 06:58 AM
[SOLVED] HP DV6000 - Diagnosis Please? business_kid Linux - Hardware 3 11-08-2009 03:29 PM
Crash Diagnosis Ruler2112 Linux - Software 4 01-25-2005 01:17 PM
diagnosis? humanveal Linux - General 3 10-16-2002 08:32 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie

All times are GMT -5. The time now is 03:07 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration