LinuxQuestions.org
Review your favorite Linux distribution.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Debian
User Name
Password
Debian This forum is for the discussion of Debian Linux.

Notices


Reply
  Search this Thread
Old 02-16-2009, 06:22 PM   #1
arfon
Member
 
Registered: Apr 2004
Location: Texas
Distribution: Slackware
Posts: 178

Rep: Reputation: Disabled
Debian box fell down go boom...


We have a Debian box and when it boots in Single-user (rescue mode), it comes up fine.

When it boots normally, udev and several RC scripts segfault. The logs are useless, I've checked messages, debug, dmesg and kern.log and NONE recorded the problems.

Damnedest thing I have ever seen.

Anyway, this box has been running for over a year and has alot of data on it...

Is there anyway to repair the Debian installation without touching anything else?

Is there a magic apt-get command???

Last edited by arfon; 02-16-2009 at 11:14 PM.
 
Old 02-16-2009, 08:35 PM   #2
stress_junkie
Senior Member
 
Registered: Dec 2005
Location: Massachusetts, USA
Distribution: Ubuntu 10.04 and CentOS 5.5
Posts: 3,873

Rep: Reputation: 334Reputation: 334Reputation: 334Reputation: 334
I don't know how you would fix the operating system before you know the cause of the problem. Have you considered that you could have a hardware failure? The segfault says that a process is making an invalid request to access memory. The fact that this never happens when networking is turned off says that maybe a NIC is broken.

Try running memtest86 for a quick pass/fail. Also consider swapping a known good NIC into the machine.

Last edited by stress_junkie; 02-16-2009 at 08:44 PM.
 
Old 02-16-2009, 08:37 PM   #3
unSpawn
Moderator
 
Registered: May 2001
Posts: 29,394
Blog Entries: 55

Rep: Reputation: 3565Reputation: 3565Reputation: 3565Reputation: 3565Reputation: 3565Reputation: 3565Reputation: 3565Reputation: 3565Reputation: 3565Reputation: 3565Reputation: 3565
Besides, I thought you opted for reinstalling from scratch, then you just stopped replying, now a new thread? What gives?
 
Old 02-16-2009, 11:12 PM   #4
arfon
Member
 
Registered: Apr 2004
Location: Texas
Distribution: Slackware
Posts: 178

Original Poster
Rep: Reputation: Disabled
Quote:
I don't know how you would fix the operating system before you know the cause of the problem. Have you considered that you could have a hardware failure? The segfault says that a process is making an invalid request to access memory. The fact that this never happens when networking is turned off says that maybe a NIC is broken.
Not a HW problem, I swapped the disc into a known good box and it does the same thing. The HW also boots and runs a Live CD just fine.


Quote:
Besides, I thought you opted for reinstalling from scratch, then you just stopped replying, now a new thread? What gives?
I was until we found out that we can't just re-install. The company that made this server will not support it and wants us to buy a whole new box. I can get the OS on it but not the specialized applications that are also on it.

If there's a way to re-install the base debian packages OR upgrade the packages it will probably work.

If this was a Slackware box, I'd slap the install CDs in and do a base install (since I'm familiar with Slackware). I don't know how to do this with Debian.
 
Old 02-17-2009, 08:57 AM   #5
stress_junkie
Senior Member
 
Registered: Dec 2005
Location: Massachusetts, USA
Distribution: Ubuntu 10.04 and CentOS 5.5
Posts: 3,873

Rep: Reputation: 334Reputation: 334Reputation: 334Reputation: 334
Quote:
Originally Posted by arfon View Post
Not a HW problem, I swapped the disc into a known good box and it does the same thing. The HW also boots and runs a Live CD just fine.
Okay. I agree it's not hardware. But having said that I think that you are SOL. I hate to say it.

You could try doing an apt-get upgrade but your application may depend on keeping the same version of Debian.
Code:
apt-get update
apt-get upgrade
I'd make an image of the system first using partimage. But, as I said, I doubt that this is the answer.

Maybe this is an opportunity for your business to find a new vendor for this application. You're already hosed so the worst thing that could happen already has happened.

Last edited by stress_junkie; 02-17-2009 at 09:01 AM.
 
Old 02-17-2009, 09:14 AM   #6
rweaver
Senior Member
 
Registered: Dec 2008
Location: Louisville, OH
Distribution: Debian, CentOS, Slackware, RHEL, Gentoo
Posts: 1,833

Rep: Reputation: 164Reputation: 164
It sounds like you have some corruption on at least some of your shared libraries. Updating to a newer version of them would likely fix the problem but if this machine is running specific applications by a vendor you deal with then you're tied to whatever they're using and a change would likely break the system. I would be far less concerned about the machine and far more concerned about the vendor refusing to support an item they deployed. You can force installation of packages that are already installed... and you can get a list of installed packages doing a dpkg -l... and force the reinstall with apt-get --reinstall packagename, I'd try to isolate what programs were segfaulting and ldd them, find the related package, and force a reinstall.

Last edited by rweaver; 02-17-2009 at 09:23 AM.
 
Old 02-17-2009, 09:18 AM   #7
arfon
Member
 
Registered: Apr 2004
Location: Texas
Distribution: Slackware
Posts: 178

Original Poster
Rep: Reputation: Disabled
I'm thinking the "apt-get update" may work. I am surprised that Debian has no "apt-get repair" feature.


I'll try it today and let you all know what happens.
 
Old 02-17-2009, 12:23 PM   #8
rweaver
Senior Member
 
Registered: Dec 2008
Location: Louisville, OH
Distribution: Debian, CentOS, Slackware, RHEL, Gentoo
Posts: 1,833

Rep: Reputation: 164Reputation: 164
Quote:
Originally Posted by arfon View Post
I'm thinking the "apt-get update" may work. I am surprised that Debian has no "apt-get repair" feature.


I'll try it today and let you all know what happens.
Just make sure you have good backups... and you might want to make sure you're not set on "stable" "testing" "unstable" unless you're sure you want to upgrade releases (esp an issue since lenny just hit stable.)

Last edited by rweaver; 02-17-2009 at 12:24 PM.
 
Old 02-19-2009, 02:52 AM   #9
ebmi
LQ Newbie
 
Registered: Oct 2006
Posts: 18

Rep: Reputation: 0
apt-get does kinda have a 'repair' feature. I've used it a couple times when my hard drive on my old computer decided to lose some bits and corrupt a few files. The command to use is

Code:
# apt-get install --reinstall packagename
The only problem is you have to know which file belongs to which package. You can find this out with the apt-file command,

Code:
# apt-file search file_name
This will return the package(s) that contain the given file name.

EDIT:
It may also be useful to know that the above apt-get command can miss conf files, i.e. files in the /etc directory. In order to replace those files you need to pass an option to dpkg through apt-get like this

Code:
# apt-get -o DPkg::Options::="--force-confmiss" install --reinstall packagename

Last edited by ebmi; 02-19-2009 at 03:14 AM.
 
Old 02-19-2009, 04:53 AM   #10
nx5000
Senior Member
 
Registered: Sep 2005
Location: Out
Posts: 3,307

Rep: Reputation: 57
What gives ldd `which udevd`

Are you able to reproduce the segfault manually?
If yes, try to run it through strace ?

Quote:
The logs are useless, I've checked messages, debug, dmesg and kern.log and NONE recorded the problems.
Probably because your disk are not yet write enabled at this time.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Foot fell asleep colinstu General 13 04-17-2006 04:52 PM
Cannot ssh to debian box from Gentoo box or Putty, but I can from Cygwin? Pengus Linux - Networking 2 01-20-2006 01:47 PM
Server fell over :/ JF1980 Linux - Software 2 12-08-2005 07:56 AM
box goes 'boom' , please help - Loading Linux........ limpimpin Linux - Software 3 01-23-2002 07:26 PM
i fell asleep at my desk... Jeffrey Linux - Newbie 14 03-16-2001 12:27 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Debian

All times are GMT -5. The time now is 06:51 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration