Debian box fell down go boom...
We have a Debian box and when it boots in Single-user (rescue mode), it comes up fine.
When it boots normally, udev and several RC scripts segfault. The logs are useless, I've checked messages, debug, dmesg and kern.log and NONE recorded the problems. Damnedest thing I have ever seen. Anyway, this box has been running for over a year and has alot of data on it... Is there anyway to repair the Debian installation without touching anything else? Is there a magic apt-get command??? |
I don't know how you would fix the operating system before you know the cause of the problem. Have you considered that you could have a hardware failure? The segfault says that a process is making an invalid request to access memory. The fact that this never happens when networking is turned off says that maybe a NIC is broken.
Try running memtest86 for a quick pass/fail. Also consider swapping a known good NIC into the machine. |
Besides, I thought you opted for reinstalling from scratch, then you just stopped replying, now a new thread? What gives?
|
Quote:
Quote:
If there's a way to re-install the base debian packages OR upgrade the packages it will probably work. If this was a Slackware box, I'd slap the install CDs in and do a base install (since I'm familiar with Slackware). I don't know how to do this with Debian. |
Quote:
You could try doing an apt-get upgrade but your application may depend on keeping the same version of Debian. Code:
apt-get update Maybe this is an opportunity for your business to find a new vendor for this application. You're already hosed so the worst thing that could happen already has happened. |
It sounds like you have some corruption on at least some of your shared libraries. Updating to a newer version of them would likely fix the problem but if this machine is running specific applications by a vendor you deal with then you're tied to whatever they're using and a change would likely break the system. I would be far less concerned about the machine and far more concerned about the vendor refusing to support an item they deployed. You can force installation of packages that are already installed... and you can get a list of installed packages doing a dpkg -l... and force the reinstall with apt-get --reinstall packagename, I'd try to isolate what programs were segfaulting and ldd them, find the related package, and force a reinstall.
|
I'm thinking the "apt-get update" may work. I am surprised that Debian has no "apt-get repair" feature.
I'll try it today and let you all know what happens. |
Quote:
|
apt-get does kinda have a 'repair' feature. I've used it a couple times when my hard drive on my old computer decided to lose some bits and corrupt a few files. The command to use is
Code:
# apt-get install --reinstall packagename Code:
# apt-file search file_name EDIT: It may also be useful to know that the above apt-get command can miss conf files, i.e. files in the /etc directory. In order to replace those files you need to pass an option to dpkg through apt-get like this Code:
# apt-get -o DPkg::Options::="--force-confmiss" install --reinstall packagename |
What gives ldd `which udevd`
Are you able to reproduce the segfault manually? If yes, try to run it through strace ? Quote:
|
All times are GMT -5. The time now is 12:05 AM. |