Sid stability issue resolved! Quick Amarok database, X display permission questions.
I recently installed Debian Sid over an Ubuntu partition because I had become frustrated with stability problems in Ubuntu. Not coincidentally, I first installed Ubuntu some months ago to replace a Debian Sid install which was also having stability problems. Unfortunately, the new Debian install is having similar problems. Perhaps this is a hardware problem? I've run memtest86+ overnight and found no errors, not sure what other hardware tests to do. I will describe my symptoms below:
I'm using all the latest packages from the Sid repositories. Various programs (Nicotine, Gaim, Kopete, vorbisgain, slimserver when scanning for music) routinely crash after about 10s or so. When running these programs from the command line I will usually see either a 'segfault' error or none at all. Other programs (konqueror, amarok, superkaramba) live longer, perhaps an half a hour or so, but also frequently crash while I'm using them. Some programs have no observable or unexpected stability issues (K3B, konsole, opera). During times of high load (ripping a CD, launching apps) random apps seem prone to crashing, particularly kicker and kwin, and occasionaly the xserver will restart. High load seems to exacerbate the aforementioned stability problems, as well. Most distressing is that xorg occasionally crashes hard. When this happens I will be able to move the cursor, but no keyboard or mouse input will be responded to (I cannot ctrl+alt+backspace to restart x) and the screen will freeze, so I have to hard reboot. I'm using the 8178 nvidia drivers and the xorg server with a Geforce 5900. I understand that Sid is unstable, but I don't believe this behaviour is normal. Thanks for any help! |
No that is not normal behavior, segfault errors are usually memory related but I see you have run the memtest and it passed. Do you have another machine you can take and put the hard drive in to run for a while? Things I would try would be testing the hard drive with your manufacturers diagnostic software if overclocking the CPU put it back to default settings. If using fastwrites/side banb addresing for the nVidia card turn it off if using the renderaccel in your X config disable it. Try going to the console stopping the X server and compiling a kernel 5 or 10 times in a row to eliminate cpu/memory problems. Check the temperatures of the CPU and Video card on an ongoing basis. Have a console window with top running at all times you can see to check if a particular process seems to be using a lot of cpu power when it hangs/segfaults. What is the hardware you have in the machine besides the 5900?
|
Linux 2.6.15-1-K7
Hardware: Athlon XPM 2400+ OCed to 2.3GHZ 1GB of WINTEC DDR RAM (refurbished, IIRC) A-Bit NF7 mobo 250GB Maxtor HDD with two Reiser3 partitions I was concerned that the overclocked CPU might be a problem, but I dismissed it once it passed the memtest. Also, this box runs LiveCDs without any apparent stability issues (which, I suppose, points to either the HDD or software as the problem). Should I still try turning that off for a while? I had the 5900 OCed to 5950 speeds, but I just disabled that. We'll see if that helps the X problems, but I'm doubtful because I had it OCed under the old Ubuntu setup, as well, and I wasn't having problems with X crashing then. I took a look for 'renderaccel' in xorg.conf and it wasn't there, so I assume it's not enabled. I'm not familiar with fastwrites/side banb addresing, is that something that would be set in the BIOS? Running hard drive diagnostic tools sounds interesting, I hadn't considered the HDD as a point of failure. How do I do that, exactly? Is there a Debian package? You mention the manufacturer's tools, how are those used? I do have another box I could transplant the HDD into, but I can't afford to spare it at the moment. I might try that later this week. |
Quote:
Quote:
Quote:
Quote:
Quote:
|
Alright, I will try a lower kernel. I see that you're using the Kolivas kernel, is that the one with userspace preempting (or something fancy-sounding like that) to make GUIs feel more responsive? How do you like it? Would you recommend it? Did you compile it yourself, or are there Debian packages for it somewhere?
Thanks for all the help! |
How hot does your system run? The combo of high CPU load + overclock + heat may be your problem. Two out of three may not trigger the problem. e.g., running from a LiveCD may not ever generate the "high CPU load" part of the equation because the CPU is always waiting on CDROM IO and never gets around to heating itself up.
Check your temps, stop overclocking, and see if that helps. I would recommend testing this hypothesis, not theorizing/rationalizing it out of existence without some physical test. Heat is well known to cause instability. CPU load causes heat. Overclocking causes heat. Overclocking can cause stability issues of it's own. When you're troubleshooting and hear hoofbeats, think horses, not zebras. Make sure you've really ruled out the common causes before advancing on to more exotic possibilities. Kernels, video drivers, etc. could certainly be playing a part here. But I personally doubt it. My bet goes for mundane problems with heat/overclocking. Especially since this problem has been plagueing you across different distros, most likely these were using different kernels and video drivers already. |
Quote:
|
I would agree with haertig here, Put that CPU back down to it's designated clock rate and monito the system. Until you do the basics you are most likely wasting your time chasing ghosts. 9 times out of 10 it's the simple solutions that we skip or overlook that are causing the problem.
My Sid machine is rock steady, it has never crashed, the only time it ever went down hard was due to a power outage.. |
Alright, I throttled the clock settings back to the factory defaults and, after about 10 minutes, everything seems to be stable! I'll wait a day or two before I'm satisfied, but so far I'm very pleased!
It's beyond me why the stability issues that were showing up at the OS level (presumably) due to overheating weren't a problem during the memtest, which I believe does some pretty CPU-stressing tests, but I suppose it doesn't matter. Thanks to all those who helped! While I'm here, there are a couple other problems I've been having:
|
Quote:
|
Quote:
It's OK to post multiple threads. One topic per thread. You'll get much better help. Use appropriate subject lines so you attract the right people. For example, a subject line of exactly what you said before: "Root console cannot launch X apps" will do you MUCH better than "New to Debian - need some help!" [edit] Oh, I now see that you changed the subject line of your original thread. I must have missed that before. However, I still recommend starting new threads for new problems rather than trying to redirect an old thread with a new subject line. I'd say, don't edit your subject line unless it's so glaringly obvious that it was terrible to start with. Your original subject line for this thread - I can't remember exactly what it was now - was quite good, if I remember correctly. [/edit] |
Quote:
Quote:
Code:
## allows me to run an X program as root |
All times are GMT -5. The time now is 01:49 AM. |