LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Debian (https://www.linuxquestions.org/questions/debian-26/)
-   -   X crashing when trying to load, NVIDIA and 2.6.29-2-amd64 (https://www.linuxquestions.org/questions/debian-26/x-crashing-when-trying-to-load-nvidia-and-2-6-29-2-amd64-726713/)

jimdaworm 05-18-2009 03:16 AM

X crashing when trying to load, NVIDIA and 2.6.29-2-amd64
 
Hi Everyone,

This may well be caused by me breaking somthing.

As I couldn't find the nvidia module for the current lenny kernel in the repos so I downloaded and attempted to install the driver using the Nvidia installer. This didnt work as it complained about a mismatch which I eventually figured out that had to do with the version of Nvidia-GLX from the repositories.

I downloaded the previous driver from NVIDIA and installed it, now the module appears to load:

Quote:

[ 54.278586] nvidia: module license 'NVIDIA' taints kernel.
[ 54.530316] nvidia 0000:04:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
[ 54.530322] nvidia 0000:04:00.0: setting latency timer to 64
[ 54.530477] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 180.44 Tue Mar 2
but when x tries to load I see the Nvidia logo and it crashes. I see the below in the xorg logs:

Quote:

Backtrace:
0: /usr/bin/X11/X(xf86SigHandler+0x7e) [0x80c91ce]
1: [0xf7f67400]
2: /usr/lib/xorg/modules/drivers//nvidia_drv.so [0xf67e2250]

Fatal server error:
Caught signal 11. Server aborting
What do you guys think?

the trooper 05-18-2009 03:36 AM

Let me get this right...
You installed Nvidia-glx from the repositories and then downloaded and ran the Nvidia installer?
The normal advice for using Nvidia's installer is that you must purge all Nvidia packages installed from the repo's
before running Nvidia's installer.

jimdaworm 05-18-2009 03:51 AM

Hi the trooper

When I remove nvidia glx using aptitude, I get the below:


Quote:

(II) LoadModule: "glx"
(II) Loading /usr/lib/xorg/modules/extensions//libglx.so
dlopen: /usr/lib/xorg/modules/extensions//libglx.so: wrong ELF class: ELFCLASS64
(EE) Failed to load /usr/lib/xorg/modules/extensions//libglx.so
(II) UnloadModule: "glx"
(EE) Failed to load module "glx" (loader failed, 7)
(II) LoadModule: "record"
(II) Loading /usr/lib/xorg/modules/extensions//librecord.so
(II) Module record: vendor="X.Org Foundation"
compiled for 1.4.2, module version = 1.13.0
Module class: X.Org Server Extension
ABI class: X.Org Server Extension, version 0.3
(II) Loading extension RECORD
(II) LoadModule: "dri"
(II) Loading /usr/lib/xorg/modules/extensions//libdri.so
(II) Module dri: vendor="X.Org Foundation"
compiled for 1.4.2, module version = 1.0.0
ABI class: X.Org Server Extension, version 0.3
(II) Loading extension XFree86-DRI
(II) LoadModule: "nvidia"
(II) Loading /usr/lib/xorg/modules/drivers//nvidia_drv.so
dlopen: /usr/lib/xorg/modules/drivers//nvidia_drv.so: wrong ELF class: ELFCLASS64
(EE) Failed to load /usr/lib/xorg/modules/drivers//nvidia_drv.so
(II) UnloadModule: "nvidia"
(EE) Failed to load module "nvidia" (loader failed, 7)

the trooper 05-18-2009 04:15 AM

Ok so you have removed all packages installed from the repositories?
Here's how i would do it:
Download the latest stable driver from Nvidia.(180.51).
Put the installer somewhere you can find it ie your /home directory.
Stop X,either from the command line or reboot into single user mode.
Run the command nvidia-installer --uninstall this will remove the current driver.
Then navigate to your home directory and run the installer with sh NVIDIAxxxx.
See how you go.

jimdaworm 05-18-2009 04:52 AM

Hi trooper,

I still get the same error

Quote:

Backtrace:
0: /usr/bin/X11/X(xf86SigHandler+0x7e) [0x80c91ce]
1: [0xf7f67400]
2: /usr/lib/xorg/modules/drivers//nvidia_drv.so [0xf67e2250]

Fatal server error:
Caught signal 11. Server aborting

X tries to load and I see the nvidia logo then it crashes.

the trooper 05-18-2009 04:57 AM

So what have you tried?
Did you remove all nvidia packages from the repo's?
Did you remove the downloaded nvidia driver?
Did you download and run the latest driver from Nvidia?

Talk me through what you have done already.

jimdaworm 05-18-2009 05:18 AM

Hi Trooper,

I removed everything Nvidia related using aptitude. I then downloaded the latest driver from Nvida.

I ran sh Nvidia(Name of the package) --uninstall

I then installed the driver.

Then ran modprobe nvidia (as it wasn't loaded)

startx and I got the error below. Let me know if I can give you any more info that may help :)

the trooper 05-18-2009 05:23 AM

Quote:

I ran sh Nvidia(Name of the package) --uninstall
That should have been nvidia-installer --uninstall to remove the first downloaded driver(180.44).
Run that command again,then try to reinstall the latest driver.

jimdaworm 05-18-2009 06:58 AM

Hi Trooper,

I have tried that but it doesn't seem to have made any difference.

the trooper 05-18-2009 07:09 AM

When you ran the uninstall command did it launch the Nvidia text based interface?

the trooper 05-18-2009 07:25 AM

Quote:

Fatal server error:
Caught signal 11. Server aborting
On Googling the error it points to Debian's nvidia-glx and the Nvidia downloaded driver being installed.
Have a look at the following link,specifically the last post:

http://www.linuxquestions.org/questi...orting-256896/

You must remove all other Nvidia libraries and drivers before running the downloaded Nvidia driver.

jimdaworm 05-18-2009 01:40 PM

Hi Trooper,

I dont have anything with the world nvidia in it installed according to aptitude.

Anyway I think I am going to take advantage of this to re-install a purely 32 bit system and not so bleeding edge as its been getting a bit annoying lately having to fix lots of issues.

Thanks for your time.

the trooper 05-18-2009 02:10 PM

Thanks for the feedback Jimdaworm.
If you want less 'bleeding edge' give Testing a try.
Don't forget to post if you need help in the future.


All times are GMT -5. The time now is 10:42 PM.