Linux - SoftwareThis forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Gigabyte 890FXA-UD5
AMD PhenomII 1075T
2 x GTX-470
4 x Patriot DIMM 4 GB 1600MHx PC3-12800
Debian wheezy amd64
with
nvidia driver 275.09.07-1 compiled with Debian for linux-headers-2.6.38-2
is used for
(a) number crunching or - alternatively -
(b)graphic examination of results with CUDA-enabled viewer (VMD).
I am getting problems with both tasks. For (a), the best I found was from the linux prompt (without calling the X server)
# nvidia-smi -L
# nvidia-smi -pm 1
$ launch the parallelized code
$ monitor what happens from a ssh-linked desktop (the terminal of the gtx470 machine is as if it were hanged).
When completed (one-two days, a 20-100 GB output file), reboot from the shh-linked desktop and launch the graphic program, orking on the gtx470 computer itself. VMD creates its CUDA environment with the cards and normally it worked fine (it could not be launched without first rebooting).
Possibly after update/upgrade, VMD launching now hangs the system.
I have tried in alternative the desktop xfce4 with same problems. Removed xfce4. Then KDE, which was the worst and was removed, hopefully (not sure) removing everything with
I relied on the experience of the vendor, specialized in gaming computers.
If really only one card is supported, could you advise if a gtx-580 is supported by that mainboard? Or could you suggest an alternative mainboard to support the hardware that I posted ?
AMD PhenomII 1075T
2 x GTX-470
4 x Patriot DIMM 4 GB 1600MHx PC3-12800
********************
What I can say is that the simulation is severely demanding, based on a truly CUDA-parallelized code (NAMD) dealing with a system of over 250,000 atoms in my case. This means that the molecular dynamics work is spread over the cpu/gpu components (it does not deal of a simple launching of separate jobs) and it would hardly work if there is a hardware problem.
During the simulation, nvidia-smi queries revealed both cards and their temperature (not their % involvement, which probably could only be revealed with tesla cards), while "top -i" revealed all six cpus at work.
reports mem% usage for both gtx-470 while carrying out the simulation. In addition, for these simulation, the ratio cpu/gpu 6:2 is correct (must be at least 2:1). cpus carry out only energy calculations, most of the laod is to the gpus.
Also, before current nvidia driver 275.09.01-1, I worked for a week with 270.41.19-1 without any warning.
I asked to the debian amd64 site if problems were encountered on upgrading to 275.09.01-1 without getting any answer.
For one thing, that is a Crossfire board and I don't believe that the chipset AMD 890FX/SB850 supports dual Nvidia cards.
CUDA isnt SLI. You dont need SLI support for CUDA to work with multipule video cards. BTW, the SLI patch has been known to work with 890FX chipsets.
Quote:
Originally Posted by chiendarret
Also, before current nvidia driver 275.09.01-1, I worked for a week with 270.41.19-1 without any warning.
That makes me think its just an issue with the 275.09.01-1 driver, not your hardware. You could try getting the drivers from 'sid' (currently 275.09.07-5), or maybe even experimental if you were game. Or you could manually install 270.41.19 drivers. There might even be aq way to rollback to the 270.41.19-1 drivers as well, but I dont know how.....if its even possible.
I was wondering if your built that system chiendarret, sorry to hear its not playing nicely with the newest debian 'testing' drivers.
Please see also my thread "automount in xfce4 or gnome2" for the same computer. There is information that may closely relevant to the problems raised here for the gtx-470 cards and they are not in line with problems with the driver. Possibly there is something else that was not yet grasped.
Separately, I have asked to debian about how to safely install the nvidia driver from "unstable" or recover the older driver. Hope they will answer.
With the new nvidia-kernel-dkms 275.09.07-5 in Debian GNU-Linux amd64 wheezy, all described problems vanished. Thus, the suspicion that problems (absent with driver 270.41.19-1) were introduced by driver 275.09.07-1 was correct.
chiendarret
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.