Linux - SoftwareThis forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
Hi,
I'm trying to run some software parallelised through mpich2.
I have the daemon running (mpd).
I then launch the software: nohup mpiexec -n 8 /path/to/software/softwarename.ex > out 2> err &
and when I check with "ps aux" I can see I have 8 copies of the software running and 8 listings of mpd, however according to "top", these processes are running at 0%, and indeed there is no output - it just hangs.
This happens on some computers, but not on others, so I can rule out a problem with the software itself.
Update:
It seems if I specify "-n 1" I can get it running on one processor, but any more and nothing happens. So maybe it's something to do with mpich2?
Does anyone know what's wrong? Until I fix this I can barely do any calculations, so I'm quite desperate to get it working. I've tried restarting the computer, and also re-installing Mpich2 using shm instead of nemesis (since it's simply running on a multi-core computer) but without any change in the situation.
Thanks a lot for your suggestion, but in this case, ssh is working for the computer.
Additionally, since I'm running on a multi-core processor, all spawns of the software occur on one processor, on the same computer, and ssh is not involved in this case.
For future reference: I did a work-around by deleting mpich2 and installing openmpi instead... it requires re-compilation of any software that was originally compiled for mpich2, but once that's done it works fine, and apparently without the need for an mpd like mpich2 needed.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.