LinuxQuestions.org
Visit Jeremy's Blog.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 06-09-2010, 09:40 AM   #1
KBriggs
Member
 
Registered: Jun 2009
Posts: 35

Rep: Reputation: 15
MPI issues


Hey all,

I am running MPI on 361 processors spead over 46 nodes. One of them is not working, and it kills the program immediately, but it won't tell me which one. Is there any way to find out which node is not communicating short of testing each one individually? (Which I did already, dammit)
 
Old 06-13-2010, 03:43 AM   #2
GroomedGoose
LQ Newbie
 
Registered: Jun 2010
Posts: 2

Rep: Reputation: 0
Hi KBriggs,
I was waiting to see if someone would come forward with an answer for you because I was curious too. I don't have an answer but just some ideas, based on what I found googling around (which I am sure you did already). On systems I have used the error logs would report which node crashed and the error it gave. It's supposed to do this: from the openmpi documentation for mpirun
Quote:
Process Termination / Signal Handling
During the run of an MPI application, if any rank dies abnormally
(either exiting before invoking MPI_FINALIZE, or dying as the result of
a signal), mpirun will print out an error message and kill the rest of
the MPI application.
nal, it is probably not necessary (and safest) for the user to only
clean up non-MPI state.
Which MPI implementation are you using?
If it is not telling you anything then you could try the verbose (--verbose) option to mpirun. If you get only the MPI rank but not the hostname, maybe you could add a statement earlier that prints to stdout on every node, the machine's hostname and mpi rank?
Sorry I can't be more helpful, but I will be interested to know if you find a good solution.
Cheers
Scott
 
Old 06-14-2010, 08:33 PM   #3
KBriggs
Member
 
Registered: Jun 2009
Posts: 35

Original Poster
Rep: Reputation: 15
Thanks for your reply Scott.

I actually happened to find out which one wasn't working more or less by luck, so I stopped trying to pursue this aftewards. So sadly I never did find a good solution to it. If it ever comes up again I will run with your suggestions and let you know what I figure out.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
MPI kmvinoth Programming 2 09-25-2009 07:35 AM
Mpi mkrems Programming 0 06-09-2008 07:18 PM
Mpi chandan_shetty Linux - Networking 1 05-10-2008 05:16 AM
Mpi chui_yap Programming 1 03-07-2006 07:22 AM
what do I need to know to use MPI kgustaf Linux - Networking 0 07-25-2005 04:13 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 07:29 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration