LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Networking (https://www.linuxquestions.org/questions/linux-networking-3/)
-   -   errors while testing cluster machines (MPICH) (https://www.linuxquestions.org/questions/linux-networking-3/errors-while-testing-cluster-machines-mpich-264356/)

nedian123 12-09-2004 03:27 AM

errors while testing cluster machines (MPICH)
 
After the extraction of MPICH in /usr/local

1- tcsh
2- ./configure ¡Vwith-comm=shared --prefix=/usr/local
3- make
4- make install
5- util/tstmachines
in the 5th step error was
Errors while trying to run rsh 192.168.0.25 ¡Vn /bin/ls /usr/local/mpich/mpich-1.2.5.2/mpichfoo unexpected response from 192.168.0.25

ƒÞ > /bin/ls: /usr/local/mpich/mpich-1.2.5.2/mpichfoo:
ƒÞ no such file or directory
The ls test failed on some machines.
This usually means that u donot have a common filesystem on all of the machines in your machines list; MPICH requires this for mpirun (it is possible to handle this in a procgroup file; see the¡K¡K)
Other possible problems include:-
The remote shell command rsh doesnot allow you to run ls.
See the doc abt remote shell & rhosts

You have common filesystem, but with inconsistent names
See the doc on the automounter fix
1 error were encountered while testing the machines list for LINUX
only these machines seem to be available
host1




now since this is only a two node cluster host1 is the server on to which MPICH is being installed. & 192.168.0.25 is the client¡K..
rsh on both nodes is logging freely¡K¡K.
On the server side the file ¡§ machines.LINUX ¡§ contains
-192.168.0.25
-host1
Kindly help

Akhtar


All times are GMT -5. The time now is 05:51 PM.