Hello everyone,
I have a computer cluster that runs CentOS and uses SGE as its queue manager. Recently, running qstat -f started reporting every queue instance in the au state ((a)larm, (u)nreachable), as shown below. From this I understand that the problem is a connection issue between the master and the compute nodes.
queuename                      qtype resv/used/tot. load_avg arch          states
---------------------------------------------------------------------------------
all.q@compute-0-0.local        BIP   0/0/20         -NA-     linux-x64     au
---------------------------------------------------------------------------------
all.q@compute-0-1.local        BIP   0/0/20         -NA-     linux-x64     au
---------------------------------------------------------------------------------
all.q@compute-0-2.local        BIP   0/0/20         -NA-     linux-x64     au
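In case it helps anyone reproduce the check, here is a minimal sketch of how the affected queue instances can be picked out of the qstat -f output above. The function name `unreachable_queues` is hypothetical, and the script assumes the six-column layout shown above (queue name, type, slots, load, arch, states); it is not part of SGE.

```python
def unreachable_queues(qstat_output):
    """Return queue instances whose state column contains both 'a' and 'u'."""
    affected = []
    for line in qstat_output.splitlines():
        fields = line.split()
        # Assumed layout: queuename qtype resv/used/tot. load_avg arch states
        # A healthy queue has an empty states column, so it has only 5 fields.
        if len(fields) != 6 or "@" not in fields[0]:
            continue  # skip the header, separator lines, and healthy queues
        queue, states = fields[0], fields[5]
        if "a" in states and "u" in states:
            affected.append(queue)
    return affected


# Sample input copied from the output in this post.
sample = """\
queuename qtype resv/used/tot. load_avg arch states
---------------------------------------------------------------------------------
all.q@compute-0-0.local BIP 0/0/20 -NA- linux-x64 au
---------------------------------------------------------------------------------
all.q@compute-0-1.local BIP 0/0/20 -NA- linux-x64 au
---------------------------------------------------------------------------------
all.q@compute-0-2.local BIP 0/0/20 -NA- linux-x64 au
"""

print(unreachable_queues(sample))
```

On my cluster this lists all three compute nodes, so none of the execution hosts is reachable from the master.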
I have tried rebooting the cluster and updating the operating system, but neither fixed the problem. I have also tried restarting SGE with ./sgemaster start (which tells me that it is already running) and ./sgeexecd start (which tells me that it is starting it). Despite all this, the error persists.
Can you think of how I could solve my problem?
Thank you very much to all.
Regards
Rafael