openSUSE 11.1 cluster mount issues
We have a 150-node cluster that ran openSUSE 10.2 with no problems. To let users reach their autofs-mounted home directories (using NIS) on all the nodes, we set /proc/sys/sunrpc/max_resvport to 5000, which worked well the whole time. After limited but successful testing, we upgraded the cluster to openSUSE 11.1, and with max_resvport set to 5000 we now get these errors:
kernel: lockd_up: makesock failed, error=-13
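For reference, the -13 in that message is a negated errno value, which on Linux is EACCES (permission denied). A minimal sketch for decoding such codes, assuming Linux errno numbering; the helper name decode_kernel_error is just illustrative:

```python
import errno
import os


def decode_kernel_error(code):
    """Turn a negated errno from a kernel log line into 'NAME: message'."""
    number = abs(code)
    # errno.errorcode maps numeric values to symbolic names such as 'EACCES'.
    name = errno.errorcode.get(number, "UNKNOWN")
    return "{}: {}".format(name, os.strerror(number))


if __name__ == "__main__":
    # The value reported by lockd_up in the log line above.
    print(decode_kernel_error(-13))
```

This only names the error; it does not say which check in the kernel refused the socket.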
Users are randomly unable to reach their home directories, and the problem gradually gets worse. We can get rid of the errors by dropping max_resvport back to 1023, but that defeats the purpose, because we then run into the NFS mount limit. I've tried both portmap and rpcbind (also with -i for insecure mode), and the behavior is the same either way. I don't think it's a firewall issue: the internal network is wide open, and the external side (through a head node) seems to allow all NFS and NIS traffic. I've also found that nscd dies and causes many ypcall errors. Keeping nscd running (either by restarting it or by installing unscd) limits the yp errors, but it doesn't fix the max_resvport problem.
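To make the port arithmetic concrete, here is a rough sketch that reads the sunrpc port settings on a node and reports how many source ports the range covers. It assumes min_resvport sits next to max_resvport under /proc/sys/sunrpc, and that the size of the range is only an upper bound on simultaneous connections; the function names are made up for illustration.

```python
from pathlib import Path

# Location of the sunrpc tunables on a Linux node.
SUNRPC_DIR = Path("/proc/sys/sunrpc")


def read_port(directory, name):
    """Read one integer tunable (e.g. max_resvport) from the given directory."""
    return int((Path(directory) / name).read_text().strip())


def resvport_span(directory=SUNRPC_DIR):
    """Return (min_resvport, max_resvport, number of ports in the range)."""
    low = read_port(directory, "min_resvport")
    high = read_port(directory, "max_resvport")
    return low, high, high - low + 1


if __name__ == "__main__":
    try:
        low, high, count = resvport_span()
        print("sunrpc source ports {}-{} ({} ports)".format(low, high, count))
        if high > 1023:
            # Ports above 1023 are outside the traditional privileged range.
            print("note: max_resvport is above 1023")
    except OSError as exc:
        # The sunrpc module may not be loaded, or this is not a Linux host.
        print("could not read sunrpc settings: {}".format(exc))
```

Running this on a working 10.2 node and on a failing 11.1 node would at least confirm that both see the same range.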
Any ideas would be greatly appreciated.