DNS entries killed my NFS
I am running a file-server on SLES10. I have had absolutely no problems with it until earlier this week. It has a number of NFS mounts that it serves up to other servers (all running OpenSuSE10.3 or 11), and those mounts have been in place for quite some time now without issue.
On Monday, I showed up at work and found that there had been (apparantly) a brief power outage on Sunday night (I say apparantly because I have no monitoring in place for this sort of thing, but judging by the log messages I received, the server had restarted in the middle of the night). I noticed that none of my NFS mounts were working, although all of the mounted servers listed the mounts as being active. After some time, I remembered that the only recent change had been my adding an entry in the /etc/hosts file for one of the mounted servers. I removed that entry, and everything started to work properly. The next day, the same thing happened- the logs showed what I believe to be a restart, and the NFS mounts were down. I added the DNS entry (which I had removed the day before to fix the system) and everything started working again. The next day, same scenario, and I found that changing the time to a few days later than it actually was fixed the problem.
After playing with it more, I have discovered that changing either the time or the /etc/hosts file switches the state of the mounts, either from working to not working or vice-versa. I cannot find any other things that affect this state, and I have tried multiple times restarting the NFS server without making changes to either of these, with no effect. I have checked the logs, and interestingly enough, altough the server states that the mount requests were authenticated, the servers that are trying to mount the directories return an error stating that the server denied permission.
If anyone has any ideas whatsoever as to what is happening here, or if you need more information to figure out what is wrong, please write me back. This issue is wreaking havoc on our backup process, our SVN server, and various other aspects of our daily work, and has consumed the majority of my week.
Last edited by Tyler_H72; 04-16-2009 at 03:40 PM.