LinuxQuestions.org
05-23-2005, 10:16 AM   #1
asmithumd
Automount fails and doesn't retry


Our system contains about 10 disk servers and about 20 compute nodes. We use
NIS with automount to configure disk sharing. The system works fine except
when the load on a disk server is high. When this is the case, it is possible
for a mount request (from automount on a compute node) to time out. Automount
reports in /var/log/messages that the "mount failed".

The problem is that the process that requested the mount then dies, because
it doesn't have the data it needs to run. We use Torque as a batch system
for production jobs, so when a job dies, Torque sends the next job in the queue
to the compute node, and it promptly dies too. This goes on until
all the jobs waiting in the queue have been submitted and have died.

The problem with automount is twofold:

1) Under high load, when a mount request times out, automount does not
resubmit the request. There does not seem to be a way to lengthen the
timeout or increase the number of attempts. Note that this is not the idle
unmount timeout ("--timeout") that I am talking about.

2) If automount fails to mount a disk, subsequent attempts to mount the disk
fail instantly. This is big time bad news for me, because it causes my
jobs to die by the hundreds when one compute node goes south.
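
To make the missing behavior in (1) concrete, here is a rough sketch of the kind of retry I would like to see, done on the job side instead of inside automount. It is only an illustration: the function name wait_for_path, the attempt count, and the delays are all made up, and I have not verified that a later probe will succeed once automount has started failing instantly as described in (2).

```python
import os
import time


def wait_for_path(path, attempts=5, delay=2.0, backoff=2.0,
                  probe=os.stat, sleep=time.sleep):
    """Probe `path` until it is reachable or the attempts run out.

    Hypothetical helper, not part of automount or Torque.
    Returns True if a probe succeeded, False otherwise.
    """
    for attempt in range(attempts):
        try:
            # Accessing a path under an automount point is what
            # triggers the mount request in the first place.
            probe(path)
            return True
        except OSError:
            # Wait before the next try, but not after the last one.
            if attempt < attempts - 1:
                sleep(delay)
                delay *= backoff  # back off so a loaded server gets some room
    return False
```

A job wrapper could call something like wait_for_path("/some/automounted/dir") before starting the real work and give up cleanly if it returns False, so that a single bad node does not drain the queue. The path here is a placeholder.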

OS: RHEL-3

Any ideas? -Andy
 
  


All times are GMT -5.
