[SOLVED] samba resetting users connections to files
Linux - ServerThis forum is for the discussion of Linux Software used in a server related context.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
I am working on a PDC for our company running Debian Sarge. It has Bind9.2.4, Samba 3.0.14a and exim4.50. All clients are Windows XP SP2 with the Windows firewall disabled. ZoneAlarm is the 3rd party firewall, AV, anti-spam, anti-malware application of choice (for now).
The main problem that we are having right now is a lack of reliable communication with the server's shares. The users are able to connect to the server, open a file (using the applicable editor), save several changes to the file while working on it, all without problems. However, there have been several instances of users who work on files for quite a bit of time (an hour or more) having their work interrupted with a connection error (currently sitting at about a 30% failure rate). These users will be trudging along and suddenly, they receive a message (usually a Windows type error) telling them that the connection to the file has been lost/severed and that they cannot save their changes. They then have to save it locally, re-establish connection to the network share (usually just openning it in Windows Explorer does the trick, but sometimes it does prompt for their username/password) and overwrite the original with the locally saved copy. This generally happens after the first hour of openning the file but has also happened within the first 15 minutes.
ZoneAlarm has been uninstalled from a few users systems with no effect on the problem. I have recompiled to use cifs instead of smbfs for the network shares, but this only seemed to increase the time it takes for the connection to drop. Now it takes 30-90 minutes for the connection to drop, instead of 15-50.
I've gone through the logs and can see no indication of problems with user connections. No resets, no excessibe packet loss, nada. The networking kit is all new NetGear Gigabit (10/100/1000) switches and new Cat5e cable. There is only one system older than 2 years and it is only used for scanning hardcopy. All the rest are new Dell's.
I'm not sure what else to post here to help clarify the problem. If you need more info, I'll be happy to provide whatever answers I can. I'm nowhere near Guru status yet, so forgive me if I ask any dumb questions. Thanks for reading; thanks a bunch for helping.
I don't know if this is your problem but it's worth taking a look at and following any leads that it provides. It's from the smb.conf man page.
Quote:
keepalive (G)
The value of the parameter (an integer) represents the number of seconds between keepalive packets. If this parameter is zero, no keepalive packets will be sent. Keepalive packets, if sent, allow the server to tell whether a client is still present and responding.
Keepalives should, in general, not be needed if the socket has the SO_KEEPALIVE attribute set on it by default. (see socket options). Basically you should only use this option if you strike difficulties.
Thanks for the reply, xjlittle. As I understand it, if you do not specify a value for keepalive, the default value is used as ..., well ..., the default. I went ahead and specified a keepalive of 600 (10 minutes) in my smb.conf, but it had no effect.
I have read through the man pages for smb.conf and samba already. I've tried messing with the various oplock settings thinking one of them might be the problem, but had no luck with those either. I feel that I'm looking in the wrong place, but haven't got a clue where else to look.
Is there any other info I can provide to clarify the problem?
Could you post your smb.conf please? My experience tells me that if it is samba it most likely is a configuration error in the smb.conf. If there is nothing there you will need to go back to the workstations.
If it were me I would bring up a totally clean ws without all the av, antispam antiwhatever and see if it experiences the same problem. Give it to a user who seems to have the problem a lot (or screams the loudest )
I can think of two other things off hand.
You are using 3.0.14 which is getting old compared to WinXP SP2. MS does a lot of things with the way they communicate with other machines eg. digital signing. I would seriously consider upgrading to the latest version of samba.
Other than that I would try a clean workstation as I mentioned before w/o all the anti{spyware,virus,etc} stuff. You might even leave off sp2.
Beyond all of that it seems that this a transient thing. It could be a nic going bad and unable to handle a certain load, so app running on the server that causes overload and so on. Keep watching your /var/log/messages. Sooner or later you will spot it and get it fixed.
As this is a production server for a corporate domain, I'm more than a little leary about upgrading something beyond the stable release. I'll go ahead and check through the changes made, just in case.
I will keep an eye on the logs. I'm still hoping that something will pop up that will let me know where to focus my attention. Thanks for your help.
I apologize for not addressing this earlier, but I thought that I had. You suggested a couple times that I test with a "clean" machine. I do have two machines that I use for testing purposes that have never had anti-virus, anti-malware, firewall or any other restrictive software installed (except windows, of course). Both of those systems experience the same problems as all the others. I have also removed the anti-whatever suite from my computer and still experienced the same problem. So it doesn't seem to matter if the computer has AV, had it and it was removed or never had it at all.
Thank you. I didn't realize that Debian locked Samba to the version included with the .iso and no further. I'll go ahead and update and post the results.
The upgrade went very smoothly, but did not solve the problem. It did have a positive effect though. The problems occur much less frequently now. I would say about half as often as they did before the upgrade. Anything else to check?
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.