LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Server (https://www.linuxquestions.org/questions/linux-server-73/)
-   -   Windows Server 2003 clients periodically unable to read CIFS shares (https://www.linuxquestions.org/questions/linux-server-73/windows-server-2003-clients-periodically-unable-to-read-cifs-shares-740355/)

AdamDaughterson 07-15-2009 04:50 PM

Windows Server 2003 clients periodically unable to read CIFS shares
 
Hi forum,

Here are the symptoms:

Periodically, and especially when our dual-quad-core file servers are under load, the Windows Server 2003 systems report:
Quote:

"Insufficient system resources exist to complete the requested service"
With log level turned up to 6 in smb.conf, the only thing logged at the exact moment of failure is:
Quote:

@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^ @^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^ @^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^v
(no exaggeration, it's all At's and Carat's and an arbitrary seeming number of 'em. A Google search for even the phonetic 'at'+'carat' doesn't return a thing...)
...which leads me to many various docs, including a KB which indicates NTBackup has such and such an issue with the blob of data that certain apps send instead of chunking it out, and to assuage the issue one can simply increase PagedPoolSize etc, and at this point I'm near to insane with the babysitting and restarting samba when my monitors report they can't get the directory listing any longer.

Configuration is fairly simple. Here's my smb.conf sans anything that might get me in trouble:
Code:

[global]
        log file = /var/log/samba/%m.log
        log level = 3
        socket options = TCP_NODELAY SO_RCVBUF=8162 SO_SNDBUF=8162 IPTOS_LOWDELAY
        smb ports = 445
        oplocks = no
        dns proxy = yes
        disable netbios = yes
        server string = SATA04
        workgroup = TPV
        security = SHARE
        max log size = 50
        unix extensions = no
        local master = no
        os level = 0
        load printers = no
        map to guest = bad user
        deadtime = 1

...plus some share definitions which basically look like:
Code:

        path = /some/path
        force directory mode = 777
        force create mode = 777
        comment = This is a comment
        create mode = 777
        directory mode = 777
        write list = @writers
        guest ok = yes

DNS is functional, CentOS is up to date on one system, but the same issue occurs on a RHEL4 system with a completely different version of samba, so I guess the fact that CentOS is current is immaterial.

I've done tcpdump's and captured the packets that are being transmitted at the time of the failure between the linux and windows systems and there are no errors (like invalid checksums, or otherwise transport issues).

Thanks in advance...


A

AdamDaughterson 07-15-2009 04:54 PM

Just as an additional bit, I just had the issue and tested from Windows 2000 Server (sp4, up to date, blah, blah) and the issue occurs there too.

A

stress_junkie 07-15-2009 05:04 PM

You might get some useful information from sar. It is in the sysstats package. Once you install sar you have got to configure it to collect data on a regular basis, such as every ten minutes. It will keep enormous amounts of data about every little thing going on in the kernel. You have to tell it to do that. The default configuration may be nearly useless on your distribution; it is on mine.

Once you have sar data collector running it should provide a really great snapshot of system resources at the time of the next failure. You may have to be clever about what you ask sar to tell you. It has lots of parameters because it can show you lots of different things.

I would first suspect that the insufficient resources could be memory or network buffers. Fixing one of those could require you to recompile the kernel.

Keep us up to date on this. It sounds really interesting.

There may be useful information at www.samba.org. I don't know.

AdamDaughterson 08-10-2009 09:53 AM

Just to follow up on this post:

By changing the following, I have been able to reduce the amount we are seeing this particular issue 99%.

security = user
encrypt passwords = yes
bind interfaces only = yes
interfaces = ip.of.this.system 127.0.0.1

#Although the next two were already present in my config, I changed the system to use a RO account and an RW account. {shrugs} Not sure if it has anything to do with the error going away, but it does control write access...
guest account = roaccount
map to guest = bad user


All times are GMT -5. The time now is 02:36 PM.