Linux - NetworkingThis forum is for any issue related to networks or networking.
Routing, network cards, OSI, etc. Anything is fair game.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
I'm not sure where the issue is, however I have a RHEL v5 that has an Oracle Database along with a Java Based Application on it.
The end users try to run this Java Based Application from their pc and it takes along time for it to connect. Sometimes it doesn't connect at all. All network traffic is going across the LAN. I've also been able to ping and traceroute from the server back to a pc and vice versa, so I know its not iptables.
The DBA is looking into the database side and I'm looking at the OS side. The error message is referencing an ORA message, however I still want to do my part and make sure its not the OS.
I can see the network connections seem to be ok with the following command:
Code:
netstat -ap | grep ESTA
I'm also looking at socket settings as well, such as setting under /proc/sys/net/ipv4
I'm not sure what is going on with the incorrect value from the destination server to the 192.168.50.8.
I looked at the values of eth0 using ethtool, and looked at a few blogs online where either checksum offloading or tcpoffloading is turned off, however this is the first time I have seen this, so I'm not sure what would be the best course of action:
Code:
[root@foo core]# ethtool eth0
Settings for eth0:
Supported ports: [ TP ]
Supported link modes: 1000baseT/Full
Supports auto-negotiation: No
Advertised link modes: Not reported
Advertised auto-negotiation: No
Speed: 1000Mb/s
Duplex: Full
Port: Twisted Pair
PHYAD: 0
Transceiver: internal
Auto-negotiation: off
Link detected: yes
[root@foo core]# ethtool -k eth0
Offload parameters for eth0:
rx-checksumming: on
tx-checksumming: on
scatter-gather: on
tcp segmentation offload: on
udp fragmentation offload: off
generic segmentation offload: off
generic-receive-offload: off
Last edited by JockVSJock; 06-29-2015 at 02:24 PM.
Isn't that error msg saying that the destination sent a corrupt packet? I may be laying a red herring, but that would tie in with the symptom of long login times while corrupt pkts are discarded until valid ones are received. Have you run the pings for a huge number of datagrams and from the Oracle machine to yours? OTOH ping datagrams may not necessarily show up as corrupt in any case.
I would be looking at the (possibly failing) nic card for replacement on the Oracle machine. Since they're so cheap, it's an easy and quick elimination of a possible contributor to the problem. (Of course, it could be your nic that is calculating the checksum incorrectly or it could be the cable corrupting the data. The joys of network troubleshooting...)
I would be looking at the (possibly failing) nic card for replacement on the Oracle machine. Since they're so cheap, it's an easy and quick elimination of a possible contributor to the problem. (Of course, it could be your nic that is calculating the checksum incorrectly or it could be the cable corrupting the data. The joys of network troubleshooting...)
Crap, I forgot to mention that this is a VM in VMWare vCenter, not a physical machine. Which introduces a whole level of complexity to the situation.
This VM is shares a data store on a SAN with a number of other VMs, and they don't seem to have any networking issues either, or at least I don't see any issues with them.
I didn't think of the idea of trying to send bigger packets from the Oracle machine to the client. I would have to read up on this because I've never done this before.
No I didn't mean bigger datagrams, I meant more. I was thinking along the lines of a degrading nic that occasionally sends corrupt packets, in which case you would have to capture a lot of them to see this.
As to the VMWare, I can't offer any help as I don't use it.
I changed the driver that was tied to the NIC in RHEL, went from Flexible to VMXNET3 and once doing that we ran the test again and I watched the tcpdump traffic and no longer see the incorrect error. The error we are getting now says: Socket red time out62000
However we are still getting the socket error, however I'm starting to lean more towards that this issue maybe with the software and how it is trying to connect.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.