LinuxQuestions.org
Linux - Networking This forum is for any issue related to networks or networking.
Routing, network cards, OSI, etc. Anything is fair game.

Old 10-22-2022, 12:33 PM   #1
Sheriff Hobbes
LQ Newbie
 
Registered: Nov 2010
Location: Germany (DDR 4.0)
Distribution: Debian 11
Posts: 24

Rep: Reputation: 5
Physical adapter in a bridge entering disabled state


Hi!

On a notebook running Debian 11.4, the network connection resets from time to time and I don't know why. I have configured a bridge because I need to share the connection with virtual machines. The bridge looks like this:

Code:
root@xxx:~# brctl show
bridge name     bridge id               STP enabled     interfaces
bridge-hn       8000.9638fea2521a       no              enp0s25
                                                        vnet12
                                                        vnet14
                                                        vnet16

enp0s25 is the physical adapter causing the problem:

Code:
root@xxx:~# cat /var/log/messages |grep enp0s25
Oct 22 15:41:11 xxx kernel: [2252362.136807] bridge-hn: port 1(enp0s25) entered disabled state
Oct 22 15:41:15 xxx kernel: [2252365.909356] e1000e 0000:00:19.0 enp0s25: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
Oct 22 15:41:15 xxx kernel: [2252365.909441] bridge-hn: port 1(enp0s25) entered blocking state
Oct 22 15:41:15 xxx kernel: [2252365.909455] bridge-hn: port 1(enp0s25) entered forwarding state
I thought that spanning tree protocol was doing this, so I disabled it, but the problem continued.
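
To double-check that STP really is off, the bridge's state can also be read from sysfs, where the kernel bridge driver exposes it as `bridge/stp_state` (0 means disabled). This is only a minimal sketch: the helper name `stp_state` and its optional second argument (an alternative sysfs root, there purely so the function can be tried against a test directory) are my own assumptions.

```shell
# stp_state BRIDGE [SYSFS_ROOT]
# Prints "disabled" if the bridge's stp_state attribute is 0, "enabled" otherwise.
# SYSFS_ROOT defaults to /sys/class/net; it is a hypothetical hook for testing.
stp_state() {
    bridge=$1
    root=${2:-/sys/class/net}
    file="$root/$bridge/bridge/stp_state"
    if [ ! -r "$file" ]; then
        echo "cannot read $file" >&2
        return 1
    fi
    if [ "$(cat "$file")" = "0" ]; then
        echo disabled
    else
        echo enabled
    fi
}

# Usage on the machine from this post:
#   stp_state bridge-hn
```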

Any ideas what might cause this and how to fix it?
This is really annoying because whenever I listen to streamed music from that notebook, I get interruptions. My network is really simple: a single 192.168.0.0/24 local network connected to one switch.

Thanks,
SH
 
Old 10-23-2022, 08:56 PM   #2
Ser Olmy
Senior Member
 
Registered: Jan 2012
Distribution: Slackware
Posts: 3,341

Rep: Reputation: Disabled
I'd like to see the line just above the "entered disabled state" message, because the next line:
Code:
e1000e 0000:00:19.0 enp0s25: NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
...strongly implies that the interface link went down for some reason.

Try grepping for messages related to the e1000e driver. If the link goes down sporadically and then comes up again, you're probably experiencing a NIC/cable/switchport issue (layer 1).
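
As a rough sketch of that search: the function below prints every log line that mentions either the driver or the interface, so any link-down message from the driver would show up right next to the bridge messages. The function name `nic_events` is made up for illustration, and the default log path is simply the one used earlier in the thread.

```shell
# nic_events [LOGFILE]
# Show log lines that mention the e1000e driver or the enp0s25 interface.
nic_events() {
    grep -E 'e1000e|enp0s25' "${1:-/var/log/messages}"
}

# Usage:
#   nic_events                      # search /var/log/messages
#   nic_events /var/log/kern.log    # or point it at another log file
```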

The blocking/forwarding messages seem to be simply symptoms of the underlying problem. Also, if you look at the timestamps, the port does go directly from blocking to forwarding without the typical STP delay, as one would expect when STP is not running.
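
To put a number on that, the gap between the two state changes can be computed from the bracketed kernel timestamps. Here is a small awk sketch (the function name `state_gap` is hypothetical) that reads log lines on standard input and prints the seconds between each "entered blocking state" message and the "entered forwarding state" message that follows it:

```shell
# state_gap: read kernel log lines on stdin and print the time in seconds
# between "entered blocking state" and the next "entered forwarding state".
state_gap() {
    awk '
        # Extract the kernel timestamp (seconds since boot) from "[12345.678901]".
        function ts(line,    a, b) {
            a = index(line, "[")
            b = index(line, "]")
            return substr(line, a + 1, b - a - 1) + 0
        }
        /entered blocking state/   { blocked = ts($0) }
        /entered forwarding state/ { printf "%.6f\n", ts($0) - blocked }
    '
}

# Usage:
#   grep 'bridge-hn: port 1' /var/log/messages | state_gap
```

For the two lines quoted in the first post this prints 0.000014, i.e. the port goes to forwarding within microseconds rather than after a multi-second delay.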
 
Old 10-24-2022, 10:31 AM   #3
Sheriff Hobbes
Original Poster
Quote:
Originally Posted by Ser Olmy
...strongly implies that the interface link went down for some reason.

No, it doesn't; that's the strange thing. The excerpt from /var/log/messages was everything logged about the issue. There is no "link down" message preceding the "entered disabled state" line.

I replaced the switch and the problem continues. Next I can try a different cable, but it has happened with different cables already, so that's probably not the issue. I'm also thinking about disabling Network Manager and reconfiguring the bridge with the "old" script-based method. If that doesn't help either, maybe the NIC has a problem? That would be the worst case, since it would mean getting a new notebook.
 
Old 10-25-2022, 02:55 PM   #4
Sheriff Hobbes
Original Poster
I completely uninstalled network-manager and set up the bridge manually. The problem continues, but now I get this interesting info preceding "port 1(enp0s25) entered disabled state":

Code:
Oct 25 23:37:20 xxx kernel: [25661.968604] ------------[ cut here ]------------
Oct 25 23:37:20 xxx kernel: [25661.968633] NETDEV WATCHDOG: enp0s25 (e1000e): transmit queue 0 timed out
Oct 25 23:37:20 xxx kernel: [25661.968668] WARNING: CPU: 5 PID: 0 at net/sched/sch_generic.c:467 dev_watchdog+0x25c/0x260
Oct 25 23:37:20 xxx kernel: [25661.968673] Modules linked in: rpcsec_gss_krb5 vhost_net vhost vhost_iotlb tap tun nft_chain_nat nf_nat cmac algif_hash algif_skcipher af_alg bnep bridge stp llc binfmt_misc isofs loop btusb btrtl btbcm btintel bluetooth raid1 uvcvideo cdc_mbim cdc_wdm dm_raid videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 raid456 videobuf2_common jitterentropy_rng async_raid6_recov async_memcpy async_pq drbg async_xor cdc_ncm async_tx cdc_ether md_mod videodev ansi_cprng hid_generic usbnet xor ecdh_generic mc ecc xfs cdc_acm mii raid6_pq intel_rapl_msr intel_rapl_common uas rmi_smbus rmi_core usbhid hid x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_codec_hdmi usb_storage kvm_intel snd_hda_codec_realtek snd_hda_codec_generic kvm snd_hda_intel nf_log_ipv6 snd_intel_dspcfg ip6t_REJECT iwlmvm nf_reject_ipv6 irqbypass crc32_pclmul soundwire_intel ghash_clmulni_intel soundwire_generic_allocation xt_hl ip6_tables aesni_intel snd_soc_core ip6t_rt libaes i915 mac80211 snd_compress crypto_simd
Oct 25 23:37:20 xxx kernel: [25661.969128]  rtsx_pci_sdmmc cryptd libarc4 soundwire_cadence nf_log_ipv4 glue_helper nf_log_common iTCO_wdt mei_wdt snd_hda_codec ipt_REJECT intel_pmc_bxt snd_hda_core nf_reject_ipv4 iTCO_vendor_support mei_hdcp mmc_core xhci_pci at24 watchdog wmi_bmof iwlwifi xt_LOG rapl ehci_pci xt_comment snd_hwdep intel_cstate e1000e xhci_hcd xt_multiport drm_kms_helper ehci_hcd sr_mod soundwire_bus thinkpad_acpi cfg80211 intel_uncore snd_pcm mei_me cdrom nvram usbcore i2c_i801 nft_limit ptp cec rtsx_pci mei snd_timer ledtrig_audio usb_common joydev i2c_algo_bit pcspkr i2c_smbus pps_core sg snd lpc_ich tpm_tis soundcore tpm_tis_core wmi rfkill tpm battery ac rng_core video button xt_limit xt_addrtype xt_tcpudp xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nft_counter nfsd nf_tables auth_rpcgss nfs_acl parport_pc lockd ppdev libcrc32c grace nfnetlink lp parport drm sunrpc fuse configfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 crc32c_generic dm_mod sd_mod t10_pi crc_t10dif
Oct 25 23:37:20 xxx kernel: [25661.969651]  crct10dif_generic ahci libahci libata scsi_mod psmouse crct10dif_pclmul crct10dif_common evdev crc32c_intel serio_raw
Oct 25 23:37:20 xxx kernel: [25661.969706] CPU: 5 PID: 0 Comm: swapper/5 Not tainted 5.10.0-16-amd64 #1 Debian 5.10.127-1
Oct 25 23:37:20 xxx kernel: [25661.969710] Hardware name: LENOVO 20BE00B8GE/20BE00B8GE, BIOS GMET91WW (2.39 ) 06/03/2021
Oct 25 23:37:20 xxx kernel: [25661.969722] RIP: 0010:dev_watchdog+0x25c/0x260
Oct 25 23:37:20 xxx kernel: [25661.969736] Code: eb a9 48 8b 1c 24 c6 05 a1 5e 10 01 01 48 89 df e8 a9 97 fa ff 44 89 e9 48 89 de 48 c7 c7 d0 c8 96 aa 48 89 c2 e8 28 45 14 00 <0f> 0b eb 86 0f 1f 44 00 00 41 57 41 56 49 89 d6 41 55 4d 89 c5 41
Oct 25 23:37:20 xxx kernel: [25661.969742] RSP: 0018:ffffaef1801e0eb0 EFLAGS: 00010282
Oct 25 23:37:20 xxx kernel: [25661.969754] RAX: 0000000000000000 RBX: ffff9049110cc000 RCX: 0000000000000000
Oct 25 23:37:20 xxx kernel: [25661.969765] RDX: ffff904a3676c760 RSI: ffff904a3675ca00 RDI: 0000000000000300
Oct 25 23:37:20 xxx kernel: [25661.969770] RBP: ffff9049110cc3dc R08: 0000000000000000 R09: ffffaef1801e0cd0
Oct 25 23:37:20 xxx kernel: [25661.969775] R10: ffffaef1801e0cc8 R11: ffffffffaaecb448 R12: ffff904903c63680
Oct 25 23:37:20 xxx kernel: [25661.969781] R13: 0000000000000000 R14: ffff9049110cc480 R15: 0000000000000001
Oct 25 23:37:20 xxx kernel: [25661.969795] FS:  0000000000000000(0000) GS:ffff904a36740000(0000) knlGS:0000000000000000
Oct 25 23:37:20 xxx kernel: [25661.969801] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 25 23:37:20 xxx kernel: [25661.969806] CR2: 0000384c0bb11000 CR3: 000000001680a003 CR4: 00000000001726e0
Oct 25 23:37:20 xxx kernel: [25661.969813] Call Trace:
Oct 25 23:37:20 xxx kernel: [25661.969826]  <IRQ>
Oct 25 23:37:20 xxx kernel: [25661.969839]  ? pfifo_fast_enqueue+0x150/0x150
Oct 25 23:37:20 xxx kernel: [25661.969847]  call_timer_fn+0x29/0xf0
Oct 25 23:37:20 xxx kernel: [25661.969860]  __run_timers.part.0+0x1d5/0x250
Oct 25 23:37:20 xxx kernel: [25661.969875]  ? recalibrate_cpu_khz+0x10/0x10
Oct 25 23:37:20 xxx kernel: [25661.969884]  ? ktime_get+0x38/0xa0
Oct 25 23:37:20 xxx kernel: [25661.969898]  ? lapic_next_deadline+0x28/0x30
Oct 25 23:37:20 xxx kernel: [25661.969907]  ? clockevents_program_event+0x8d/0xf0
Oct 25 23:37:20 xxx kernel: [25661.969920]  run_timer_softirq+0x26/0x50
Oct 25 23:37:20 xxx kernel: [25661.969929]  __do_softirq+0xc5/0x275
Oct 25 23:37:20 xxx kernel: [25661.969944]  asm_call_irq_on_stack+0x12/0x20
Oct 25 23:37:20 xxx kernel: [25661.969949]  </IRQ>
Oct 25 23:37:20 xxx kernel: [25661.969963]  do_softirq_own_stack+0x37/0x40
Oct 25 23:37:20 xxx kernel: [25661.969978]  irq_exit_rcu+0x8e/0xc0
Oct 25 23:37:20 xxx kernel: [25661.969991]  sysvec_apic_timer_interrupt+0x36/0x80
Oct 25 23:37:20 xxx kernel: [25661.970002]  asm_sysvec_apic_timer_interrupt+0x12/0x20
Oct 25 23:37:20 xxx kernel: [25661.970015] RIP: 0010:cpuidle_enter_state+0xc7/0x350
Oct 25 23:37:20 xxx kernel: [25661.970025] Code: 8b 3d 6d 40 17 56 e8 28 fa a1 ff 49 89 c5 0f 1f 44 00 00 31 ff e8 39 05 a2 ff 45 84 ff 0f 85 fa 00 00 00 fb 66 0f 1f 44 00 00 <45> 85 f6 0f 88 06 01 00 00 49 63 c6 4c 2b 2c 24 48 8d 14 40 48 8d
Oct 25 23:37:20 xxx kernel: [25661.970030] RSP: 0018:ffffaef1800ebea8 EFLAGS: 00000246
Oct 25 23:37:20 xxx kernel: [25661.970044] RAX: ffff904a3676fd40 RBX: 0000000000000005 RCX: 000000000000001f
Oct 25 23:37:20 xxx kernel: [25661.970049] RDX: 0000000000000000 RSI: 0000000033519044 RDI: 0000000000000000
Oct 25 23:37:20 xxx kernel: [25661.970053] RBP: ffff904a36778800 R08: 00001756e485906f R09: 0000000000000001
Oct 25 23:37:20 xxx kernel: [25661.970058] R10: 0000000000000000 R11: 0000000000011b73 R12: ffffffffaafae7c0
Oct 25 23:37:20 xxx kernel: [25661.970062] R13: 00001756e485906f R14: 0000000000000005 R15: 0000000000000000
Oct 25 23:37:20 xxx kernel: [25661.970075]  ? cpuidle_enter_state+0xb7/0x350
Oct 25 23:37:20 xxx kernel: [25661.970089]  cpuidle_enter+0x29/0x40
Oct 25 23:37:20 xxx kernel: [25661.970107]  do_idle+0x1ef/0x2b0
Oct 25 23:37:20 xxx kernel: [25661.970122]  cpu_startup_entry+0x19/0x20
Oct 25 23:37:20 xxx kernel: [25661.970133]  secondary_startup_64_no_verify+0xb0/0xbb
Oct 25 23:37:20 xxx kernel: [25661.970141] ---[ end trace fdd8392c01234d3f ]---
Does this indicate a problem with the CPU?
 
Old 10-25-2022, 08:35 PM   #5
Ser Olmy
Quote:
Originally Posted by Sheriff Hobbes
Code:
Oct 25 23:37:20 xxx kernel: [25661.968604] ------------[ cut here ]------------
Oct 25 23:37:20 xxx kernel: [25661.968633] NETDEV WATCHDOG: enp0s25 (e1000e): transmit queue 0 timed out
It appears the NIC isn't serving the transmit queue.
Quote:
Originally Posted by Sheriff Hobbes
Does this indicate a problem with the CPU?
No, it indicates a problem with either the NIC, the link, or the NIC driver.

It could be a defective NIC, it could be a driver bug (not very likely given how common this NIC is), or it could be a link issue causing the NIC to stall.
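
The watchdog line itself already names the interface, the driver, and the stalled queue, which is what points at the network side rather than the CPU. As an illustration only (the function name `watchdog_info` is made up), this sed sketch pulls those three fields out of any such lines fed to it on standard input:

```shell
# watchdog_info: read kernel log lines on stdin and, for each
# "NETDEV WATCHDOG" line, print the interface, driver and queue it names.
watchdog_info() {
    sed -n 's/.*NETDEV WATCHDOG: \([^ ]*\) (\([^)]*\)): transmit queue \([0-9]*\) timed out.*/interface=\1 driver=\2 queue=\3/p'
}

# Usage:
#   watchdog_info < /var/log/messages
```

Run against the line quoted above, it prints `interface=enp0s25 driver=e1000e queue=0`.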

What does ifconfig enp0s25 have to say about RX/TX errors?
 
Old 10-26-2022, 03:24 AM   #6
Sheriff Hobbes
Original Poster
Quote:
Originally Posted by Ser Olmy
What does ifconfig enp0s25 have to say about RX/TX errors?

ifconfig isn't installed by default on Debian 11 (it is part of the net-tools package), so here is the output of ip instead:

Code:
root@xxx:~# ip -s link show enp0s25 
2: enp0s25: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast master bridge-hn state UP mode DEFAULT group default qlen 1000
    link/ether 54:ee:75:52:1d:a7 brd ff:ff:ff:ff:ff:ff
    RX: bytes  packets  errors  dropped missed  mcast   
    1006778337 1594145  0       2419    0       10432   
    TX: bytes  packets  errors  dropped carrier collsns 
    4778083517 3640443  0       0       0       0
No errors, but 0.15% of the received packets were dropped.
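
For reference, the percentage comes straight from the RX counters above (dropped divided by packets). A minimal sketch of the arithmetic; the helper name `drop_pct` is just for illustration:

```shell
# drop_pct DROPPED PACKETS
# Print dropped packets as a percentage of received packets, to two decimals.
drop_pct() {
    awk -v d="$1" -v p="$2" 'BEGIN {
        if (p == 0) { print "n/a"; exit }   # avoid dividing by zero
        printf "%.2f\n", 100 * d / p
    }'
}

drop_pct 2419 1594145   # prints 0.15
```

The same output lists RX errors separately, and those are 0, so the drops on their own don't show corrupted frames on the wire.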
 
Old 10-26-2022, 11:03 AM   #7
Sheriff Hobbes
Original Poster
I updated to kernel 5.15.74 and the problem continues. Next I will buy a USB Ethernet adapter and test with that instead of the onboard port.
 
Old 11-05-2022, 02:45 PM   #8
Sheriff Hobbes
Original Poster
I deactivated the onboard NIC in the BIOS and installed this cheap USB NIC:
https://www.amazon.de/gp/product/B00M77HMU0/

It works out of the box on Debian 11 with kernel 5.15.74, and the problem is gone!
So either the onboard NIC or the e1000e driver was at fault; we'll never know which.
 
  

