LinuxQuestions.org
Help answer threads with 0 replies.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Slackware
User Name
Password
Slackware This Forum is for the discussion of Slackware Linux.

Notices


Reply
  Search this Thread
Old 10-01-2018, 12:56 PM   #16
abga
Senior Member
 
Registered: Jul 2017
Location: EU
Distribution: Slackware
Posts: 1,634

Rep: Reputation: 929Reputation: 929Reputation: 929Reputation: 929Reputation: 929Reputation: 929Reputation: 929Reputation: 929

Regarding OOM (Out Of Memory), without ruling it out, I never experienced a hardlock because of it in modern kernels, or, at least the kernel was able to throw some lines in the logs.
https://www.kernel.org/doc/gorman/ht...rstand016.html
Here, what to look after in the logs in case of an OOM event:
https://serverfault.com/questions/54...yslog-messages
https://access.redhat.com/solutions/2612861
 
Old 10-01-2018, 10:19 PM   #17
ahTh6aaf
LQ Newbie
 
Registered: Jul 2017
Posts: 19

Rep: Reputation: Disabled
It happened exactly the same thing just a couple of hours ago... Never happened before...
 
Old 10-02-2018, 04:58 AM   #18
elcore
Senior Member
 
Registered: Sep 2014
Distribution: Slackware
Posts: 1,753

Rep: Reputation: Disabled
Seen this couple of years ago, but only with VFS and that thing file managers use to auto-mount usb drives.
Never happened to me in runlevel 3 where I mount everything manually.

Guess "I safely ejected the drive and went out" doesn't say much, OP. Maybe specify how did you mount/umount the drive, what commands?
It could be something broke in the kernel but I doubt it. Probably a bug in dbus, udisks, udisks2. Or someone might have accessed it while you weren't around.
 
Old 10-08-2018, 05:11 AM   #19
NonNonBa
Member
 
Registered: Aug 2010
Distribution: Slackware
Posts: 192

Rep: Reputation: Disabled
I got this too, several times since the last kernel update in -stable. Always the same, I'm closing the X apps (usually one or two xterms and firefox) before closing the WM (openbox), all with the keyboard shortcuts, and everything freezes suddenly before I complete. I suspect firefox has something to do with this, because it occurs always after I closed it, and I loose its usual dimensions and the menu bar once it is restarted next, which means the window disappeared but the application in fact did not properly exit. A conflict between firefox and the WM wouldn't surprise me, as the first is the only application I can move by just left-clicking on its window, which obviously cheats my openbox config. What I can't explain is why the heck it would prevent the power button to trigger init 0 as it normally does. The system logs sadly don't enlighten me much.
 
1 members found this post helpful.
Old 10-09-2018, 12:11 AM   #20
jostber
Member
 
Registered: Jul 2001
Location: Skien, Norway
Distribution: Slackware Current 64-bit
Posts: 543

Rep: Reputation: 178Reputation: 178
When you experience hardlock can you try to get some output from the magic Sysrq function to check what is causing this?

Code:
AltGr-Sysrq-r (Turns off keyboard raw mode)

AltGr-Sysrq-t (Will dump a list of current tasks and their information to your console)

AltGr-Sysrq-m (Will dump current memory info to your console)
 
Old 10-13-2018, 05:01 AM   #21
NonNonBa
Member
 
Registered: Aug 2010
Distribution: Slackware
Posts: 192

Rep: Reputation: Disabled
Quote:
Originally Posted by jostber View Post
When you experience hardlock can you try to get some output from the magic Sysrq function to check what is causing this?
Unfortunately, I'm jailed in X, and can't access a console.

Some news, though. I first tried to update my BIOS, and get things horribly worse. Closing firefox then systematically caused a general freeze. I tried to switch to fluxbox but it didn't help. I therefore downgraded the kernel to the original -stable 4.4.14, along with the firmware it came with. Everything worked back. So, I upgraded the firmware to the version in patches/ to see if the issue was there, but did not notice any change. Finally I upgraded the kernel to the 4.14.74 and everything is currently working like a charm. There's clearly something in the 4.4.157 allowing firefox to wreck the system which has since then been fixed, but I don't know what it was.

Last edited by NonNonBa; 10-13-2018 at 05:05 AM. Reason: Fluxbox switching test
 
1 members found this post helpful.
Old 10-13-2018, 12:50 PM   #22
montagdude
Senior Member
 
Registered: Apr 2016
Distribution: Slackware
Posts: 2,011

Rep: Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619
Quote:
Originally Posted by NonNonBa View Post
Unfortunately, I'm jailed in X, and can't access a console.

Some news, though. I first tried to update my BIOS, and get things horribly worse. Closing firefox then systematically caused a general freeze. I tried to switch to fluxbox but it didn't help. I therefore downgraded the kernel to the original -stable 4.4.14, along with the firmware it came with. Everything worked back. So, I upgraded the firmware to the version in patches/ to see if the issue was there, but did not notice any change. Finally I upgraded the kernel to the 4.14.74 and everything is currently working like a charm. There's clearly something in the 4.4.157 allowing firefox to wreck the system which has since then been fixed, but I don't know what it was.
I'm glad you were able to solve the problem. In my experience, the kernel is always a likely culprit for these kinds of problems. In my case, something happened between 4.4.115 and 4.14 that caused lockups on one machine. It went away with 4.15 once I started building minimal kernels.
 
Old 10-14-2018, 03:36 AM   #23
keithpeter
Member
 
Registered: Nov 2015
Location: 52:30N 1:55W
Distribution: Slackware 15.0, OpenBSD 7.4
Posts: 310

Rep: Reputation: Disabled
I'm seeing lock-ups on Slackware64 14.2 after using slackpkg to upgrade to kernel 4.4.157 (huge kernel). This is on an old Dell Optiplex 7010 desktop. I'm using xfce4 as desktop environment.

I can reproduce the lock-up by simply plugging in a large external backup drive (usb, ntfs formatted) and then ejecting the drive by clicking on the icon in Thunar. The drive shows the 'data being written to...' notification, and if I move the Thunar window around, the desktop freezes.

I logged in to the desktop from the laptop using ssh and ran top until it froze, nothing dramatic just shows Thunar on 7% of cpu with load factor around 0.6.

Before I change the kernel, is there a way of monitoring the relevant logs over the ssh connection? like a way of tailing each of the logs once a second or something?

(Laptop is X61s running 32 bit with generic kernel and initrd as I've encrypted the ssd, no sign of anything yet)
 
Old 10-14-2018, 04:36 AM   #24
NonNonBa
Member
 
Registered: Aug 2010
Distribution: Slackware
Posts: 192

Rep: Reputation: Disabled
Quote:
Originally Posted by montagdude View Post
I'm glad you were able to solve the problem.
Thank you.

Quote:
Originally Posted by keithpeter View Post
I'm seeing lock-ups on Slackware64 14.2 after using slackpkg to upgrade to kernel 4.4.157 (huge kernel). This is on an old Dell Optiplex 7010 desktop. I'm using xfce4 as desktop environment.
My computer is also a Dell, Latitude E6400. The reboot has always been somewhat erratic, as long as I remember. Could all of this be related to the acpi?

Quote:
Originally Posted by keithpeter View Post
Before I change the kernel, is there a way of monitoring the relevant logs over the ssh connection? like a way of tailing each of the logs once a second or something?
Maybe with tail -f?
 
1 members found this post helpful.
Old 10-14-2018, 11:36 AM   #25
keithpeter
Member
 
Registered: Nov 2015
Location: 52:30N 1:55W
Distribution: Slackware 15.0, OpenBSD 7.4
Posts: 310

Rep: Reputation: Disabled
Quote:
Originally Posted by NonNonBa View Post
Thank you.


My computer is also a Dell, Latitude E6400. The reboot has always been somewhat erratic, as long as I remember. Could all of this be related to the acpi?


Maybe with tail -f?
Hello NonNonBa and all

Below is the /var/log/syslog extract from the Dell when I provoked a freeze by unmounting an external hard drive from Thunar

The first line in the code block below is the last line that I got from the tail -f command on the monitoring laptop, all the later lines were written to the log *after* the ssh session froze and after (I assume) the X session on the Dell froze.

It certainly looks as if the kernel is throwing its dummy about something. Anyone see anything useful here?

Code:
Oct 14 17:20:53 espresso kernel: [ 1178.981600] Buffer I/O error on dev sdb1, logical block 1, async page read
Oct 14 17:20:53 espresso kernel: [ 1179.027548] BUG: unable to handle kernel NULL pointer dereference at 000000000000001c
Oct 14 17:20:53 espresso kernel: [ 1179.027596] IP: [<ffffffffc020a420>] xhci_hub_control+0x1620/0x1720 [xhci_hcd]
Oct 14 17:20:53 espresso kernel: [ 1179.027627] PGD 0 
Oct 14 17:20:53 espresso kernel: [ 1179.027637] Oops: 0000 [#1] SMP 
Oct 14 17:20:53 espresso kernel: [ 1179.027664] Modules linked in: uas usb_storage ipv6 fuse rtl8187 mac80211 hid_generic cfg80211 eeprom_93cx6 rfkill usbhid hid i2c_dev coretemp hwmon intel_rapl iosf_mbi x86_pkg_temp_thermal intel_powerclamp kvm_intel kvm snd_hda_codec_hdmi irqbypass snd_hda_codec_realtek snd_hda_codec_generic crct10dif_pclmul i915 evdev dcdbas drm_kms_helper snd_hda_intel drm crc32_pclmul snd_hda_codec psmouse crc32c_intel thermal e1000e xhci_pci ptp serio_raw ghash_clmulni_intel intel_gtt snd_hda_core agpgart i2c_algo_bit snd_hwdep fb_sys_fops syscopyarea sysfillrect snd_pcm fjes 8250_fintek pps_core xhci_hcd ehci_pci mei_me mei sysimgblt video tpm_tis tpm i2c_i801 ehci_hcd i2c_core snd_timer snd processor lpc_ich soundcore button loop
Oct 14 17:20:53 espresso kernel: [ 1179.027999] CPU: 0 PID: 34 Comm: kworker/0:1 Not tainted 4.4.157 #2
Oct 14 17:20:53 espresso kernel: [ 1179.028017] Hardware name: Dell Inc. OptiPlex 7010/0MN1TX, BIOS A16 09/09/2013
Oct 14 17:20:53 espresso kernel: [ 1179.028053] Workqueue: usb_hub_wq hub_event
Oct 14 17:20:53 espresso kernel: [ 1179.028070] task: ffff88021497b100 ti: ffff880214ad4000 task.ti: ffff880214ad4000
Oct 14 17:20:53 espresso kernel: [ 1179.028090] RIP: 0010:[<ffffffffc020a420>]  [<ffffffffc020a420>] xhci_hub_control+0x1620/0x1720 [xhci_hcd]
Oct 14 17:20:53 espresso kernel: [ 1179.028125] RSP: 0018:ffff880214ad7a70  EFLAGS: 00010086
Oct 14 17:20:53 espresso kernel: [ 1179.028141] RAX: ffff880202718000 RBX: 0000000000000003 RCX: 0000000000000060
Oct 14 17:20:53 espresso kernel: [ 1179.028161] RDX: 0000000000000001 RSI: 0000000000000000 RDI: ffffffffc0209801
Oct 14 17:20:53 espresso kernel: [ 1179.028181] RBP: ffff880214ad7b20 R08: 0000000000000000 R09: 0000000000000000
Oct 14 17:20:53 espresso kernel: [ 1179.028199] R10: 0000000000000212 R11: 0000000000000000 R12: 0000000000000003
Oct 14 17:20:53 espresso kernel: [ 1179.028211] R13: ffff880212654000 R14: ffff880212650294 R15: ffff880212650000
Oct 14 17:20:53 espresso kernel: [ 1179.028224] FS:  0000000000000000(0000) GS:ffff88021e200000(0000) knlGS:0000000000000000
Oct 14 17:20:53 espresso kernel: [ 1179.028237] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 14 17:20:53 espresso kernel: [ 1179.028248] CR2: 000000000000001c CR3: 000000000240a000 CR4: 0000000000160670
Oct 14 17:20:53 espresso kernel: [ 1179.028259] Stack:
Oct 14 17:20:53 espresso kernel: [ 1179.028265]  0000000080220007 0000000000000000 ffff880213a7d0f0 ffff880213824d80
Oct 14 17:20:53 espresso kernel: [ 1179.028300]  ffff8802153e6880 00000000000000c3 0000000000000301 ffff880212650248
Oct 14 17:20:53 espresso kernel: [ 1179.028325]  0000000000000000 ffff880214ad0005 000000008120ea04 0000000000000004
Oct 14 17:20:53 espresso kernel: [ 1179.028348] Call Trace:
Oct 14 17:20:53 espresso kernel: [ 1179.028360]  [<ffffffff811b3210>] ? __kmalloc+0x230/0x250
Oct 14 17:20:53 espresso kernel: [ 1179.028374]  [<ffffffff81ac5605>] usb_hcd_submit_urb+0x375/0x940
Oct 14 17:20:53 espresso kernel: [ 1179.028388]  [<ffffffff811b3945>] ? __slab_free+0xb5/0x290
Oct 14 17:20:53 espresso kernel: [ 1179.028401]  [<ffffffff81ac6ad0>] usb_submit_urb+0x2c0/0x530
Oct 14 17:20:53 espresso kernel: [ 1179.028415]  [<ffffffff81acd2ad>] ? usb_release_bos_descriptor+0x2d/0x40
Oct 14 17:20:53 espresso kernel: [ 1179.028430]  [<ffffffff81ac739f>] usb_start_wait_urb+0x5f/0xe0
Oct 14 17:20:53 espresso kernel: [ 1179.028443]  [<ffffffff81ac74e5>] usb_control_msg+0xc5/0x110
Oct 14 17:20:53 espresso kernel: [ 1179.028458]  [<ffffffff81abbcc8>] set_port_feature+0x48/0x50
Oct 14 17:20:53 espresso kernel: [ 1179.028471]  [<ffffffff81abd3ff>] hub_port_disable+0xdf/0x160
Oct 14 17:20:53 espresso kernel: [ 1179.028483]  [<ffffffff81ac1a14>] hub_event+0xb84/0x1280
Oct 14 17:20:53 espresso kernel: [ 1179.028497]  [<ffffffff81c8c652>] ? __schedule+0x272/0x760
Oct 14 17:20:53 espresso kernel: [ 1179.028509]  [<ffffffff81c8c646>] ? __schedule+0x266/0x760
Oct 14 17:20:53 espresso kernel: [ 1179.028521]  [<ffffffff81c8c652>] ? __schedule+0x272/0x760
Oct 14 17:20:53 espresso kernel: [ 1179.028534]  [<ffffffff81c8c646>] ? __schedule+0x266/0x760
Oct 14 17:20:53 espresso kernel: [ 1179.028546]  [<ffffffff81c8c652>] ? __schedule+0x272/0x760
Oct 14 17:20:53 espresso kernel: [ 1179.028560]  [<ffffffff810a9570>] process_one_work+0x150/0x4b0
Oct 14 17:20:53 espresso kernel: [ 1179.028573]  [<ffffffff810a9918>] worker_thread+0x48/0x4a0
Oct 14 17:20:53 espresso kernel: [ 1179.028586]  [<ffffffff810a98d0>] ? process_one_work+0x4b0/0x4b0
Oct 14 17:20:53 espresso kernel: [ 1179.028600]  [<ffffffff810aee88>] kthread+0xd8/0xf0
Oct 14 17:20:53 espresso kernel: [ 1179.028612]  [<ffffffff81c8c652>] ? __schedule+0x272/0x760
Oct 14 17:20:53 espresso kernel: [ 1179.028625]  [<ffffffff810aedb0>] ? kthread_worker_fn+0x160/0x160
Oct 14 17:20:53 espresso kernel: [ 1179.028639]  [<ffffffff81c90995>] ret_from_fork+0x55/0x80
Oct 14 17:20:53 espresso kernel: [ 1179.028652]  [<ffffffff810aedb0>] ? kthread_worker_fn+0x160/0x160
Oct 14 17:20:53 espresso kernel: [ 1179.028664] Code: 00 e9 93 f7 ff ff 31 db eb 11 48 83 c3 01 48 81 fb 00 01 00 00 0f 84 c9 f4 ff ff 49 8b 84 df 18 04 00 00 48 85 c0 74 e2 48 8b 30 <83> 7e 1c 04 40 0f 96 c7 41 83 bd b0 00 00 00 3f 40 0f 9f c6 40 
Oct 14 17:20:53 espresso kernel: [ 1179.028888] RIP  [<ffffffffc020a420>] xhci_hub_control+0x1620/0x1720 [xhci_hcd]
Oct 14 17:20:53 espresso kernel: [ 1179.028908]  RSP <ffff880214ad7a70>
Oct 14 17:20:53 espresso kernel: [ 1179.028916] CR2: 000000000000001c
Oct 14 17:20:53 espresso kernel: [ 1179.030881] ---[ end trace 961a06215f68ddaf ]---
 
Old 10-14-2018, 03:37 PM   #26
abga
Senior Member
 
Registered: Jul 2017
Location: EU
Distribution: Slackware
Posts: 1,634

Rep: Reputation: 929Reputation: 929Reputation: 929Reputation: 929Reputation: 929Reputation: 929Reputation: 929Reputation: 929
If it's about an USB3 storage device (adapter as in the OP) connected to an USB2 port, then the following might be related, although it's been observed in 4.17 and patched some months ago:
https://patchwork.kernel.org/patch/10394653/
 
1 members found this post helpful.
Old 10-14-2018, 04:47 PM   #27
animeresistance
Member
 
Registered: Feb 2004
Distribution: Slackware
Posts: 230

Rep: Reputation: 39
Hi.

I experienced hardlocks because of kernels, at least for me, newer kernels like 4.9 or 4.14 series makes my desktop to make a hardlock, this kind of hardlocks rarely left something written in the logs (at least for me, nothing was written in the logs, so I don't know what caused them), maybe it will get fixed if you change the kernel to a 4.4 series.

Greetings.
 
Old 10-15-2018, 03:28 AM   #28
elcore
Senior Member
 
Registered: Sep 2014
Distribution: Slackware
Posts: 1,753

Rep: Reputation: Disabled
If you think kernel has caused it; there is a facility to detect lockups in the kernel, but you may have to recompile with:
Code:
CONFIG_SOFTLOCKUP_DETECTOR=y
CONFIG_HARDLOCKUP_DETECTOR=y
Quote:
Originally Posted by keithpeter View Post
I provoked a freeze by unmounting an external hard drive from Thunar
Is there a ntfs on the drive?
By the way, has anyone been able to trigger the lockup without X running?
 
1 members found this post helpful.
Old 10-15-2018, 06:58 AM   #29
keithpeter
Member
 
Registered: Nov 2015
Location: 52:30N 1:55W
Distribution: Slackware 15.0, OpenBSD 7.4
Posts: 310

Rep: Reputation: Disabled
Quote:
Originally Posted by elcore View Post
Is there a ntfs on the drive?
By the way, has anyone been able to trigger the lockup without X running?
Yup, it is an ntfs formatted external usb hard drive. The PC has USB3 and USB2 hubs detected, I'll try to work out which socket is which. The external hard drive is USB2

I'll try reverting to run level 3 and using mount to mount the drive and see if I can reproduce.

I'm guessing that I can compile the huge kernel from the slackbuild in /patches/source to enable the two configuration settings mentioned, but I'll have to read up on how to do that that. I have reverted to the very old 4.4.14 kernel from the DVD but I would like to have a patched updated kernel on this PC, and I'm learning stuff so all fine.

Thanks for replies: UK timezone and I tend to have time to try things in the evening.
 
Old 10-15-2018, 07:45 AM   #30
montagdude
Senior Member
 
Registered: Apr 2016
Distribution: Slackware
Posts: 2,011

Rep: Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619Reputation: 1619
Quote:
Originally Posted by elcore View Post
If you think kernel has caused it; there is a facility to detect lockups in the kernel, but you may have to recompile with:
Code:
CONFIG_SOFTLOCKUP_DETECTOR=y
CONFIG_HARDLOCKUP_DETECTOR=y


Is there a ntfs on the drive?
By the way, has anyone been able to trigger the lockup without X running?
Mine were on a server not running X. No idea if it was the same cause as these other reports, though. Thanks for the tip on the kernel config. I will try that if I encounter the issue again.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Slackware current 64 hardlock GreenFireFly Slackware 7 12-20-2015 02:19 PM
Slackware is 21 Years Old jeremy Linux - News 1 07-18-2014 11:33 PM
Quasi-random system failure/hardlock/freeze with compiz-fusion and Intel video kopatops Linux - Software 0 01-07-2011 03:40 AM
dvb-utils' scan causes hardlock (hauppauge nova-t usb2 stick) neocookie Linux - General 3 01-30-2007 03:08 PM
Trying Slackware after years away Agent69 Slackware 5 06-04-2002 06:30 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Slackware

All times are GMT -5. The time now is 02:10 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration