Hello:
Quote:
Originally Posted by business_kid
My no #1 guess - permissions ...
|
If this problem were of a permanent nature ie: if it happened at
every shutdown, I would totally agree with you.
But because I can either
have permission to do X or
not have permission to do X ie: shutdown the workstation, it seems (to me) highly unlikely that this is a premissions issue.
As I have related previously, this is an issue that crops up in a random manner without my being able to reproduce it, no matter what.
Of course, I'm quite sure that there
is a way to reproduce it, I just have not been able to find it.
But all is not lost:
What I can say is that
one of the symptoms ie: the random reboots after shutdown I mentioned in my OP has not cropped up again, probably due to having been able to disable WoL.
Quote:
Description of the problem:
On shutdown, the rig will do one of three things:
1. shut down properly
2. shut down properly and after about 5s. reboot start reboot and freeze
3. freeze during the shutdown at this point ...
|
The issue that remains unsolved is #3, the one that seems to be linked to the e1000e EEE Tx Timer.
Quote:
Originally Posted by business_kid
... put little faith in latest drivers.
|
You may be right in that.
Just in case, I decided to go through the updating of the driver.
I downloaded the latest available version from Intel:
https://downloadcenter.intel.com/dow...3.4.2.1.tar.gz
To my surprise (!) and even though the readme file indicated that the previous module had to be removed before loading the new one, on [c]# make install[/c], the script installed the module correctly without further intervention from me, which was a relief.
Good going Intel! (for once).
Code:
[root@devuan groucho]# modinfo e1000e
filename: /lib/modules/4.9.0-8-amd64/updates/drivers/net/ethernet/intel/e1000e/e1000e.ko
version: 3.4.2.1-NAPI
license: GPL
description: Intel(R) PRO/1000 Network Driver
author: Intel Corporation, <linux.nics@intel.com>
--- snip ---
[root@devuan groucho]#
But even with the latest driver I am not able to disable the e1000e EEE Tx LPI Timer:
Code:
[root@devuan groucho]# ethtool --set-eee eth0 tx-lpi off
Cannot get EEE settings: Operation not supported
[root@devuan groucho]# ethtool --set-eee eth0 eee off
Cannot get EEE settings: Operation not supported
[root@devuan groucho]#
Now, "Operation not supported" cannot mean anything but "ethtool cannot do this" or maybe "the driver cannot do this" as the card clearly supports EEE.
Check the screen output showing that the tx-lpi timer
is in action:
Code:
--- copied from uploded screen image ---
Devuan GNU/Linux ascii devuan tty1
devuan login: [483.367459] EXT-fs (sdc1): re-mounted. Opts: (null)
[485.772216] e1000e: eth0 NIC Link is Down
[485.776885] kvm: exiting hardware virtualization
[485.777756] sd 9:0:3:0: [sdf] Synchronizing SCSI cache
[485.778154] sd 9:0:2:0: [sdf] Synchronizing SCSI cache
[485.781519] e1000e: EEE TX LPI TIMER: 00000000
[485.785219] ACPI: Preparing to enter sleep state S5
[485.868007] reboot: Power down
--- copied from uploded screen image ---
From [485.868007] on, the rig freezes with the case/CPU fans blasting loudly at 100% output.
The only way out of this situation is a hard shutdown.
Quote:
Originally Posted by business_kid
Go after the usual suspects - ACPI & BIOS.
|
The only part of the BIOS that I will not touch again is the Intel ME settings.
Changing anything in there will wreak havoc. Been there , done that and really feared for the hardware. =-/
Quote:
Originally Posted by business_kid
... start acpid with a '-l' option, it logs events to syslog.
|
Done that:
Code:
groucho@devuan:~$ tail -1000 /var/log/syslog | grep -i logging
Dec 21 09:09:46 devuan acpid: waiting for events: event logging is on
groucho@devuan:~$
But I have found nothing in syslog.
Or auth.log, user.log, messages, daemon.log, kern.log, debug and boot for that matter.
Quote:
Originally Posted by business_kid
I'm amazed that your BIOS won't disable wake on lan ...
|
Like I mentioned earlier in the thread: I am unable to disable the Gbe controller: it is greyed out on the BIOS screen.
If you think that is strange, how about this: the BIOS reports the rig as 'Hand Held Computer', no matter who you ask:
Code:
[root@devuan groucho]# lshw
devuan
description: Hand Held Computer
product: Ultra 24 (Null)
vendor: Sun Microsystems
version: 0.00.01
serial: 0123456789
width: 64 bits
capabilities: smbios-2.5 dmi-2.5 smp vsyscall32
configuration: boot=normal chassis=handheld family=Null sku=Null uuid=295894AA-8E10-4A53-9090-00144F4AA281
[root@devuan groucho]#
Code:
[root@devuan groucho]# inxi -M
Machine: Device: portable System: Sun Microsystems product: Ultra 24 v: 0.00.01 serial: 0123456789
Mobo: Sun Microsystems model: Ultra 24 v: 50 serial: 3K08QG
BIOS: American Megatrends v: 1.56 date: 01/21/2011
[root@devuan groucho]#
So imagine the rest ...
[rant]
It's a real pity that the destiny of such excellently designed/built hardware was eventually obscured by the dickheads that wrote the BIOS and then, having received thousands of bug reports/complaints, never fixed them properly. I hope they are all rotting in some cold, dark Oracle dungeon.
[/rant]
I'll give this a rest for a few days: have to cook for monday/tuesday.
Thank you very much for your input.
Have a very Merry Xmas. *<8^D !
Cheers,
A.