LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Hardware (https://www.linuxquestions.org/questions/linux-hardware-18/)
-   -   watch sensors stops reporting fan rpms (https://www.linuxquestions.org/questions/linux-hardware-18/watch-sensors-stops-reporting-fan-rpms-4175596361/)

alanware 12-28-2016 06:27 PM

watch sensors stops reporting fan rpms
 
I am running Slackware64 14.2 on a supermicro motherboard. I have run sensors-detect and added entries into rc.modules.local that were output post sensors-detect. When I run watch sensors, I have cpu core temps, voltages, fan speeds and a couple of other temps. After a period of time the fan speeds, voltages, other temps zero out and stop reporting. The core temps still work like normal. Any ideas what is causing this?

Ztcoracat 12-28-2016 09:36 PM

I'm guessing that it stops reporting because of some sort of a default.
Looking at the man page for 'watch' the default runs every 2 seconds.
https://linux.die.net/man/1/watch

Maybe it zeros out on the fan because of times that the fan stops running?

Watch will run until it's interrupted.
http://beerpla.net/2007/08/04/watch-...ever-heard-of/

Not sure if the sensors.config file could be adjusted or not to stop the 'default'
(just an idea)
https://linux.die.net/man/5/sensors.conf

Are you getting any time out errors?

seasons 12-30-2016 04:16 AM

It sounds like it could be an issue with the kernel module used for the sensor chip on the mobo. (The core temp is the sensor on the CPU and uses a different module.)
What module does sensors-detect say you're using? Are there any clues in dmesg when it stops working?

alanware 01-03-2017 05:36 PM

My sensors-detect finds three different modules.

Code:

coretemp
i5500_temp
w83795

Watch keeps reporting the core temps and my gpu temp, just stops reporting fan speeds and voltages.

Ztcoracat 01-03-2017 07:28 PM

Any clues in the dmesg log?

Using sensors with the -u flag might help.

Code:

-u    Raw  output.  This  mode is suitable for debugging and for post-
              processing of the output by scripts.

What kernel are you running and what mobo do you have?

Here's the cmd for your mobo:

Code:

cat /sys/devices/virtual/dmi/id/board_vendor /sys/devices/virtual/dmi/id/board_name /sys/devices/virtual/dmi/id/board_version

alanware 01-03-2017 08:04 PM

Motherboard

Code:

Supermicro
X8DT3
2.0

sensors -u
Code:

coretemp-isa-0000
Adapter: ISA adapter
Core 0:
  temp2_input: 33.000
  temp2_max: 69.000
  temp2_crit: 79.000
  temp2_crit_alarm: 0.000
Core 1:
  temp3_input: 31.000
  temp3_max: 69.000
  temp3_crit: 79.000
  temp3_crit_alarm: 0.000
Core 2:
  temp4_input: 25.000
  temp4_max: 69.000
  temp4_crit: 79.000
  temp4_crit_alarm: 0.000
Core 8:
  temp10_input: 30.000
  temp10_max: 69.000
  temp10_crit: 79.000
  temp10_crit_alarm: 0.000
Core 9:
  temp11_input: 29.000
  temp11_max: 69.000
  temp11_crit: 79.000
  temp11_crit_alarm: 0.000
Core 10:
  temp12_input: 36.000
  temp12_max: 69.000
  temp12_crit: 79.000
  temp12_crit_alarm: 0.000

radeon-pci-0200
Adapter: PCI adapter
temp1:
  temp1_input: 43.000
  temp1_crit: 120.000
  temp1_crit_hyst: 90.000

w83795adg-i2c-0-2f
Adapter: SMBus I801 adapter at 0400
in0:
  in0_input: 0.000
  in0_min: 0.672
  in0_max: 1.494
  in0_alarm: 0.000
in1:
  in1_input: 0.000
  in1_min: 0.672
  in1_max: 1.494
  in1_alarm: 0.000
in2:
  in2_input: 0.000
  in2_min: 1.352
  in2_max: 1.654
  in2_alarm: 0.000
in3:
  in3_input: 0.000
  in3_min: 1.128
  in3_max: 1.382
  in3_alarm: 0.000
in4:
  in4_input: 0.000
  in4_min: 1.632
  in4_max: 1.998
  in4_alarm: 0.000
in5:
  in5_input: 0.000
  in5_min: 0.000
  in5_max: 2.046
  in5_alarm: 0.000
in6:
  in6_input: 0.000
  in6_min: 1.200
  in6_max: 1.654
  in6_alarm: 0.000
in7:
  in7_input: 0.000
  in7_min: 1.200
  in7_max: 1.654
  in7_alarm: 0.000
in11:
  in11_input: 0.000
  in11_min: 0.000
  in11_max: 2.046
  in11_alarm: 0.000
+3.3V:
  in12_input: 0.000
  in12_min: 2.952
  in12_max: 3.666
  in12_alarm: 0.000
3VSB:
  in13_input: 0.000
  in13_min: 2.952
  in13_max: 3.666
  in13_alarm: 0.000
Vbat:
  in14_input: 0.000
  in14_min: 2.952
  in14_max: 3.666
  in14_alarm: 0.000
fan1:
  fan1_input: 0.000
  fan1_min: 709.000
  fan1_alarm: 0.000
fan2:
  fan2_input: 0.000
  fan2_min: 709.000
  fan2_alarm: 0.000
fan3:
  fan3_input: 0.000
  fan3_min: 709.000
  fan3_alarm: 0.000
fan4:
  fan4_input: 0.000
  fan4_min: 709.000
  fan4_alarm: 0.000
fan5:
  fan5_input: 0.000
  fan5_min: 709.000
  fan5_alarm: 0.000
fan6:
  fan6_input: 0.000
  fan6_min: 709.000
  fan6_alarm: 0.000
fan7:
  fan7_input: 0.000
  fan7_min: 709.000
  fan7_alarm: 0.000
fan8:
  fan8_input: 0.000
  fan8_min: 709.000
  fan8_alarm: 0.000
temp1:
  temp1_input: 0.000
  temp1_max: 85.000
  temp1_max_hyst: 80.000
  temp1_crit: 100.000
  temp1_crit_hyst: 95.000
  temp1_alarm: 0.000
  temp1_type: 3.000
temp2:
  temp2_input: 0.000
  temp2_max: 85.000
  temp2_max_hyst: 80.000
  temp2_crit: 100.000
  temp2_crit_hyst: 95.000
  temp2_alarm: 0.000
  temp2_type: 3.000
temp5:
  temp5_input: 0.000
  temp5_max: 85.000
  temp5_max_hyst: 80.000
  temp5_crit: 75.000
  temp5_crit_hyst: 70.000
  temp5_alarm: 0.000
  temp5_type: 4.000
temp7:
  temp7_input: 0.000
  temp7_max: 95.000
  temp7_max_hyst: 92.000
  temp7_crit: 95.000
  temp7_crit_hyst: 92.000
  temp7_alarm: 0.000
  temp7_type: 6.000
temp8:
  temp8_input: 0.000
  temp8_max: 95.000
  temp8_max_hyst: 92.000
  temp8_crit: 95.000
  temp8_crit_hyst: 92.000
  temp8_alarm: 0.000
  temp8_type: 6.000
intrusion0:
  intrusion0_alarm: 0.000


Ztcoracat 01-03-2017 08:05 PM

Finding out which module that the kernel is using for your fan could be the key. Than you could dmesg the driver.

https://wiki.archlinux.org/index.php/ACPI_modules

https://wiki.archlinux.org/index.php/fan_speed_control

I don't think Fan Speed Control will help.
All 8 read outs (Fan1_min: 709.00 thru Fan8) of the fan appear to be normal with a consistent reading. IF the module wasn't loading you wouldn't get a reading from the sensors cmd at all. So.....

I'm baffled as to why it stops reporting fan speeds.

I don't know what that 3rd module (w83195) in post #4 is. Never seen it before.
I'll look up your board online and that module and see what I can find.

Did you install a newer kernel recently?

alanware 01-03-2017 08:33 PM

I have the following in my /etc/rc.d/rc.modules.local

Code:

/sbin/modprobe coretemp
/sbin/modprobe i5500_temp
/sbin/modprobe w83795


alanware 01-03-2017 09:14 PM

I forgot kernel, it is 4.4.14.

Ztcoracat 01-03-2017 09:24 PM

Quote:

Originally Posted by alanware (Post 5650030)
I have the following in my /etc/rc.d/rc.modules.local

Code:

/sbin/modprobe coretemp
/sbin/modprobe i5500_temp
/sbin/modprobe w83795


Try this and see if it returns anything worthy:-
Code:

modprobe w83795
Code:

dmesg | grep <name of driver>
I found your mobo-;)
http://www.supermicro.com/products/m...5500/X8DT3.cfm

:idea: Maybe once the module is loads it's interrupted and that's why the reading times out.

Ztcoracat 01-03-2017 09:27 PM

Quote:

Originally Posted by alanware (Post 5650039)
I forgot kernel, it is 4.4.14.

Thanks:-
If there are any issues with that kernel I'll post it.

alanware 01-03-2017 09:43 PM

The module was already loaded at boot, but the dmesg did show the following.

Code:

[21813.757157] i801_smbus 0000:00:1f.3: SMBus is busy, can't use it!
[21813.757160] w83795 0-002f: Failed to read from register 0x043, err -16
[21813.757174] i801_smbus 0000:00:1f.3: SMBus is busy, can't use it!
[21813.757177] w83795 0-002f: Failed to read from register 0x044, err -16
[21813.757192] i801_smbus 0000:00:1f.3: SMBus is busy, can't use it!
[21813.757194] w83795 0-002f: Failed to read from register 0x045, err -16
[21813.757209] i801_smbus 0000:00:1f.3: SMBus is busy, can't use it!
[21813.757212] w83795 0-002f: Failed to read from register 0x046, err -16
[21813.757229] i801_smbus 0000:00:1f.3: SMBus is busy, can't use it!
[21813.757231] w83795 0-002f: Failed to write to register 0x040, err -16


seasons 01-03-2017 09:49 PM

This is the kernel module in question:
https://www.mjmwired.net/kernel/Docu...n/hwmon/w83795
It interacts with the the Super I/O chip (the one that monitors fan speeds and voltages) on the motherboard.

Obviously, it stops responding for some reason. You should look at the end of dmesg AFTER it stops responding for clues.

seasons 01-03-2017 09:58 PM

Oh, I didn't see your post before mine. A quick google shows a very similar error on another SuperMicro board:
http://lm-sensors.lm-sensors.narkive...uestion#post10

I'm not sure what to suggest to you.

Ztcoracat 01-03-2017 10:24 PM

Here's one for the exact mobo:-
http://lm-sensors.lm-sensors.narkive...ol-not-working

Thanks, seasons for posting the link to that kernel module.;)
---------------------------------------------------------------------------------

What does the end of the dmesg log say?


All times are GMT -5. The time now is 02:39 AM.