LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Linux - Hardware (https://www.linuxquestions.org/questions/linux-hardware-18/)
-   -   sensors not working and core overheating (https://www.linuxquestions.org/questions/linux-hardware-18/sensors-not-working-and-core-overheating-647508/)

144419855310001 06-06-2008 05:01 PM

sensors not working and core overheating
 
Hi

The core on my laptop seems to be overheating without any effort at all, and I also can't get detection of my cpu temperature.

I've been much more concerned about monitoring this since I installed a new hard drive. My previous one died after just two years, which I suspect was because of permanent damage I must have done when I did some transcoding when I first bought my laptop. I also have these colourled lines down the side of my screen since exactly the same time, also heat damage(?)

Anyway, I installed lm-sensors, added k8temp to /etc/modules as suggested by sensors-detect and installed hddtemp. All of this info I just randomly stumbled upon, so this is no indication of any experience temp monitoring.

1) My cpu temp monitor is not picking up anything.
gnome-sensor and cat /proc/acpi/thermal_zone/THRM/temperature just read:
Quote:

temperature: 0 C
I would like to know what temp my cpu is!!! What can I do? The module k8temp is loaded
Quote:

lsmod | grep k8temp
k8temp 7680 0
but it doesn't do anything :-(((


2) My Core0 temp is rarely below 49 C (hovers around 50 C at rest, and all I have to do is say open a few tabs in firefox and open a youtube video and it's spiked past 60 C, when I get a warning message from gnome-sensors applet. Is 60 C the reasonable upper limit?
My hard drive just stays level at around 40 C - is hddtemp detecting this incorrectly?

example "sensors" output:
Quote:

user@home:~$ sensors
k8temp-pci-00c3
Adapter: PCI adapter
Core0 Temp: +51.0°C
Just now I opened appearance settings to turn off aiglx effects and it was staying at 66C while I had the window open! Here is the output of a script that polled core temp with the sensors command every 2 seconds while I had a 3D game open:



Quote:

Adapter: PCI adapter Core0 Temp: +50.0°C
Adapter: PCI adapter Core0 Temp: +51.0°C
Adapter: PCI adapter Core0 Temp: +57.0°C
Adapter: PCI adapter Core0 Temp: +60.0°C
Adapter: PCI adapter Core0 Temp: +60.0°C
Adapter: PCI adapter Core0 Temp: +61.0°C
Adapter: PCI adapter Core0 Temp: +61.0°C
Adapter: PCI adapter Core0 Temp: +62.0°C
Adapter: PCI adapter Core0 Temp: +63.0°C
Adapter: PCI adapter Core0 Temp: +64.0°C
Adapter: PCI adapter Core0 Temp: +65.0°C
Adapter: PCI adapter Core0 Temp: +66.0°C
Adapter: PCI adapter Core0 Temp: +64.0°C
Adapter: PCI adapter Core0 Temp: +66.0°C
Adapter: PCI adapter Core0 Temp: +66.0°C
Adapter: PCI adapter Core0 Temp: +66.0°C
Adapter: PCI adapter Core0 Temp: +67.0°C
Adapter: PCI adapter Core0 Temp: +67.0°C
Adapter: PCI adapter Core0 Temp: +67.0°C
Adapter: PCI adapter Core0 Temp: +67.0°C
Adapter: PCI adapter Core0 Temp: +68.0°C
Adapter: PCI adapter Core0 Temp: +68.0°C
Adapter: PCI adapter Core0 Temp: +68.0°C
Adapter: PCI adapter Core0 Temp: +69.0°C
Adapter: PCI adapter Core0 Temp: +69.0°C
Adapter: PCI adapter Core0 Temp: +69.0°C
Adapter: PCI adapter Core0 Temp: +69.0°C
Adapter: PCI adapter Core0 Temp: +68.0°C
Adapter: PCI adapter Core0 Temp: +69.0°C
Adapter: PCI adapter Core0 Temp: +68.0°C
Adapter: PCI adapter Core0 Temp: +70.0°C <-----------
Adapter: PCI adapter Core0 Temp: +68.0°C
I am very very concerned about this! I just spent money from limited means on a new hard drive (see above) and I really can't afford for my laptop to break. Why is my core running so hot? Does the fan need to be more aggressive? (Always a...) Newb question: and just what does it mean by my "core"? (my motherboard?) Why is my cpu temp not being recorded?

How can I fix these problems????!!!!!

thank you



-----------------------------
my specs: HP Pavilion dv5000 series AMD 64 turion 1.8Ghz processor (k8), with new seagate momentus 160gb hdd. any other info you need you'll have to tell me!

GrapefruiTgirl 06-06-2008 06:36 PM

What exactly gives the impression that it is actually overheating? Just the Gnome Applet thingy? Makes me wonder what exactly that applet is using as its source of temperature data, and if it is correct, especially as you are not having success with lm_sesnors.

As for the colored lines on the screen, it *could* be from heat damage, either direct OR indirect, but it could simply be a failing LCD panel.

Without doing my own research, I don't know much about the dv5000 or the AMD processor you have, but I would first look up the CPU you have, on a specs site or on the AMD site, and see if you can find out what its operating specs are. Check for the idle and loaded temperatures, and see if yours are indeed high.
Older CPUs run hotter, no doubt. Many do run at 60 Celcius or more *AT the CORE* which means inside the actual processer itself, where the work gets done. Depending on the CPU, a 60'C core temp may be normal.
The *DIE TEMP* or *SOCKET TEMP* is the measure of the OUTSIDE HOUSING of the CPU-- the black or silver thing we are all familiar with, and is often measured with a diode or thermistor of some sort located under the CPU or in the socket assy somewhere. Common socket temps are in the 30-40'C range. If a socket temperature were at or above 60'C, I would surely be worried: because the CORE temp would be absolutely scalding hot.

My dual core Intel E2160 runs (USUALLY) with a core temp of around 40'C at idle, and a 27-30'C socket temp. At the moment, it is warm in here, plus I need to clean my CPU cooler, so the temp at the moment is about 55'C at the core, and 40'C at the socket (for your reference).

If it is at all possible (age, environment, etc..) that the laptop cooling system is dirty, including the CPU fan, you definitely should contact a laptop cleaning place for a cleaning and inspection, or do it yourself if you are able. Especially as you see the temp going up but not easily coming back down, it does sound like inadequate cooling.

FYI - there is sometimes a *motherboard* temp sensor too (I have one), which reads the ambient average temp around the motherboard itself. Generally it is near or slightly above room temperature, though likely a bit warmer in the case of a laptop.

Best of luck-- and do check the cleanliness of the machine's cooling gear SOON, before worrying too too much about what the applet things are saying. Seeing with (your) eyes is better, after all if the applet said all was nice and frosty inside, and the computer then died of overheating, no one would be impressed.

Sasha

144419855310001 06-07-2008 08:48 AM

Ok, I have gnome-sensors currently displaying three sensors. (the following is to help me work things out myself, the info here is not to lecture anyone else with what htey already know!)


#1 I have hddtemp reading my hard drive temperature: usage " hddtemp /dev/hda "

#2 The first is from lmsensors, which is reading from the k8temp module. To quote the k8temp freshmeat page, it is recording:
Quote:

k8temp is a BSD utility to read the on-die temperature sensors provided by AMD K8 processors, including most Athlon 64s and Opterons.
#3 Thirdly, there is the acpi thermal zone module, which is supposed to be reading from
Quote:

/proc/acpi/thermal_zone/THRM/temperature
...but isn't working. Linux reads 0 C, and apparently from /proc/acpi/thermal_zone/THRM/state
Quote:

user@home:~$ cat /proc/acpi/thermal_zone/THRM/state
state: ok
is "ok"!!!! 0 C is apparently OK!

The purpose of this module is:
Quote:

ACPI allows OSPM to be proactive in its system cooling policies. With OSPM control of the operating environment, cooling decisions can be made based on the application load on the CPU and the thermal heuristics of the system. Graceful shutdown of the OS at critical heat levels becomes possible as well.
So to reason this out, if #2 above which is the "SOCKET TEMP" / "OUTSIDE HOUSING" is at around 52 - 60 C depending on load, then #3 which is the internal CPU temperature, must be much higher (worryingly high), as you're saying?

I've just switched my cpu frequency scaling governor to powersave and am avoiding heavy use programs. However, I'm curious that my fan *is working* - I hear it when the temperature spikes, even the internal CPU temp (#3) is not being detected! Is this working solely from the socket temp (#2) then? Or is the fan controlled by the BIOS, does this bypass linux?


So on the basis of the quote about ACPI and OSPM above, it's no wonder that my hdd died after 2 years and my LCD screen suffered heat damage!
JUST WHY THE £$%£$%$£ *$£$* didn't ANYTHING in linux alert me to the fact that the ACPI temp wasn't being detected? Do the developers REALLY think that 0 C is within the normal temperature range?!!!!!!! (see output of /proc/acpi/thermal_zone/THRM/state above)

I'm going to check out the temperature operating ranges for my AMD turion ML-32. I'm having a lot of difficulty finding this info though.
I'll ask about cleaning my fan. I'm also going to boot into XP, find a windows temp monitor and see if the problem is just in linux? Is there anything else I can do? Set the fan to be more aggressive? Why isn't the acpi processor thermal monitoring not working? Is there an error log for this that I could read???

Thank you


All times are GMT -5. The time now is 03:08 PM.