Linux - GeneralThis Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
Hi there,
I am running the most recent (updated by yum) fedora release of redhat. I use this machine as print/file/web/mail/samba server. Every once and a while it just shuts itself off. I'm not sure where to look in the logs to find the answer.
Distribution: Fedora x86 and x86_64, Debian PPC and ARM, Android
Posts: 4,500
Rep:
Usually, a poweroff is caused by exceeding the thermal critical limit. There may be nothing in the logs, as Linux didn't make the call - the hardware/BIOS did.
Check to make sure all fans are running (and properly oriented - blowing in the correct direction). Monitor the temperature of the system.
Another possibility is a brief power outage. Make sure that you are on a UPS. Make sure that you are not exceeding the power draw on your power supply. If your server has redundant power supplies, make sure that you are using independant power sources for each.
By the way, I don't think its a heat issue - I had just checked the operation of all of the fans (I do that periodically) and I have an extra one on the chassis itself to boot.
Distribution: Fedora x86 and x86_64, Debian PPC and ARM, Android
Posts: 4,500
Rep:
Well, have you set up the lmsensors as PTrenholme suggested, to check the temperature readings before a shutdown? Have you verified that you are not overdrawing your power supply? Do you have a UPS? Also, are you overclocking? Are you using an aftermarket heatsink on the CPU?
Try this; open a command window and enter:
while true; do true; done
That's an infinite loop (you can interrupt it with Ctrl-c). It will cause your CPU temperature to go up quickly. If your system shuts down within about 5 minutes of starting that, it's a heat issue. If not, it's likely a power issue.
Well, have you set up the lmsensors as PTrenholme suggested, to check the temperature readings before a shutdown? I could not get this to work (compile, install, etc.) I was able to install the rpm, but I can't find where it put the binaries.
Have you verified that you are not overdrawing your power supply? I am definetly not overdrawing the ps. I have one card - everything else is onboard the motherboard.
Do you have a UPS? Actually, I have two - I live in Florida :-)
Also, are you overclocking? No
Are you using an aftermarket heatsink on the CPU? No - it is an Intel - at least it came with the CPU.
Try this; open a command window and enter:
while true; do true; done Did this - it has been running now for over an hour - the computer is still up.
Any other suggestions? Where in the logs could I find a clue - if the OS is shutting it down for some reason?
Thanks very much for your help,
Jordan
Distribution: Fedora x86 and x86_64, Debian PPC and ARM, Android
Posts: 4,500
Rep:
If the OS is shutting it down (instead of a spontaneous power off), edit /var/log/messages after booting back up. Go to the bottom of the file, then search backwards for "restart" - the syslogd restart message. The messages before this are the last messages recorded by the system before the poweroff. If this was a normal system initiated operation, you will see a series of service termination messages. For example:
Code:
Aug 8 01:22:41 mmouse shutdown: shutting down for system reboot
Aug 8 01:22:43 mmouse init: Switching to runlevel: 6
Aug 8 01:22:44 mmouse cups-config-daemon: cups-config-daemon -TERM succeeded
Aug 8 01:22:45 mmouse dbus: avc: 1 AV entries and 1/512 buckets used, longest chain length 1
Aug 8 01:22:45 mmouse messagebus: messagebus -TERM succeeded
Aug 8 01:22:45 mmouse cups: cupsd shutdown succeeded
Aug 8 01:22:50 mmouse httpd: httpd shutdown succeeded
Aug 8 01:22:50 mmouse sshd: sshd -TERM succeeded
Aug 8 01:22:51 mmouse sendmail: sendmail shutdown succeeded
Aug 8 01:22:51 mmouse sendmail: sm-client shutdown succeeded
Aug 8 01:22:51 mmouse spamassassin: spamd shutdown succeeded
Aug 8 01:22:52 mmouse dhcpd: dhcpd shutdown succeeded
Aug 8 01:22:52 mmouse dhcpd: dhcpd shutdown succeeded
Aug 8 01:22:52 mmouse smartd[3966]: smartd received signal 15: Terminated
Aug 8 01:22:52 mmouse smartd[3966]: smartd is exiting (exit status 0)
Aug 8 01:22:53 mmouse smartd: smartd shutdown succeeded
Aug 8 01:22:53 mmouse xinetd[4084]: Exiting...
Aug 8 01:22:53 mmouse xinetd: xinetd shutdown succeeded
Aug 8 01:22:54 mmouse acpid: acpid shutdown succeeded
Aug 8 01:22:55 mmouse crond: crond shutdown succeeded
Aug 8 01:22:55 mmouse ntpd[4104]: ntpd exiting on signal 15
Aug 8 01:22:55 mmouse mdmonitor: mdadm shutdown succeeded
Aug 8 01:22:55 mmouse kernel: Kernel logging (proc) stopped.
Aug 8 01:22:55 mmouse kernel: Kernel log daemon terminating.
Aug 8 01:22:56 mmouse syslog: klogd shutdown succeeded
Aug 8 01:22:56 mmouse exiting on signal 15
I really don't think you need it, since over-temp problems are not too likely, and your BIOS should be monitoring the temp.s for you, but, if you'd like to see the temperatures, the lm_sensors package is in, I believe, the FC4 "base" repository, so just do a
Code:
# yum install lm_sensors
(The version on ATrpms is more current, if you care.)
If you use KDE, try a
Code:
# yum install kdeutils
instad (which should install lm_sensors as a dependency), and the use the "KSensors" applet to display the temperatures on your panel, and set alarms. (If you don't see any hardware temperature sensors, look at info sensors and info sensors.conf.)
By the way, my system uses a 3GHz Intel 745 H/T, which was running hot until I replaced the stock cooler with an aftermarket one. Now I'm seldom above 110 F.
Caution: Replacing the heat-sink is not easy for the inexperienced, and experience can be expensive to acquire. It cost me $200 for a new CPU when I bent the pins removing the old CPU/heat-sink. (I thought I could remove the heat-sink leaving the CPU in the M/B, but the thermal grease had a different idea, and prevailed.)
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.