I apologize if this has been done before. I did do a search but didn't find anything I haven't tried already. I am at my wits end....
My mail server, running Slackware 10.2, is rebooting for reasons I can't discover. /var/log/syslog and /var/log/messages show nothing helpful. /var/log/debug only shows the following:
Code:
Mar 18 04:53:45 mail kernel: CPU: After generic, caps: 3febf9ff 00000000 00000000 00000000
Mar 18 04:53:45 mail kernel: CPU: Common caps: 3febf9ff 00000000 00000000 00000000
Mar 18 04:53:45 mail kernel: eth0: Identified 8139 chip type 'RTL-8100B/8139D'
Mar 18 04:53:48 mail kernel: 00:0a.0: tulip_stop_rxtx() failed
I am logging
sensors and
uptime to /var/log/messages and neither shows any anomalies. The load average leading up to the reboot is minimal.
Code:
Mar 18 10:30:01 mail sensors: w83697hf-isa-0290
Mar 18 10:30:01 mail sensors: Adapter: ISA adapter
Mar 18 10:30:01 mail sensors: VCore: +1.65 V (min = +1.62 V, max = +1.78 V)
Mar 18 10:30:01 mail sensors: +3.3V: +3.22 V (min = +3.14 V, max = +3.46 V)
Mar 18 10:30:01 mail sensors: +5V: +4.97 V (min = +4.74 V, max = +5.24 V)
Mar 18 10:30:01 mail sensors: +12V: +11.63 V (min = +10.83 V, max = +13.19 V)
Mar 18 10:30:01 mail sensors: -12V: -11.72 V (min = -13.16 V, max = -10.90 V)
Mar 18 10:30:01 mail sensors: V5SB: +5.46 V (min = +4.94 V, max = +6.05 V)
Mar 18 10:30:01 mail sensors: VBat: +3.14 V (min = +2.40 V, max = +3.60 V)
Mar 18 10:30:01 mail sensors: CPUFan: 3341 RPM (min = 2986 RPM, div = 4)
Mar 18 10:30:01 mail sensors: CPUTemp: +40.5 C (high = +63 C, hyst = +58 C) sensor = diode (beep)
Mar 18 10:30:01 mail sensors: alarms:
Mar 18 10:30:01 mail sensors: beep_enable:
Mar 18 10:30:01 mail sensors: Sound alarm enabled
Mar 18 10:30:01 mail sensors:
I have run memtest86[+], cpuburn, (both fine) and have smartctl running on the hard-drive with no errors.
The machine serves as my mail server, DNS server, and gateway to the internet. It's already running as bare-bones as I can make it with a fairly restrictive firewall. There doesn't seem to be any particular pattern to the reboots (eg, time of day, network traffic, etc) that I can determine.
The only clue I have is that I can make it reboot by doing a
zless /var/log/messages.1.gz and then doing a search for "Mar 18" (typing in the command '/Mar 18' and hitting enter). Obviously I'm not doing that at 4am.
I have three other machines that have been up for 36 days, but this one reboots several times a day.
I'm leaning towards a hardware problem but I'm having trouble isolating it. The machine isn't terribly old (the reboots have only started in the past few months and don't coincide with any new software) and I'd rather not have to replace the entire thing if I can avoid it, especially if it turns out to be a software problem.
Any suggestions would be appreciated. If I forgot any information that might be helpful, let me know and I will post it.
Thank you in advance.