LinuxQuestions.org
01-02-2008, 06:36 PM   #1
boozer_2
LQ Newbie
 
Registered: Mar 2005
Posts: 16

Rep: Reputation: 0
Time clock resetting with more than 2 CPUs


I have an old Compaq ProLiant server with four PIII Xeon 700 MHz processors. When I updated from Fedora Core 6 to Fedora 8, I started having all sorts of problems with lockups and freezes. After much troubleshooting, it appears to occur only when 3 or 4 processors are installed. It normally takes a couple of hours before I start seeing trouble. I ran for 24 hours on 1 and 2 CPUs with no problem, but when I add CPU 3 or 4, it happens.

Was ACPI enabled in Fedora 6? Or is there any ACPI change in F8 that might cause this behavior?

I know CPU slots 1 and 2 worked fine for 24 hours, but I saw the problem when I installed processor 3. I am now testing with 3 processors again, this time with processor 4 installed instead of processor 3. In the meantime, I wanted to see if anyone here knows of anything in F8 that might cause the problem, or can tell me that it is definitely a hardware issue. The only reason I think it is software is that I did not notice any problems in Fedora Core 6, only after a fresh install of F8.

I can tell the problem is occurring when SSH works only intermittently and the clock goes wrong. Here you can see the time jumping backward every few seconds:

[root@Servy internet]# date
Wed Jan 2 18:05:11 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:12 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:14 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:11 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:15 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:11 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:13 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:14 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:11 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:13 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:15 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:11 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:13 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:14 CST 2008
[root@Servy internet]# date
Wed Jan 2 18:05:10 CST 2008
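
For anyone who wants to spot the same symptom without eyeballing `date` output, here is a minimal Python sketch that flags every sample that is earlier than the one before it. The function names `parse_date_output` and `find_backward_jumps` are made up for this illustration, and the parsing assumes an English locale and the exact `date` layout shown above.

```python
from datetime import datetime


def parse_date_output(line):
    """Parse one line of `date` output, e.g. 'Wed Jan 2 18:05:11 CST 2008'.

    Assumes the six-field layout shown in the post and an English locale.
    """
    parts = line.split()
    # Drop the timezone abbreviation (field 5); parsing it is platform-dependent.
    cleaned = " ".join(parts[:4] + parts[5:])
    return datetime.strptime(cleaned, "%a %b %d %H:%M:%S %Y")


def find_backward_jumps(timestamps):
    """Return (index, previous, current) for each sample earlier than its predecessor."""
    jumps = []
    for i in range(1, len(timestamps)):
        if timestamps[i] < timestamps[i - 1]:
            jumps.append((i, timestamps[i - 1], timestamps[i]))
    return jumps


if __name__ == "__main__":
    # Three consecutive samples copied from the output above.
    sample = [
        "Wed Jan 2 18:05:14 CST 2008",
        "Wed Jan 2 18:05:11 CST 2008",
        "Wed Jan 2 18:05:15 CST 2008",
    ]
    stamps = [parse_date_output(s) for s in sample]
    for index, previous, current in find_backward_jumps(stamps):
        print(f"sample {index}: clock went from {previous} back to {current}")
```

Run over all 15 samples above, this kind of check would report 5 backward jumps, which a healthy wall clock should never show over a span of a few seconds.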


Here are some weird log messages I have seen:
Dec 30 18:30:46 localhost kernel: ..MP-BIOS bug: 8254 timer not connected to IO-APIC
Dec 30 18:30:46 localhost kernel: Uhhuh. NMI received for unknown reason b0 on CPU 0.
Dec 30 18:30:46 localhost kernel: You have some hardware problem, likely on the PCI bus.
Dec 30 18:30:46 localhost kernel: Dazed and confused, but trying to continue
Dec 30 18:30:46 localhost kernel: Setting up standard PCI resources
Dec 30 18:30:46 localhost kernel: mtrr: your CPUs had inconsistent fixed MTRR settings
Dec 30 18:30:46 localhost kernel: mtrr: probably your BIOS does not setup all CPUs.
Dec 30 18:30:46 localhost kernel: mtrr: corrected configuration.
Dec 30 18:30:46 localhost kernel: apm: BIOS not found.
Dec 30 18:55:25 localhost kernel: Time: tsc clocksource has been installed.
Dec 30 18:55:25 localhost kernel: piix4_smbus 0000:00:0f.0: Found 0000:00:0f.0 device
Dec 30 18:55:25 localhost kernel: piix4_smbus 0000:00:0f.0: SMB base address uninitialized - upgrade BIOS or use force_addr=0xaddr
Jan 2 14:13:26 localhost kernel: Clocksource tsc unstable (delta = 879823441398 ns)
Jan 2 14:13:26 localhost kernel: Time: acpi_pm clocksource has been installed.


Dec 30 18:30:46 localhost kernel: Initializing CPU#0
Dec 30 18:30:46 localhost kernel: CPU 0 irqstacks, hard=c07a8000 soft=c0788000
Dec 30 18:30:46 localhost kernel: PID hash table entries: 4096 (order: 12, 16384 bytes)
Dec 30 18:30:46 localhost kernel: Uhhuh. NMI received for unknown reason b0 on CPU 0.
Dec 30 18:30:46 localhost kernel: You have some hardware problem, likely on the PCI bus.
Dec 30 18:30:46 localhost kernel: Dazed and confused, but trying to continue
Dec 30 18:30:46 localhost kernel: Detected 701.620 MHz processor.


Dec 30 18:30:46 localhost kernel: Total of 4 processors activated (5613.07 BogoMIPS).
Dec 30 18:30:46 localhost kernel: ENABLING IO-APIC IRQs
Dec 30 18:30:46 localhost kernel: ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
Dec 30 18:30:46 localhost kernel: ..MP-BIOS bug: 8254 timer not connected to IO-APIC
Dec 30 18:30:46 localhost kernel: ...trying to set up timer (IRQ0) through the 8259A ... failed.
Dec 30 18:30:46 localhost kernel: ...trying to set up timer as Virtual Wire IRQ... works.
Dec 30 18:30:46 localhost kernel: checking TSC synchronization [CPU#0 -> CPU#1]: passed.
Dec 30 18:30:46 localhost kernel: checking TSC synchronization [CPU#0 -> CPU#2]: passed.
Dec 30 18:30:46 localhost kernel: checking TSC synchronization [CPU#0 -> CPU#3]: passed.
Dec 30 18:30:46 localhost kernel: Brought up 4 CPUs
Dec 30 18:30:46 localhost kernel: Booting paravirtualized kernel on bare hardware
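
The clocksource lines are the ones most directly related to the time problem, so here is a rough Python sketch that pulls just those events out of a log excerpt. The regular expressions are written against the two message formats quoted above and are an assumption about how they look in general; the name `clocksource_events` is hypothetical.

```python
import re

# Patterns modelled on the two kernel messages quoted in this post.
INSTALLED = re.compile(r"Time: (\w+) clocksource has been installed")
UNSTABLE = re.compile(r"Clocksource (\w+) unstable \(delta = (-?\d+) ns\)")


def clocksource_events(lines):
    """Return (event, clocksource, delta_seconds) tuples in log order.

    delta_seconds is None for 'installed' events.
    """
    events = []
    for line in lines:
        match = UNSTABLE.search(line)
        if match:
            # The kernel reports the delta in nanoseconds; convert to seconds.
            events.append(("unstable", match.group(1), int(match.group(2)) / 1e9))
            continue
        match = INSTALLED.search(line)
        if match:
            events.append(("installed", match.group(1), None))
    return events


if __name__ == "__main__":
    excerpt = [
        "Dec 30 18:55:25 localhost kernel: Time: tsc clocksource has been installed.",
        "Jan 2 14:13:26 localhost kernel: Clocksource tsc unstable (delta = 879823441398 ns)",
        "Jan 2 14:13:26 localhost kernel: Time: acpi_pm clocksource has been installed.",
    ]
    for event in clocksource_events(excerpt):
        print(event)
```

Read this way, the excerpt shows the kernel starting on `tsc`, declaring it unstable with a delta of roughly 880 seconds, and falling back to `acpi_pm`.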

Thanks!
 
01-02-2008, 06:53 PM   #2
Electro
LQ Guru
 
Registered: Jan 2002
Posts: 6,042

Rep: Reputation: Disabled
Try changing the motherboard battery and replacing the power supply.

Try doing some research on compatible steppings for the processors.
 
01-05-2008, 01:36 PM   #3
boozer_2
LQ Newbie
 
Registered: Mar 2005
Posts: 16

Original Poster
Rep: Reputation: 0
Quote:
Originally Posted by Electro
Try changing the motherboard battery and replacing the power supply.

Try doing some research on compatible steppings for the processors.

Thanks... I've been running these processors together for 3 years now with no problems. So I think their steppings are compatible.

I replaced the CMOS battery, which had no effect.

I had a spare Compaq IC board (which has a bunch of things integrated on it) and swapped that out. That resolved the problem. Either it failed a long time ago and F8 was just more sensitive to that fact, or it failed right after I upgraded. It's an old box, so who knows.

Anyone see any clues in the logs I posted that might point to what failed? Some really strange side effects from whatever hardware was messed up.

Last edited by boozer_2; 01-05-2008 at 04:43 PM.
 
01-05-2008, 08:47 PM   #4
boozer_2
LQ Newbie
 
Registered: Mar 2005
Posts: 16

Original Poster
Rep: Reputation: 0
Well....

I was proactive/stupid and upgraded the firmware on the board that was working... now it's not. The one thing I noticed that was not there before was this line on boot-up:

Dec 30 18:30:46 localhost kernel: ..MP-BIOS bug: 8254 timer not connected to IO-APIC

Could earlier Fedora have handled this differently?

I'm done messing with this ProLiant DL580. The newest firmware for it is from 2002, LOL. There have been several releases where one thing or another was left out or changed, which left me troubleshooting forever until I found out it was something stupid with this server. Say goodbye to my first Linux box! It definitely taught me a lot :-)

Just for my own knowledge though, could the 8254 IO-APIC message cause my problems with time and SSH? Seems SSH was most sensitive to whatever was causing the problem.
 
01-20-2008, 12:04 PM   #5
boozer_2
LQ Newbie
 
Registered: Mar 2005
Posts: 16

Original Poster
Rep: Reputation: 0
Fixed it by turning ACPI off

I changed the boot line in grub to this:
kernel /vmlinuz-2.6.23.9-85.fc8 ro root=/dev/VolGroup00/LogVol00 rhgb quiet apm=off acpi=off pci=noacpi

That basically adds apm=off acpi=off pci=noacpi to the end of the line.

My machine has now been stable for 5+ days, which is a record for Fedora 8. I don't know what caused the problem, but turning ACPI off appears to have solved it. Just figured I'd post in case it helps anyone else out.
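
If you want to confirm the options actually reached the running kernel after a reboot, one way is to look at `/proc/cmdline`. Below is a small Python sketch of that check; the helper names `parse_kernel_params` and `running_kernel_params` are invented for this example, and the parsing is deliberately simple (whitespace-separated tokens, split on the first `=`), so it would not handle quoted values containing spaces.

```python
def parse_kernel_params(cmdline):
    """Split a kernel command line into a dict.

    'key=value' tokens map key -> value; bare flags such as 'ro' map to True.
    """
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params[key] = value if sep else True
    return params


def running_kernel_params(path="/proc/cmdline"):
    """Read and parse the command line the running kernel was booted with."""
    with open(path) as handle:
        return parse_kernel_params(handle.read())


if __name__ == "__main__":
    # The parameter portion of the grub line from this post.
    line = "ro root=/dev/VolGroup00/LogVol00 rhgb quiet apm=off acpi=off pci=noacpi"
    params = parse_kernel_params(line)
    for name in ("apm", "acpi", "pci"):
        print(name, "=", params.get(name))
```

On the machine itself, calling `running_kernel_params()` and checking that `acpi` is `off` would show whether the edited grub entry is the one that booted.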
 
  

