LinuxQuestions.org
Visit the LQ Articles and Editorials section
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie
User Name
Password
Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question? If it is not in the man pages or the how-to's this is the place!

Notices



Reply
 
Search this Thread
Old 08-15-2009, 08:52 AM   #1
john lee
Member
 
Registered: Jun 2001
Location: Sydney, Australia
Posts: 41

Rep: Reputation: 15
BUG: soft lockup - CPU#1 stuck for 10s! [swapper:0]


Hi all,

I have this DELL PE2950 running Asterisk 1.4.21.2 on RHEL 5 with no problems since Dec last year. We are using Digium TE412P to connect to an E1 ISDN line. Since Dec last year, we did not add or delete any software or hardware. We also did not do any "yum update".

The linux kernel is 2.6.18-92.1.22.el5

Last week, the users reported that people from outside could not dial in but users can dial out. We rebooted the box and everything was fine.

Suddenly, starting this week, the box froze several times a day with a "BUG: soft lockup - CPU#1 stuck for 10s! [swapper:0]" error message on the console. Before it freezes, I can see a continuous stream of error message "timing source auto card 0!" coming up on the machine.

We rebooted and it became okay for a few hours and we had to reboot it again in order to clear the problem.

BUG: soft lockup - CPU#1 stuck for 10s! [swapper:0]
Pid: 0, comm: swapper
EIP: 0060:[,C0417911.] CPU: 1
EIP is at smp_call_function+0x99/0xc3
EFLAGS: 00000297 Tainted: G (2.6.10-92.1.22.e15 #1)
EAX: 00000002 EBX: 00000000 ECX: 00000001 EDX: 000000fb
ESI: 00000003 EDI: 00000000 EBP: c0417ae0 DS: 007B ES: 007b
CR0: 8005003b CR2: b7fec780 CR3: 324B2000 CR4: 000006d0
[<c0417ae0>] stop_this_cpu+0x0/0x33
[<c041794e>] smp_send_stop+0x13/0x1c
[<c0425bcf>] panic+0x4c/0x16d
[<c040da17>] intel_machine_check+0xf9/0x146
[<c040d91e>] intel_machine_check+0x0/0x146
[<c0403ccf>] error_code+0x39/0x40
[<c0403ccf>] mwait_idel+0x25/0x38
[<c0522200>] acpi_processor_idle+0x154/0x3b4
[<c0403c90>] cpu_idle+0x9f/0xb9
=======================

Q1. A strange thing is I could not find this error message in /var/log/messages or dmesg. The soft lockup error message can only be found on the machine itself.
Q2. Could it be kernel incompatibility problem? However, we did not ever change anything since it was installed.
Q3. From the error message, how do I know it is a software (kernel?) or hardware problem?

I would appreciate if someone could give me any suggestions.
 
Old 08-15-2009, 10:38 AM   #2
rjlee
Senior Member
 
Registered: Jul 2004
Distribution: Ubuntu 7.04
Posts: 1,990

Rep: Reputation: 67
In general, something must have changed to cause a problem. Two possibilities spring to mind: either a piece of hardware has failed, or someone has broken in to your unpatched system.

I'm not sure if this is what you're seeing, but there is a kernel bug (http://bugzilla.kernel.org/show_bug.cgi?id=10753) which was fixed in 2.6.27 that caused similar error messages. It may be a good idea to update your system and see if the problem goes away.

I'd also run chkrootkit (http://www.chkrootkit.org/) just to be on the safe side.

Also, RHEL has technical support: they should be able to tell you if it's hardware problem or not.
 
Old 07-29-2012, 10:43 PM   #3
john lee
Member
 
Registered: Jun 2001
Location: Sydney, Australia
Posts: 41

Original Poster
Rep: Reputation: 15
rjlee, thanks for your reply.

Something which I did not tell you in the post - the Asterisk server was connected to a Rhino Channel Bank via a port on the Digium TE412P card. We found out that system would not freeze if we disconnect the Rhino Channel Bank.
We opened a call with Rhino but the guys were very confident that the problem was not caused by Rhino.

Anyway, as we were running out of ideas, we quickly ordered a new Rhino Channel Bank and connected it to the Asterisk server.
Guess what - problem disappeared!
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Crashing "BUG: soft lockup - CPU#1 stuck for 11s" DavidDiggs Linux - Server 2 06-05-2009 01:43 AM
BUG: soft lockup - CPU#3 stuck for 10s! chakkerz Linux - Server 2 06-16-2008 06:34 PM
BUG: soft lockup detected on CPU#0 and BUG: spinlock recursion on CPU#0 ... BloodyCat Linux - Hardware 3 11-07-2006 02:14 PM
bug:soft lockup detected on cpu#0! paraiso Linux - General 2 01-29-2006 06:42 AM
BUG: Soft lockup detected on CPU#0! 97tbird Linux - General 0 11-03-2005 03:12 PM


All times are GMT -5. The time now is 08:59 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration