LinuxQuestions.org
Share your knowledge at the LQ Wiki.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Red Hat
User Name
Password
Red Hat This forum is for the discussion of Red Hat Linux.

Notices

Reply
 
Search this Thread
Old 03-06-2009, 12:07 PM   #1
verixnbi
LQ Newbie
 
Registered: Mar 2009
Posts: 1

Rep: Reputation: 0
On Kernel Panic system hangs, how to get crash info?


Hey all,

I'm running RHEL4 and am trying to get a kernel core dump on crash using netdump.

By all accounts netdump looks like it's working, the service is started on both client and server. The ip-timestamp folders appear correctly when the client comes up and the server appears to begin taking a dump when the client crashes. The only problem is the client does not actually dump core, it gets to the Oops and does not continue.

On the client 'service netdump propagate' and 'servce netdump start' have been run without error.

Parameters set:
sysctl -a | grep panic
kernel.panic_on_unrecovered_nmi = 0
kernel.unknown_nmi_panic = 0
kernel.panic_on_oops = 1
kernel.panic = 0

echo "h" > /proc/sysrq-trigger creates a new log on the server (As I think it's supposed to?).

I've tried crashing using a null pointer dereference and simply doing 'echo "c" > /proc/sysrq-trigger', both get the same (lack of) results.

From the console:
mynode # insmod panic.ko
Unable to handle kernel NULL pointer dereference at 0000000000000000 RIP:
<ffffffffa042e014>{anic:initKernelPanic+20}
PML4 2e373067 PGD 31af9067 PMD 0
Oops: 0000 [1] SMP

From the netdump logfile:
[...network console startup...]
warning: many lost ticks.
Your time source seems to be instable or some driver is hogging interupts
rip __do_softirq+0x54/0xf0
Unable to handle kernel NULL pointer dereference at 0000000000000000 RIP:
<ffffffffa042e014>{anic:initKernelPanic+20}
PML4 2e373067 PGD 31af9067 PMD 0
Oops: 0000 [1] SMP

Note: This is on a virtual machine through VMWare Server 2. My own initial suspicion was VMWare was disconnecting the NIC when the system crashed, but this is not the case. VMWare still reads the NIC as connected.


Any help here would be greatly appreciated, thanks!

Andrew
 
Old 03-07-2009, 11:25 PM   #2
anomie
Senior Member
 
Registered: Nov 2004
Location: Texas
Distribution: RHEL, Scientific Linux, Debian, Fedora, Lubuntu, FreeBSD
Posts: 3,930
Blog Entries: 5

Rep: Reputation: Disabled
Not meant to be a copout (I am just pointing you to the same resource I'd refer to), but the very excellent book "Self Service Linux" covers diagnosing kernel oops messages.

Free PDF download here: http://www.linux-books.us/linux_general_0013.php

Then again, if you're paying RH for support, that's where I would go with this.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
kernel panic hangs boot on centos dutler Linux - General 6 02-06-2009 11:08 PM
Upgraded Kernel, Kernel Panic, Can't read root file system. Romanus81 Slackware 25 05-04-2008 10:45 PM
2 PC Linux server, hangs with kernel panic. Stilltray Linux - Newbie 3 11-24-2007 04:30 PM
Strange repeated kernel panic & crash windle Linux - Kernel 1 04-18-2007 09:59 AM
Kernel panic Crash baetmaen Linux - General 4 05-23-2005 04:17 PM


All times are GMT -5. The time now is 01:20 PM.

Main Menu
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration