LinuxQuestions.org
05-23-2005, 12:56 AM   #1
rmakhija
LQ Newbie
 
Registered: May 2005
Posts: 2

Rep: Reputation: 0
Linux 2.6 kernel hangs


Hi All

I have been facing this problem for some time now and feel it is time to ask around for a solution. It would be great if you could provide some definitive pointers.

I am running a networking application on top of an SCTP (Stream Control Transmission Protocol) stack implemented along the lines of RFC 2960. The SCTP stack sits over the IP layer and makes raw IP system calls for all networking operations.
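For readers unfamiliar with the "raw IP" approach described above, here is a minimal sketch of how a user-space program might open a raw IPv4 socket for SCTP's protocol number (132). This is only an illustration and not the stack from this post; the function name `open_raw_sctp_socket` is made up for the example, and opening a raw socket normally requires root (or CAP_NET_RAW on Linux).

```python
import socket

# IANA protocol number for SCTP is 132. Fall back to the literal value
# because socket.IPPROTO_SCTP is not defined on every platform.
IPPROTO_SCTP = getattr(socket, "IPPROTO_SCTP", 132)


def open_raw_sctp_socket():
    """Try to open a raw IPv4 socket bound to the SCTP protocol number.

    Hypothetical helper for illustration only. Returns the socket on
    success, or None if the OS refuses (for example, when not root).
    """
    try:
        return socket.socket(socket.AF_INET, socket.SOCK_RAW, IPPROTO_SCTP)
    except (OSError, AttributeError):
        # PermissionError is a subclass of OSError, so it is covered here.
        return None


if __name__ == "__main__":
    sock = open_raw_sctp_socket()
    if sock is None:
        print("could not open raw socket (root or CAP_NET_RAW is usually needed)")
    else:
        print("raw socket opened, fd", sock.fileno())
        sock.close()
```

With a socket like this, the user-space stack builds and parses every SCTP packet itself, so every packet sent or received costs at least one system call.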

The setup is as follows:
1. There are 2 Linux machines connected back to back over a 100 Mbps Ethernet interface, each with an Intel 1.6 GHz Pentium IV processor and 1 GB of RAM. Each machine is running Red Hat 9.0.

2. Each machine has the SCTP stack running over the IP layer. On top of the SCTP stack there is a load application which can pump data messages (of 100 bytes each, for a period of 5 minutes) at different rates (configurable at run time).

3. The intent is to evaluate the performance of the SCTP stack in the above configuration.

4. To start with, messages are pumped from both ends at a moderate rate (1000 messages/sec), and the rate is gradually increased until the buffers at the transport layer become insufficient to handle the rate at which messages are being pumped by the application (which is basically a load generator) from both sides.

5. A time comes (when the message rate has reached 35000 messages/sec) when one of the two computers stops responding and remains hung, and I have to forcefully (hard) reboot the machine. Ctrl+C and similar keys do not work and the traces on the console also stop; none of the keys except Caps Lock, Scroll Lock and Num Lock seem to work. When I telnet to the hung machine from the peer, the session gets as far as the escape sequence message but the login prompt never appears. The hung machine does, however, respond successfully to ping requests.

6. The SCTP stack and the application run as a binary in user mode. The binary runs with root privileges (in superuser mode).
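As a rough sanity check on the numbers in the setup above, the wire load at a given message rate can be estimated as follows. This is a back-of-the-envelope sketch under assumptions not stated in the post: one 100-byte message per packet (no bundling), a 12-byte SCTP common header, a 16-byte DATA chunk header, a 20-byte IPv4 header with no options, and 38 bytes of Ethernet framing (header, FCS, preamble and inter-frame gap). SACK traffic in the reverse direction is ignored.

```python
MSG_BYTES = 100                   # payload size, from the post
LINK_BITS_PER_SEC = 100_000_000   # 100 Mbps Ethernet, from the post

# Assumed per-packet overhead (not from the post), one message per packet:
SCTP_COMMON_HEADER = 12
SCTP_DATA_CHUNK_HEADER = 16
IPV4_HEADER = 20
ETHERNET_FRAMING = 14 + 4 + 8 + 12   # header + FCS + preamble + inter-frame gap
OVERHEAD_BYTES = (SCTP_COMMON_HEADER + SCTP_DATA_CHUNK_HEADER
                  + IPV4_HEADER + ETHERNET_FRAMING)


def wire_bits_per_sec(msgs_per_sec, payload=MSG_BYTES, overhead=OVERHEAD_BYTES):
    """Bits per second on the wire in one direction at the given message rate."""
    return msgs_per_sec * (payload + overhead) * 8


def link_utilization(msgs_per_sec):
    """Fraction of the 100 Mbps link used in one direction."""
    return wire_bits_per_sec(msgs_per_sec) / LINK_BITS_PER_SEC


if __name__ == "__main__":
    for rate in (1000, 10000, 35000):
        print(f"{rate} msg/s -> {wire_bits_per_sec(rate) / 1e6:.2f} Mbit/s, "
              f"{link_utilization(rate):.1%} of the link")
```

Under these assumptions, 35000 messages/sec works out to about 52 Mbit/s in each direction, roughly half of the link. That suggests, but does not prove, that the wire itself is not saturated at the point where the hang occurs.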

Could you let me know what is happening? How can a user-mode program force the kernel into an infinite loop such that it stops responding? I have checked /var/log/messages but didn't find anything fishy.

Waiting for your suggestions/replies

Thanks and Best Regards
Rajesh
 
05-23-2005, 07:16 PM   #2
btmiller
Senior Member
 
Registered: May 2004
Location: In the DC 'burbs
Distribution: Arch, Scientific Linux, Debian, Ubuntu
Posts: 4,275

Rep: Reputation: 370
This is a pretty obscure question ... you might have more luck in the networking forum. But just to be clear, the application is entirely user-space? No corresponding module inserted into the kernel? I could think of a couple of things it could be, perhaps related to corruption of kernel memory. But a couple questions:

1) What kernel version exactly are you running (type uname -a)?
2) Is it always the same machine that hangs? If so, it may simply be a case of dodgy hardware buckling under load.
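If it is easier to record the kernel details from a script on each test machine (for example, to log them alongside each test run), something like the sketch below collects roughly the same fields as `uname -a`. The helper name `kernel_summary` is made up for this example.

```python
import platform


def kernel_summary():
    """Return a one-line summary similar in spirit to `uname -a`."""
    u = platform.uname()
    return f"{u.system} {u.node} {u.release} {u.version} {u.machine}"


if __name__ == "__main__":
    print(kernel_summary())
```

Saving this line from both machines before each run makes it easy to tell later which kernel a given hang occurred on.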
 
05-24-2005, 02:31 AM   #3
rmakhija
LQ Newbie
 
Registered: May 2005
Posts: 2

Original Poster
Rep: Reputation: 0
Hi
Thanks for your inputs.
Yes, the application runs entirely in user space; it only makes raw IP system calls (to the underlying IP layer) for all networking operations.
No corresponding module is inserted into the kernel.
Secondly, I have tried this on various machines (running kernel 2.6 and kernel 2.4) with different hardware configurations, but the problem persists.
I would like to add that the problem surfaces only when we are pumping messages at a very high rate and CPU utilization approaches 100%. The messages are pumped for around 5 minutes, but if the machine hangs it does not recover even if left idle for the next 10-12 hours.
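One way to put a number on "CPU utilization approaches 100%" is to sample the aggregate `cpu` line of `/proc/stat` twice and compare the deltas. The sketch below is only illustrative; the function names are made up. It assumes the standard field order (user, nice, system, idle, then iowait, irq, softirq, steal on newer kernels) and also accepts the shorter four-field line found on 2.4 kernels.

```python
import os
import time


def parse_cpu_line(stat_text):
    """Return (busy, total) jiffies from the aggregate 'cpu' line of /proc/stat text."""
    for line in stat_text.splitlines():
        fields = line.split()
        if fields and fields[0] == "cpu":
            values = [int(v) for v in fields[1:]]
            # Order: user nice system idle [iowait irq softirq steal guest guest_nice]
            idle = values[3] + (values[4] if len(values) > 4 else 0)
            # Guest fields are skipped because they are already counted in user/nice.
            total = sum(values[:8])
            return total - idle, total
    raise ValueError("no aggregate cpu line found")


def cpu_utilization(before, after):
    """Fraction of CPU time spent busy between two /proc/stat snapshots."""
    busy0, total0 = parse_cpu_line(before)
    busy1, total1 = parse_cpu_line(after)
    elapsed = total1 - total0
    return (busy1 - busy0) / elapsed if elapsed > 0 else 0.0


if __name__ == "__main__" and os.path.exists("/proc/stat"):
    with open("/proc/stat") as f:
        first = f.read()
    time.sleep(0.5)
    with open("/proc/stat") as f:
        second = f.read()
    print(f"CPU utilization over the interval: {cpu_utilization(first, second):.1%}")
```

Appending such samples to a file at short intervals during the test might show how the load was trending just before the hang, though the last samples could be lost if they had not reached the disk when the machine was reset.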
Waiting for your suggestions.
Thanks and Best Regards
Rajesh
 
  


All times are GMT -5.
