LinuxQuestions.org
Red Hat This forum is for the discussion of Red Hat Linux.

Old 10-20-2004, 05:00 PM   #1
draeician73
LQ Newbie
 
Registered: Oct 2004
Posts: 4

Rep: Reputation: 0
Redhat AS2.1 system freeze


I currently have a server that has 26,000 users, and a lovely quota system. It appears there is a kernel bug out there related to quotas that causes the system to hit a race condition every 24-36 hours. Over the past 6 months, Red Hat has offered only two real solutions:

1. Turn off quotas (Not an option)
2. Reinstall to AS3.0 since they cannot replicate the problem on AS3.0.

I really don't like either option, since I'm paying Red Hat for support on AS2.1.

Is anyone else out there experiencing this same problem? Do I really have to rebuild?

Has anyone found a solution other than rebuilding the entire system and causing a major outage for my users?
 
Old 10-21-2004, 04:08 PM   #2
misc
Senior Member
 
Registered: Apr 2003
Distribution: Red Hat + Fedora
Posts: 1,084

Rep: Reputation: 54
Did you contact your sales representative at Red Hat about it?

Are there public bug reports about it which you can post here?

With regard to the age of RHEL 2.1, when did you install the server? Was the bug present since the beginning? Or was it introduced with a kernel erratum?

The suggestions you refer to come from Red Hat Support?
 
Old 10-23-2004, 02:27 PM   #3
hkb33
Member
 
Registered: Sep 2004
Location: Raleigh NC
Distribution: Fedora / RHEL
Posts: 171

Rep: Reputation: 30
I agree with misc...Red Hat is not going to make you install RHEL 3 as an option or disable quotas...I highly doubt that's what they recommended...if so, did a senior engineer or technician tell you this?

What kernel version are you using on your RHEL 2.1 system? If you are using the base kernel (2.4.9-e.3), then I highly recommend upgrading to the latest one if you haven't already.

Also, as part of the troubleshooting process, Red Hat is most likely going to check whether you have the latest kernel (2.4.9-e.49) installed to see if the problem can be replicated.
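
For anyone who wants to script the "am I on the latest erratum kernel?" check described above, here is a minimal sketch in Python. It assumes release strings of the form shown in this thread (2.4.9-e.3, 2.4.9-e.49), optionally followed by a flavor suffix; the function names parse_release and is_older are made up for illustration and are not part of any Red Hat tool.

```python
import re


def parse_release(release):
    """Split a release such as '2.4.9-e.49' into ((2, 4, 9), 49).

    Assumes the 'e' erratum numbering seen in this thread; anything after
    the erratum number (for example a kernel flavor suffix) is ignored.
    """
    m = re.fullmatch(r"(\d+(?:\.\d+)*)-e\.?(\d+)(.*)", release)
    if m is None:
        raise ValueError(f"unrecognized kernel release: {release!r}")
    version = tuple(int(part) for part in m.group(1).split("."))
    return version, int(m.group(2))


def is_older(running, latest):
    """Return True if the running kernel predates the latest erratum."""
    return parse_release(running) < parse_release(latest)
```

The erratum number is compared as an integer rather than as text, because a plain string comparison would rank a hypothetical e.9 above e.49. On a live system the running release could be taken from `uname -r` (or `platform.release()` in Python) and passed in as the first argument.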
 
Old 10-23-2004, 07:48 PM   #4
draeician73
LQ Newbie
 
Registered: Oct 2004
Posts: 4

Original Poster
Rep: Reputation: 0
I was running the enterprise kernel to take advantage of SMP and 8 GB of RAM. We fell back to the single-processor kernel to see if it would relieve the problem.

Current kernel: Linux xx.xx.xx 2.4.9-e.49 #1 Fri Aug 6 11:56:52 EDT 2004 i686 unknown

My trouble ticket with Red Hat was just updated today. It appears they have a NON-production set of RPMs they would like me to test. Yes... the first solution I was told by the technician was to turn off quotas.

Brought to you directly from the trouble ticket log:
---------------------------------------------------------------------------
"The escalation team has also suggested the following:

1) Please turn disk quota off if you are still using it.

2) Please enable nmi_watchdog and obtain during the panic the output of sysrq-w
(a couple times) and a sysrq-m through the serial console. Detailed instructions
on how to do this are provided in the Kernel Profiling document attached to this
ticket, sections Category 1 and Category 2. "

The upgrade to AS3 was suggested due to the inability to replicate the bug under AS3.

I appreciate the quick replies. If I knew how to crack the kernel open and try to fix it I would.
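
For reference, the second suggestion in the ticket (nmi_watchdog plus SysRq output over a serial console) could be set up roughly like the sketch below. This is not taken from the Kernel Profiling document mentioned in the ticket, which isn't reproduced here. It assumes a kernel that honors the `nmi_watchdog=` and `console=` boot parameters and exposes /proc/sys/kernel/sysrq and /proc/sysrq-trigger; the trigger file may not exist on a kernel as old as 2.4.9, in which case the keys have to be sent over the serial console itself. The function names, the ttyS0 port, and the baud rate are illustrative assumptions.

```python
from pathlib import Path

# Standard procfs locations on kernels that provide them (assumption:
# the trigger file may be missing on older 2.4 kernels).
SYSRQ_ENABLE = Path("/proc/sys/kernel/sysrq")
SYSRQ_TRIGGER = Path("/proc/sysrq-trigger")


def kernel_args(serial_port="ttyS0", baud=115200):
    """Boot parameters to append to the kernel line in the boot loader."""
    # nmi_watchdog=1 selects the I/O APIC watchdog (2 selects the local
    # APIC one). The two console= entries send kernel messages to the
    # serial port as well as the local display.
    return f"nmi_watchdog=1 console={serial_port},{baud}n8 console=tty0"


def enable_sysrq(enable_path=SYSRQ_ENABLE):
    """Turn on the magic SysRq facility (needs root on a real system)."""
    Path(enable_path).write_text("1\n")


def send_sysrq(key, trigger_path=SYSRQ_TRIGGER):
    """Request one SysRq command, e.g. 'w' or 'm' as the ticket asks."""
    if len(key) != 1:
        raise ValueError("SysRq commands are a single character")
    Path(trigger_path).write_text(key)
```

Note that if the box is completely frozen, nothing in userspace will run, so in practice the SysRq keys would be sent from the machine on the other end of the serial cable. The helper functions are only useful while the system is still responsive enough to run them.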

Last edited by draeician73; 10-23-2004 at 07:53 PM.
 
Old 10-23-2004, 08:08 PM   #5
misc
Senior Member
 
Registered: Apr 2003
Distribution: Red Hat + Fedora
Posts: 1,084

Rep: Reputation: 54
But that sounds like they only want to trouble-shoot it together with you in order to find out details. It doesn't sound like they want you to turn off quotas as a suggested fix.
 
Old 10-26-2004, 09:35 AM   #6
draeician73
LQ Newbie
 
Registered: Oct 2004
Posts: 4

Original Poster
Rep: Reputation: 0
There is a non-production kernel and quota package they want me to do some load testing on. This might be a fix.

I know they just wanna help. Just getting a bit frustrated with 6 months of having to reboot my mail server daily, and it still locking up within 12 hours at random. I'll keep in touch.
 
  


All times are GMT -5. The time now is 08:46 PM.
