LinuxQuestions.org
Visit the LQ Articles and Editorials section
Go Back   LinuxQuestions.org > Forums > Enterprise Linux Forums > Linux - Enterprise
User Name
Password
Linux - Enterprise This forum is for all items relating to using Linux in the Enterprise.

Notices

Reply
 
Search this Thread
Old 10-17-2006, 04:38 AM   #1
jdaniels73
LQ Newbie
 
Registered: Jul 2006
Posts: 16

Rep: Reputation: 0
Question Cluster Suite does not fail over


Hi all,

I had the following setup:

RHEL4 U3 (Kernel version 2.6.8-34)
Cluster Suite ( CMan kernel/headers 2.6.9-43.8 )
GFS shared storage

I have created a simple 2 node cluster running Apache httpd server. When it starts up as normal the virtual IP is in place and the apache daemon is running on the 'owning' server. However whenever I fail over, the floating IP doesn't get bonded to the standby server, and the apache daemon never starts on that standby server.

I was also having deadlocks between CMan and RGManager and found that this was due to a known and fixed (sort of) bug in RHEL4U3 and Cman so I upgraded them to the following:

RHEL4 U3 kernel version 2.6.9-34.0.1
CMan kernel/headers 2.6.9-43.8.3

The lockups stopped but the initial problem persists (floating IP and service not relocated).

Anyone have any workarounds?

Many thanks,
Jon
 
Old 10-17-2006, 07:18 AM   #2
Jaqui
Member
 
Registered: Jan 2006
Location: Vancouver BC
Distribution: LFS, SLak, Gentoo, Debian
Posts: 291

Rep: Reputation: 36
as root on the failover server:
run a cron script that checks for the primary server to respond.
[ ie ping ip number, with an echo request, set your primary to reply to the failover server ip to echo request ]
if no reply, the script runs apachectl start, sends and email to alter to server failure and exits.

just configure the failover to have the floating ip in apache's httpd.conf. it only runs when the primary is down so having the ip assigned in the conf isn't an issue.
 
Old 10-17-2006, 08:26 AM   #3
jdaniels73
LQ Newbie
 
Registered: Jul 2006
Posts: 16

Original Poster
Rep: Reputation: 0
Quote:
Originally Posted by Jaqui
as root on the failover server:
run a cron script that checks for the primary server to respond.
[ ie ping ip number, with an echo request, set your primary to reply to the failover server ip to echo request ]
if no reply, the script runs apachectl start, sends and email to alter to server failure and exits.

just configure the failover to have the floating ip in apache's httpd.conf. it only runs when the primary is down so having the ip assigned in the conf isn't an issue.
Thanks - you're basically saying "write our own clustering app"? I actually considered this but we have to use RHCS.

Anyway once I updated the kernel, rgmanager and cman, I tried again and now I find that the services aren't listed at all!

Any more thoughts?
 
Old 10-18-2006, 03:43 AM   #4
jdaniels73
LQ Newbie
 
Registered: Jul 2006
Posts: 16

Original Poster
Rep: Reputation: 0
I rolled back the kernel and CMan versions to the following:

RHEL4 U3 (Kernel version 2.6.8-34)
Cluster Suite ( CMan kernel/headers 2.6.9-43.8 )

While keeping the RG Manager version at rgmanager-1.9.54-1. This combination enables me to view and manage the services but the failover problem persists...
 
Old 10-20-2006, 07:34 AM   #5
Jaqui
Member
 
Registered: Jan 2006
Location: Vancouver BC
Distribution: LFS, SLak, Gentoo, Debian
Posts: 291

Rep: Reputation: 36
I've never used the RHCs, I've just used a shell script and cron job. completely reliable and simple to do.
With the bonus of adding needed functionality to the shell script is simple, such as alerting the tech on shift that the primary server failed.

But it does sound like your RHCS config is most likely the issue, since it's designed to perform the same tasks as the simple shell script / cron job method.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
RHEL cluster suite hardeep_ubhi Linux - Hardware 1 09-07-2006 04:59 AM
Looking for Expert on RedHat Cluster Suite 3 vukhanhthuy Linux - Enterprise 4 08-09-2006 06:36 AM
Why cannot my Cluster Suite be installed ? vqthang Linux - Enterprise 2 06-06-2006 09:32 AM
Redhat Cluster suite magobin Linux - Enterprise 0 03-13-2006 06:00 AM
redhat Cluster Suite marpel Red Hat 0 01-19-2006 08:05 AM


All times are GMT -5. The time now is 03:14 PM.

Main Menu
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration