LinuxQuestions.org
Share your knowledge at the LQ Wiki.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Enterprise Linux Forums > Linux - Enterprise
User Name
Password
Linux - Enterprise This forum is for all items relating to using Linux in the Enterprise.

Notices


Reply
  Search this Thread
Old 07-07-2007, 11:55 AM   #1
rr262
LQ Newbie
 
Registered: Jul 2007
Posts: 3

Rep: Reputation: 0
ccsd - Unable to connect to cluster infrastructure


Trying to configure 2-node cluster on HP Porliant BL685c servers with RHEL 4.5. Once service ccsd is started getting error message in /var/log/messages saying,"unable to connect to infrastucture after 30 seconds.". This is happening every 30 seconds. Checked everything but no luck so far to get rid of this error and continue. These servers are not under any firewall setup. Please help me to troubleshoot this issue.

-regards
Raj
 
Old 07-12-2007, 01:42 PM   #2
elcody02
Member
 
Registered: Jun 2007
Posts: 52

Rep: Reputation: 17
Normally this only happens if your cluster isn't quorate. Did you start cman? And what does your cluster.conf look like?
 
Old 07-12-2007, 11:26 PM   #3
rr262
LQ Newbie
 
Registered: Jul 2007
Posts: 3

Original Poster
Rep: Reputation: 0
cman is starting but fenced is not. My cluster.conf file is as follows:
<?xml version="1.0" ?>
<cluster alias="hsestg_cluster" config_version="3" name="hsestg_cluster">
<fence_daemon post_fail_delay="0" post_join_delay="600"/>
<clusternodes>
<clusternode name="ma5orcl701aa.corp.halliburton.com" votes="1">
<fence>
<method name="1">
<device name="ma5orcl701_ILO"/>
</method>
</fence>
</clusternode>
<clusternode name="ma5orcl702aa.corp.halliburton.com" votes="1">
<fence>
<method name="1">
<device name="ma5orcl702_ILO"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1" log_facility="local7" log_level="7"/>
<ccsd log_facility="local7" log_level="7"/>
<fencedevices>
<fencedevice agent="fence_ilo" hostname="ma5orcl701_ilo.corp.halliburton.com" login="Administrator" name="ma5orcl701_ILO" passwd="init2007"/>
<fencedevice agent="fence_ilo" hostname="ma5orcl702_ilo.corp.halliburton.com" login="Administrator" name="ma5orcl702_ILO" passwd="init2007"/>
</fencedevices>
<rm>
<failoverdomains/>
<resources/>
</rm>
</cluster>

When starting the fenced daemon the following error message is displayed:
Jul 9 01:33:13 ma5orcl701aa ccsd[30523]: Unable to connect to cluster infrastructure after 180 seconds.
Jul 9 01:33:34 ma5orcl701aa ccsd[30523]: Cluster is not quorate. Refusing connection.
Jul 9 01:33:34 ma5orcl701aa ccsd[30523]: Error while processing connect: Connection refused
Jul 9 01:33:35 ma5orcl701aa ccsd[30523]: Cluster is not quorate. Refusing connection.
Jul 9 01:33:35 ma5orcl701aa ccsd[30523]: Error while processing connect: Connection refused

Please let me know if you could think of anything.
 
Old 07-14-2007, 01:06 PM   #4
elcody02
Member
 
Registered: Jun 2007
Posts: 52

Rep: Reputation: 17
Re: ccsd - Unable to connect to cluster infrastructure

Hmm, everything looks quite ok.
I would like to see more messages from cman. And what does fenced tell you when you start it (fence_tool join).
Did you already tell about the versions and distribution you are using?
What does cman_tool status and cman_tool nodes tell you?
Also try to check if the hostnames resolve to ips. Best via /etc/hosts.

That's all for now.
Good look and don't loose patience
MG.
 
Old 07-16-2007, 02:47 AM   #5
elcody02
Member
 
Registered: Jun 2007
Posts: 52

Rep: Reputation: 17
Another thing to not forget:

For a two node cluster both nodes need to be up and joined to the cluster before it gets quorate.
 
Old 07-22-2007, 07:16 AM   #6
rr262
LQ Newbie
 
Registered: Jul 2007
Posts: 3

Original Poster
Rep: Reputation: 0
Sorry for the delay. I was busy with some other projects and would be starting on this one either today or tomorrow and would post more info.
-Raj
 
Old 10-18-2011, 10:14 PM   #7
No woman No war
LQ Newbie
 
Registered: Oct 2011
Posts: 1

Rep: Reputation: Disabled
[Solved]

I sent ages for this issue. Finally, I got the point. It is because yum command is hung. You can check by

ps aux | grep yum

You will see at least 1 yum process. Kill it/them by

killall yum

Good luck
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Linux cluster - master node can't connect to slave nodes anymore Baerek Linux - Networking 6 03-30-2007 02:02 PM
LXer: Cluster Programming: Explicit Implications of Cluster Computing LXer Syndicated Linux News 0 12-26-2006 08:54 PM
LXer: Hitting the Cluster Wall - A Study in Cluster Optimization LXer Syndicated Linux News 0 06-27-2006 12:33 PM
LXer: Life, The Universe, and Your Cluster - A Study in Cluster Optimization LXer Syndicated Linux News 0 05-08-2006 08:54 AM
Unable to connect eggoz Linux - Networking 2 11-22-2004 06:32 PM

LinuxQuestions.org > Forums > Enterprise Linux Forums > Linux - Enterprise

All times are GMT -5. The time now is 05:41 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration