LinuxQuestions.org
Download your favorite Linux distribution at LQ ISO.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Enterprise Linux Forums > Linux - Enterprise
User Name
Password
Linux - Enterprise This forum is for all items relating to using Linux in the Enterprise.

Notices


Reply
  Search this Thread
Old 06-18-2010, 04:09 AM   #1
DevinXu
LQ Newbie
 
Registered: May 2010
Posts: 6

Rep: Reputation: 0
Two node cluster, start CMAN fence the other node


Hi,all
I'm trying to install my first cluster of two nodes, using IPMILAN fence devices. I need some help because I can't find what I've missed.
My main problem is that the "service cman start" reboots the other node and I can't form the two nodes cluster.
I'm using nodes named separately fsommfe01 and fsommfe02, they are on the same VLAN and pings each other ok

1./etc/hosts;

127.0.0.1 localhost.localdomain localhost
###public service ip####
139.122.10.11 fsommfe01_pub
139.122.10.12 fsommfe02_pub
###rhcs heartbeat ip ####
192.168.100.1 fsommfe01 fsommfe01_pri1
192.168.100.2 fsommfe02 fsommfe02_pri1
###oracle rac heartbeat ip ####
192.168.101.1 fsommfe01 fsommfe01_pri2
192.168.101.2 fsommfe02 fsommfe02_pri2
###fence_ip####
139.122.10.201 fence01
139.122.10.202 fence02
###oracle rac vip ####
139.122.10.21 fsommfe01_vip
139.122.10.22 fsommfe02_vip

2.version and kernel;
[root@fsommfe02 log]# uname -a
Linux fsommfe02 2.6.18-92.el5 #1 SMP Tue Apr 29 13:16:15 EDT 2008 x86_64 x86_64 x86_64 GNU/Linux

3.cman version
[root@fsommfe02 log]# rpm -qa|grep cman
cman-2.0.84-2.el5

4. cluster.conf
[root@fsommfe01 cluster]# cat cluster.conf
<?xml version="1.0" ?>
<cluster alias="gtc_test" config_version="3" name="gtc_test">
<fence_daemon post_fail_delay="0" post_join_delay="10"/>
<clusternodes>
<clusternode name="fsommfe01" nodeid="1" votes="1">
<fence>
<method name="1">
<device name="fence01" nodename="fsommfe01"/>
</method>
</fence>
</clusternode>
<clusternode name="fsommfe02" nodeid="2" votes="1">
<fence>
<method name="1">
<device name="fence02" nodename="fsommfe02"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_manual" name="fence01"/>
<fencedevice agent="fence_manual" name="fence02"/>
</fencedevices>
<rm>
<failoverdomains>
<failoverdomain name="test" ordered="0" restricted="1">
<failoverdomainnode name="fsommfe01" priority="1"/>
<failoverdomainnode name="fsommfe02" priority="1"/>
</failoverdomain>
</failoverdomains>
<resources/>
</rm>
</cluster>


When I start the cman service, it hangs up for some time at the "Starting fencing..." step and after those configured 10 secs it fences fsommfe02 and reboots it.
[root@fsommfe01 ~]# service cman start
Starting cluster:
Loading modules... done
Mounting configfs... done
Starting ccsd... done
Starting cman... done
Starting daemons... done
Starting fencing... done
[ OK ]

"fsommfe02" gets rebooted:
[root@fsommfe02 ~]#
Broadcast message from root (Thu Apr 15 18:42:24 2010):

The system is going down for system halt NOW!

5.fsommfe01 logs;
Jun 17 16:42:29 fsommfe01 ccsd[5014]: Starting ccsd 2.0.84:
Jun 17 16:42:29 fsommfe01 ccsd[5014]: Built: Apr 15 2008 16:19:15
Jun 17 16:42:29 fsommfe01 ccsd[5014]: Copyright (C) Red Hat, Inc. 2004 All rights reserved.
Jun 17 16:42:29 fsommfe01 ccsd[5014]: cluster.conf (cluster name = gtc_test, version = 5) found.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] AIS Executive Service RELEASE 'subrev 1358 version 0.80.3'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Copyright (C) 2006 Red Hat, Inc.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] AIS Executive Service: started and ready to provide service.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Using default multicast address of 239.192.105.199
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_cpg loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais cluster closed process group service v1.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_cfg loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais configuration service'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_msg loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais message service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_lck loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais distributed locking service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_evt loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais event service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_ckpt loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais checkpoint service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_amf loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais availability management framework B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_clm loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais cluster membership service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_evs loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais extended virtual synchrony service'
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] openais component openais_cman loaded.
Jun 17 16:42:32 fsommfe01 openais[5020]: [MAIN ] Registering service handler 'openais CMAN membership service 2.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] Token Timeout (10000 ms) retransmit timeout (495 ms)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] token hold (386 ms) retransmits before loss (20 retrans)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] join (60 ms) send_join (0 ms) consensus (4800 ms) merge (200 ms)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] downcheck (1000 ms) fail to recv const (50 msgs)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] seqno unchanged const (30 rotations) Maximum network MTU 1500
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] window size per rotation (50 messages) maximum messages per rotation (17 messages)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] send threads (0 threads)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] RRP token expired timeout (495 ms)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] RRP token problem counter (2000 ms)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] RRP threshold (10 problem count)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] RRP mode set to none.
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] heartbeat_failures_allowed (0)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] max_network_delay (50 ms)
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] Receive multicast socket recv buffer size (262142 bytes).
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes).
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] The network interface [192.168.100.1] is now up.
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] Created or loaded sequence id 24.192.168.100.1 for this ring.
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] entering GATHER state from 15.
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais extended virtual synchrony service'
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais cluster membership service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais availability management framework B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais checkpoint service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais event service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais distributed locking service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais message service B.01.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais configuration service'
Jun 17 16:42:32 fsommfe01 ccsd[5014]: Initial status:: Quorate
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais cluster closed process group service v1.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [SERV ] Initialising service handler 'openais CMAN membership service 2.01'
Jun 17 16:42:32 fsommfe01 openais[5020]: [CMAN ] CMAN 2.0.84 (built Apr 15 2008 16:19:19) started
Jun 17 16:42:32 fsommfe01 openais[5020]: [SYNC ] Not using a virtual synchrony filter.
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] Creating commit token because I am the rep.
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] Saving state aru 0 high seq received 0
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] Storing new sequence id for ring 1c
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] entering COMMIT state.
Jun 17 16:42:32 fsommfe01 openais[5020]: [TOTEM] entering RECOVERY state.
Jun 17 16:42:33 fsommfe01 openais[5020]: [TOTEM] position [0] member 192.168.100.1:
Jun 17 16:42:33 fsommfe01 openais[5020]: [TOTEM] previous ring seq 24 rep 192.168.100.1
Jun 17 16:42:33 fsommfe01 openais[5020]: [TOTEM] aru 0 high delivered 0 received flag 1
Jun 17 16:42:33 fsommfe01 openais[5020]: [TOTEM] Did not need to originate any messages in recovery.
Jun 17 16:42:33 fsommfe01 openais[5020]: [TOTEM] Sending initial ORF token
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] CLM CONFIGURATION CHANGE
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] New Configuration:
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] Members Left:
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] Members Joined:
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] CLM CONFIGURATION CHANGE
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] New Configuration:
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] r(0) ip(192.168.100.1)
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] Members Left:
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] Members Joined:
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] r(0) ip(192.168.100.1)
Jun 17 16:42:33 fsommfe01 openais[5020]: [SYNC ] This node is within the primary component and will provide service.
Jun 17 16:42:33 fsommfe01 openais[5020]: [TOTEM] entering OPERATIONAL state.
Jun 17 16:42:33 fsommfe01 openais[5020]: [CMAN ] quorum regained, resuming activity
Jun 17 16:42:33 fsommfe01 openais[5020]: [CLM ] got nodejoin message 192.168.100.1
Jun 17 16:42:37 fsommfe01 fenced[5037]: fsommfe02 not a cluster member after 3 sec post_join_delay
Jun 17 16:42:37 fsommfe01 fenced[5037]: fencing node "fsommfe02"
Jun 17 16:42:37 fsommfe01 fence_manual: Node fsommfe02 needs to be reset before recovery can procede. Waiting for fsommfe02 to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n fsommfe02)

6.fsommfe02 logs;
节点B fsommfe02的日志;
Jun 17 16:42:34 fsommfe02 ccsd[4985]: Starting ccsd 2.0.84:
Jun 17 16:42:34 fsommfe02 ccsd[4985]: Built: Apr 15 2008 16:19:15
Jun 17 16:42:34 fsommfe02 ccsd[4985]: Copyright (C) Red Hat, Inc. 2004 All rights reserved.
Jun 17 16:42:34 fsommfe02 ccsd[4985]: cluster.conf (cluster name = gtc_test, version = 5) found.
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] AIS Executive Service RELEASE 'subrev 1358 version 0.80.3'
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] Copyright (C) 2006 Red Hat, Inc.
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] AIS Executive Service: started and ready to provide service.
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] Using default multicast address of 239.192.105.199
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] openais component openais_cpg loaded.
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] Registering service handler 'openais cluster closed process group service v1.01'
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] openais component openais_cfg loaded.
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] Registering service handler 'openais configuration service'
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] openais component openais_msg loaded.

Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] Registering service handler 'openais availability management framework B.01.01'
Jun 17 16:42:36 fsommfe02 openais[4991]: [MAIN ] openais component openais_clm loaded.
Jun 17 16:42:37 fsommfe02 openais[4991]: [MAIN ] Registering service handler 'openais cluster membership service B.01.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [MAIN ] openais component openais_evs loaded.
Jun 17 16:42:37 fsommfe02 openais[4991]: [MAIN ] Registering service handler 'openais extended virtual synchrony service'
Jun 17 16:42:37 fsommfe02 openais[4991]: [MAIN ] openais component openais_cman loaded.
Jun 17 16:42:37 fsommfe02 openais[4991]: [MAIN ] Registering service handler 'openais CMAN membership service 2.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Token Timeout (10000 ms) retransmit timeout (495 ms)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] token hold (386 ms) retransmits before loss (20 retrans)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] join (60 ms) send_join (0 ms) consensus (4800 ms) merge (200 ms)
Jun 17 16:42:37 fsommfe02 ccsd[4985]: Initial status:: Quorate
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] downcheck (1000 ms) fail to recv const (50 msgs)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] seqno unchanged const (30 rotations) Maximum network MTU 1500
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] window size per rotation (50 messages) maximum messages per rotation (17 messages)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] send threads (0 threads)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] RRP token expired timeout (495 ms)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] RRP token problem counter (2000 ms)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] RRP threshold (10 problem count)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] RRP mode set to none.
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] heartbeat_failures_allowed (0)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] max_network_delay (50 ms)
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Receive multicast socket recv buffer size (262142 bytes).
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes).
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] The network interface [192.168.100.2] is now up.
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Created or loaded sequence id 16.192.168.100.2 for this ring.
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] entering GATHER state from 15.
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais extended virtual synchrony service'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais cluster membership service B.01.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais availability management framework B.01.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais checkpoint service B.01.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais event service B.01.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais distributed locking service B.01.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais message service B.01.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais configuration service'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais cluster closed process group service v1.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [SERV ] Initialising service handler 'openais CMAN membership service 2.01'
Jun 17 16:42:37 fsommfe02 openais[4991]: [CMAN ] CMAN 2.0.84 (built Apr 15 2008 16:19:19) started
Jun 17 16:42:37 fsommfe02 openais[4991]: [SYNC ] Not using a virtual synchrony filter.
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Creating commit token because I am the rep.
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Saving state aru 0 high seq received 0
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Storing new sequence id for ring 14
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] entering COMMIT state.
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] entering RECOVERY state.
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] position [0] member 192.168.100.2:
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] previous ring seq 16 rep 192.168.100.2
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] aru 0 high delivered 0 received flag 1
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Did not need to originate any messages in recovery.
Jun 17 16:42:37 fsommfe02 openais[4991]: [TOTEM] Sending initial ORF token
Jun 17 16:42:37 fsommfe02 openais[4991]: [CLM ] CLM CONFIGURATION CHANGE
Jun 17 16:42:37 fsommfe02 openais[4991]: [CLM ] New Configuration:
Jun 17 16:42:37 fsommfe02 openais[4991]: [CLM ] Members Left:
Jun 17 16:42:37 fsommfe02 openais[4991]: [CLM ] Members Joined:
Jun 17 16:42:38 fsommfe02 openais[4991]: [CLM ] CLM CONFIGURATION CHANGE
Jun 17 16:42:38 fsommfe02 openais[4991]: [CLM ] New Configuration:
Jun 17 16:42:38 fsommfe02 openais[4991]: [CLM ] r(0) ip(192.168.100.2)
Jun 17 16:42:38 fsommfe02 openais[4991]: [CLM ] Members Left:
Jun 17 16:42:38 fsommfe02 openais[4991]: [CLM ] Members Joined:
Jun 17 16:42:38 fsommfe02 openais[4991]: [CLM ] r(0) ip(192.168.100.2)
Jun 17 16:42:38 fsommfe02 openais[4991]: [SYNC ] This node is within the primary component and will provide service.
Jun 17 16:42:38 fsommfe02 openais[4991]: [TOTEM] entering OPERATIONAL state.
Jun 17 16:42:38 fsommfe02 openais[4991]: [CMAN ] quorum regained, resuming activity
Jun 17 16:42:38 fsommfe02 openais[4991]: [CLM ] got nodejoin message 192.168.100.2
Jun 17 16:42:42 fsommfe02 fenced[5008]: fsommfe01 not a cluster member after 3 sec post_join_delay
Jun 17 16:42:42 fsommfe02 fenced[5008]: fencing node "fsommfe01"
Jun 17 16:42:42 fsommfe02 fence_manual: Node fsommfe01 needs to be reset before recovery can procede. Waiting for fsommfe01 to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n fsommfe01)


7.both nodes can ping each other;
[root@fsommfe01 cluster]# ping fsommfe02
PING fsommfe02 (192.168.100.2) 56(84) bytes of data.
64 bytes from fsommfe02 (192.168.100.2): icmp_seq=1 ttl=64 time=0.074 ms
64 bytes from fsommfe02 (192.168.100.2): icmp_seq=2 ttl=64 time=0.061 ms

[root@fsommfe02 cluster]# ping fsommfe01
PING fsommfe01 (192.168.100.1) 56(84) bytes of data.
64 bytes from fsommfe01 (192.168.100.1): icmp_seq=1 ttl=64 time=0.065 ms
64 bytes from fsommfe01 (192.168.100.1): icmp_seq=2 ttl=64 time=0.061 ms

Here are my questions;

I can't get the two nodes, joining the cluster...
I guess I'm missing something at the cluster.conf file??? I can't find what I'm making wrong.

In my test envrionment,the two nodes connecting the switch to deliver the heartbeat,if my switch fails, I think the two nodes will think the opposite as dead and fence it. and method to solve this? if it's yes any guide given to me to follow?/thank you very much;

My os version is RHEL5.2, my friend told me if my cluster software version is lower or equal to 5.3,upgrade it to 5.3 from RHN,does it true???

My sincere thanks

Thanks for any help!
 
Old 06-21-2010, 12:37 PM   #2
bret
Member
 
Registered: Apr 2005
Location: SLC, Utah
Distribution: RHEL 4, RHEL 5
Posts: 59

Rep: Reputation: 16
If you're getting an endless boot problem, it then sounds like fencing is not configured properly. I also ran into this issue with my setup (2 dell PE 1950s). Once I set up fencing to use DRAC, I was good to go.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
mysqld node of mysql cluster system not connecting to management node coal-fire-ice Linux - Server 1 07-27-2015 08:33 AM
[SOLVED] Rocks Cluster node asking for rolls upon node install Shouraku ROCK 1 04-15-2010 10:28 AM
Heartbeat cluster won't recognize other node, resource won't start. slinx Linux - Enterprise 2 04-08-2010 10:48 PM
Heartbeat cluster won't recognize other node, resource won't start. slinx Linux - Enterprise 1 03-31-2010 11:19 AM
How to make a DRBD node start itself as a primary node automatically? pyruby Linux - Newbie 1 01-29-2010 12:41 PM

LinuxQuestions.org > Forums > Enterprise Linux Forums > Linux - Enterprise

All times are GMT -5. The time now is 12:09 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration