LinuxQuestions.org
Old 02-06-2017, 11:10 AM   #1
lidoravi
LQ Newbie
 
Registered: Feb 2017
Posts: 1

Rep: Reputation: Disabled
HA cluster - Pacemaker - OFFLINE nodes status


I'm using Pacemaker + Corosync on CentOS 7.
I created the cluster using these commands:
Code:
    pcs cluster auth pcmk01-cr pcmk02-cr -u hacluster -p passwd
    pcs cluster setup --name my_cluster pcmk01-cr pcmk02-cr

    [pcmk01]# pcs cluster start --all
    [root@pcmk01 /]# pcs cluster cib clust_cfg
    [pcmk01]# pcs -f clust_cfg property set stonith-enabled=false
    [root@pcmk01 /]# pcs -f clust_cfg property set no-quorum-policy=ignore
    [root@pcmk01 /]# pcs -f clust_cfg resource defaults resource-stickiness=200
When I check the cluster status, I see strange and different behavior on each node; it looks like the nodes don't know about each other.

pcs status on **NODE1**:
Code:
    [root@rvpcmk01 ~]# pcs status
    Cluster name: my_cluster
    Stack: corosync
    Current DC: rvpcmk01-cr (version 1.1.15-11.el7_3.2-e174ec8) - partition WITHOUT quorum
    Last updated: Mon Feb  6 15:18:18 2017          Last change: Mon Feb  6 15:03:03 2017 by root via cibadmin on rvpcmk01-cr
    
    2 nodes and 0 resources configured
    
    Online: [ rvpcmk01-cr ]
    OFFLINE: [ rvpcmk02-cr ]
    
    No resources
    
    
    PCSD Status:
      rvpcmk01-cr: Online
      rvpcmk02-cr: Online
    
    Daemon Status:
      corosync: active/enabled
      pacemaker: active/enabled
      pcsd: active/enabled

pcs status on **NODE2**:

Code:
    [root@rvpcmk02 ~]# pcs status
    Cluster name: RV_cluster
    Stack: corosync
    Current DC: rvpcmk02-cr (version 1.1.15-11.el7_3.2-e174ec8) - partition WITHOUT quorum
    Last updated: Mon Feb  6 15:19:53 2017          Last change: Mon Feb  6 15:04:12 2017 by root via crm_attribute on rvpcmk02-cr
    
    2 nodes and 0 resources configured
    
    Online: [ rvpcmk02-cr ]
    OFFLINE: [ rvpcmk01-cr ]
    
    No resources
    
    
    PCSD Status:
      rvpcmk01-cr: Offline
      rvpcmk02-cr: Online
    
    Daemon Status:
      corosync: active/enabled
      pacemaker: active/enabled
      pcsd: active/enabled
I also ran this command on both nodes:

NODE1:
Code:
    [root@rvpcmk01 ~]#  pcs status corosync
    
    Membership information
    ----------------------
        Nodeid      Votes Name
             1          1 rvpcmk01-cr (local)
NODE2:

Code:
    [root@rvpcmk02 ~]# pcs status corosync
    
    Membership information
    ----------------------
        Nodeid      Votes Name
             2          1 rvpcmk02-cr (local)
As far as I know, both nodes should appear in the status above.
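
A quick way to make that check repeatable is to count the member rows in the membership table. This is only a sketch: the helper name count_corosync_members is made up for illustration, and it assumes the table layout shown in the outputs above (Nodeid, Votes, Name).

```shell
#!/bin/sh
# Count member rows in `pcs status corosync` output read from stdin.
# A member row starts with two numeric fields (Nodeid, Votes) followed by a name,
# e.g. "         1          1 rvpcmk01-cr (local)".
count_corosync_members() {
    awk '$1 ~ /^[0-9]+$/ && $2 ~ /^[0-9]+$/ && NF >= 3 { n++ } END { print n + 0 }'
}

# Hypothetical usage on a live node (requires pcs and a running corosync):
#   pcs status corosync | count_corosync_members
```

On a healthy two-node cluster this should print 2 on both nodes; with the outputs posted above it prints 1 on each, which matches the symptom.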

Can you please help and advise what I'm missing here?
Why do the nodes not seem to know about each other?

Here are my /etc/hosts files on both servers:

Code:
    [root@rvpcmk01 ~]# cat /etc/hosts
    127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
    ::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
    172.17.235.109 rvpcmkvip
    172.17.235.43 rvpcmk01
    172.17.235.44 rvpcmk02
    172.17.235.75 rvpcmk01-cr
    172.17.235.106 rvpcmk02-cr
    172.17.235.119 rvpcmk01-drbd
    172.17.235.46 rvpcmk02-drbd
Code:
    [root@rvpcmk02 ~]# cat /etc/hosts
    127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
    ::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
    172.17.235.109 rvpcmkvip
    172.17.235.43 rvpcmk01
    172.17.235.44 rvpcmk02
    172.17.235.75 rvpcmk01-cr
    172.17.235.106 rvpcmk02-cr
    172.17.235.119 rvpcmk01-drbd
    172.17.235.46 rvpcmk02-drbd
I checked the authorization. The nodes were definitely authorized when I started configuring the cluster, but now I can see there is a problem, and I don't understand what it is or what the root cause is:
Code:
    [root@rvpcmk02 drbd.d]# pcs cluster auth rvpcmk01-cr rvpcmk02-cr -u hacluster -p passwd --debug
    Running: /usr/bin/ruby -I/usr/lib/pcsd/ /usr/lib/pcsd/pcsd-cli.rb auth
    --Debug Input Start--
    {"username": "hacluster", "local": false, "nodes": ["rvpcmk01-cr", "rvpcmk02-cr"], "password": "passwd", "force": false}
    --Debug Input End--
    
    Return Value: 0
    --Debug Output Start--
    {
      "status": "ok",
      "data": {
        "auth_responses": {
          "rvpcmk01-cr": {
            "status": "noresponse"
          },
          "rvpcmk02-cr": {
            "status": "ok",
            "token": "e340f461-12ef-4701-a1a6-ef44439dda94"
          }
        },
        "sync_successful": true,
        "sync_nodes_err": [
          "rvpcmk01-cr"
        ],
        "sync_responses": {
          "rvpcmk01-cr": {
            "status": "error"
          },
          "rvpcmk02-cr": {
            "status": "ok",
            "result": {
              "tokens": "accepted"
            }
          }
        }
      },
      "log": [
        "I, [2017-02-06T17:58:16.146090 #27368]  INFO -- : PCSD Debugging enabled\n",
        "D, [2017-02-06T17:58:16.146225 #27368] DEBUG -- : Did not detect RHEL 6\n",
        "I, [2017-02-06T17:58:16.146304 #27368]  INFO -- : Running: /usr/sbin/corosync-cmapctl totem.cluster_name\n",
        "I, [2017-02-06T17:58:16.146375 #27368]  INFO -- : CIB USER: hacluster, groups: \n",
        "D, [2017-02-06T17:58:16.156239 #27368] DEBUG -- : [\"totem.cluster_name (str) = RV-cluster\\n\"]\n",
        "D, [2017-02-06T17:58:16.156390 #27368] DEBUG -- : Duration: 0.009834697s\n",
        "I, [2017-02-06T17:58:16.156525 #27368]  INFO -- : Return Value: 0\n",
        "I, [2017-02-06T17:58:16.157212 #27368]  INFO -- : SRWT Node: rvpcmk02-cr Request: check_auth\n",
        "I, [2017-02-06T17:58:16.160119 #27368]  INFO -- : SRWT Node: rvpcmk01-cr Request: check_auth\n",
        "I, [2017-02-06T17:58:16.162104 #27368]  INFO -- : No response from: rvpcmk01-cr request: /check_auth, exception: Connection refused - connect(2)\n",
        "I, [2017-02-06T17:58:16.240477 #27368]  INFO -- : No response from: rvpcmk01-cr request: /auth, exception: Connection refused - connect(2)\n",
        "I, [2017-02-06T17:58:16.387075 #27368]  INFO -- : Running: /usr/sbin/pcs status nodes corosync\n",
        "I, [2017-02-06T17:58:16.387208 #27368]  INFO -- : CIB USER: hacluster, groups: \n",
        "D, [2017-02-06T17:58:16.670689 #27368] DEBUG -- : [\"Corosync Nodes:\\n\", \" Online: rvpcmk02-cr \\n\", \" Offline: rvpcmk01-cr \\n\"]\n",
        "D, [2017-02-06T17:58:16.670889 #27368] DEBUG -- : Duration: 0.28344494s\n",
        "I, [2017-02-06T17:58:16.671033 #27368]  INFO -- : Return Value: 0\n",
        "I, [2017-02-06T17:58:16.671978 #27368]  INFO -- : Sending config 'tokens' version 36 1f36b1c29146694381cd47b96c01d65876ba8db9 to nodes: rvpcmk02-cr, rvpcmk01-cr\n",
        "I, [2017-02-06T17:58:16.672428 #27368]  INFO -- : SRWT Node: rvpcmk01-cr Request: set_configs\n",
        "I, [2017-02-06T17:58:16.673819 #27368]  INFO -- : SRWT Node: rvpcmk02-cr Request: set_configs\n",
        "I, [2017-02-06T17:58:16.681322 #27368]  INFO -- : No response from: rvpcmk01-cr request: /set_configs, exception: Connection refused - connect(2)\n",
        "I, [2017-02-06T17:58:16.783061 #27368]  INFO -- : Sending config response from rvpcmk01-cr: {\"status\"=>\"error\"}\n",
        "I, [2017-02-06T17:58:16.783169 #27368]  INFO -- : Sending config response from rvpcmk02-cr: {\"status\"=>\"ok\", \"result\"=>{\"tokens\"=>\"accepted\"}}\n"
      ]
    }
    
    --Debug Output End--
    
    Error: Unable to communicate with rvpcmk01-cr
    rvpcmk02-cr: Authorized
    Error: Unable to synchronize and save tokens on nodes: rvpcmk01-cr. Are they authorized?

If you need any other info, please tell me what and I will provide it.
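
One detail in the outputs above that may be worth ruling out: node 1 reports "Cluster name: my_cluster" while node 2 reports "Cluster name: RV_cluster", and the debug log on node 2 shows totem.cluster_name as RV-cluster. A minimal sketch for reading the name out of a corosync.conf file, so the two nodes can be compared, follows. The helper name cluster_name_from_conf is made up for illustration, and /etc/corosync/corosync.conf is assumed to be the config path.

```shell
#!/bin/sh
# Print the value of the first "cluster_name:" line in a corosync.conf file.
# $1: path to the file (assumed to be /etc/corosync/corosync.conf on a real node).
cluster_name_from_conf() {
    awk -F': *' '/^[[:space:]]*cluster_name[[:space:]]*:/ { print $2; exit }' "$1"
}

# Hypothetical usage, run on each node and compared by eye:
#   cluster_name_from_conf /etc/corosync/corosync.conf
# The running value can also be read with the command that appears in the log above:
#   corosync-cmapctl totem.cluster_name
```

If the two nodes print different names, they were probably set up as two separate single-node clusters, which would also be consistent with each node listing only itself in the membership table.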
 
Old 02-16-2017, 04:22 PM   #2
dijetlo
Senior Member
 
Registered: Jan 2009
Location: RHELtopia....
Distribution: Solaris 11.2/Slackware/RHEL/
Posts: 1,491
Blog Entries: 2

Rep: Reputation: Disabled
Quote:
--Debug Output Start--
{
"status": "ok",
"data": {
"auth_responses": {
"rvpcmk01-cr": {
"status": "noresponse"
},
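Your node1 is not responding to the auth request. Note that pcs cluster auth talks to pcsd rather than corosync, and the log shows "Connection refused" for every request sent to rvpcmk01-cr.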
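
To pull the unresponsive node names out of a long --debug dump like the one quoted, something along these lines can help. It is a sketch only: the helper name unreachable_nodes is made up, and it assumes the "No response from: <node> request:" wording seen in the log above.

```shell
#!/bin/sh
# Read `pcs cluster auth ... --debug` output on stdin and print, once each,
# every node named in a "No response from: <node> request:" log line.
unreachable_nodes() {
    sed -n 's/.*No response from: \([^ ]*\) request:.*/\1/p' | sort -u
}

# Hypothetical usage on the node where auth fails:
#   pcs cluster auth rvpcmk01-cr rvpcmk02-cr -u hacluster --debug 2>&1 | unreachable_nodes
#
# Possible follow-up checks on each node it prints (pcsd listens on TCP 2224 by default):
#   systemctl status pcsd
#   ss -tln | grep 2224
#   firewall-cmd --list-all
```

"Connection refused" usually means nothing accepted the connection on that port, so a stopped pcsd or a firewall rejecting the port on the reported node are the first things to look at.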
 
Old 04-16-2017, 02:18 AM   #3
chilam
LQ Newbie
 
Registered: Apr 2017
Posts: 1

Rep: Reputation: Disabled
Adding a freeswitch service resource in Pacemaker and Corosync

I need to monitor systemd:freeswitch on the active server, and I configured it using the commands below.
When I stop the freeswitch process on the active node, the switchover to the standby does not happen.
Please advise what the correct configuration is.

-----------------------------------------------------------------------
pcs resource create freeswitch_vip ocf:heartbeat:IPaddr2 ip=10.205.236.35 cidr_netmask=28 op monitor interval=20s
pcs resource create freeswitch_service systemd:freeswitch op monitor interval=20s
pcs constraint colocation add freeswitch_service freeswitch_vip
pcs constraint order freeswitch_vip then freeswitch_service


systemctl stop freeswitch

[root@freeswitch1 log]# pcs status
Cluster name: freeswitch_cluster
Stack: corosync
Current DC: freeswitch1 (version 1.1.15-11.el7_3.4-e174ec8) - partition with quorum
Last updated: Sun Apr 16 02:11:03 2017 Last change: Sun Apr 16 02:10:09 2017 by root via cibadmin on freeswitch1

2 nodes and 3 resources configured

Online: [ freeswitch1 freeswitch2 ]

Full list of resources:

freeswitch_vip (ocf::heartbeat:IPaddr2): Started freeswitch1
freeswitch_recover (ocf::heartbeat:freeswitch_recover): Started freeswitch1
freeswitch_service (systemd:freeswitch): Stopped

Failed Actions:
* freeswitch_service_monitor_20000 on freeswitch1 'not running' (7): call=61, status=complete, exitreason='none',
last-rc-change='Sun Apr 16 02:10:59 2017', queued=0ms, exec=0ms


Daemon Status:
corosync: active/disabled
pacemaker: active/enabled
pcsd: active/enabled
[root@freeswitch1 log]#
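
When looking into why the resource stays stopped instead of moving, it can help to list just the failed operations and the node they failed on. This is a sketch, not a fix: the helper name failed_actions is made up, and it assumes the "Failed Actions:" layout shown in the status output above.

```shell
#!/bin/sh
# Read `pcs status` output on stdin and print "<operation> <node>" for each
# "* <operation> on <node> ..." entry under the "Failed Actions:" heading.
failed_actions() {
    awk '
        /^Failed Actions:/            { in_block = 1; next }
        in_block && /^[[:space:]]*$/  { in_block = 0 }
        in_block && $1 == "*"         { print $2, $4 }
    '
}

# Hypothetical usage on a cluster node:
#   pcs status | failed_actions
#
# Commands that may be useful next (resource name taken from the post above):
#   pcs resource failcount show freeswitch_service
#   pcs resource cleanup freeswitch_service
#   pcs resource meta freeswitch_service migration-threshold=1
```

By default Pacemaker tries to recover a failed resource on the same node first, and migration-threshold controls how many failures are tolerated before it is moved elsewhere, so setting it to 1 is one option to test. Whether that is the cause here is an assumption; the failcount and the logs would need to confirm it.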
 
  

