LinuxQuestions.org
Review your favorite Linux distribution.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 05-04-2017, 10:13 AM   #1
handra
LQ Newbie
 
Registered: May 2017
Posts: 2

Rep: Reputation: Disabled
Unhappy Unable to start cluster (Pacemaker/Corosync)


Hi there,

I am currently trying to configure Pacemaker/Corosync. I managed to install the required packages for the cluster configuration, however I could not start the cluster service. Based on the log file, there was an issue with the directory /var/lib/pacemaker/.

I have tried some suggestions from checking the GID of the root user and ensuring the permission of the folder to be owned by hacluster:haclient, unfortunately there was no luck.

I am currently using RedHat 6.8. Thank you in advance for the help.

Here is the log for your reference:
Quote:
Set r/w permissions for uid=189, gid=189 on /var/log/cluster/corosync.log
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: get_config_opt: Found 'yes' for option: to_syslog
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: get_config_opt: Found 'local4' for option: syslog_facility
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: main: Starting Pacemaker 1.1.14-8.el6 (Build: 70404b0): generated-manpages agent-manpages ascii-docs ncurses libqb-logging libqb-ipc nagios corosync-plugin cman acls
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: qb_ipcs_us_publish: server name: pacemakerd
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: cman_node_name: Using CMAN node name node1 for 1
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: crm_get_peer: Created entry 0780c6e9-7fce-4231-b994-5124f6440300/0x16b6100 for node node1/1 (1 total)
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: crm_get_peer: Node 1 is now known as node1
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: crm_get_peer: Node 1 has uuid node1
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: crm_update_peer_proc: cluster_connect_cpg: Node node1[1] - corosync-cpg is now online
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: init_cman_connection: Configuring Pacemaker to obtain quorum from cman
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: cman_event_callback: Membership 6156: quorum acquired
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: crm_update_peer_state_iter: cman_event_callback: Node node1[1] - state is now member (was (null))
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: cman_node_name: Using CMAN node name node1 for 0
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Using uid=189 and group=189 for process cib
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Forked child 4343 for process cib
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Forked child 4344 for process stonith-ng
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Forked child 4345 for process lrmd
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Using uid=189 and group=189 for process attrd
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Forked child 4346 for process attrd
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Using uid=189 and group=189 for process pengine
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Forked child 4347 for process pengine
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: start_child: Forked child 4348 for process crmd
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: main: Starting mainloop
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: pcmk_cpg_membership: Node 1 joined group pacemakerd (counter=0.0)
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: pcmk_cpg_membership: Node 1 still member of group pacemakerd (peer=node1, counter=0.0)
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4345] CGATE1SP lrmd: info: crm_log_init: Changed active directory to /var/lib/pacemaker/cores/root
May 04 15:20:38 [4345] CGATE1SP lrmd: info: qb_ipcs_us_publish: server name: lrmd
May 04 15:20:38 [4345] CGATE1SP lrmd: info: main: Starting
May 04 15:20:38 [4343] CGATE1SP cib: info: crm_log_init: Cannot change active directory to /var/lib/pacemaker/cores: Permission denied (13)
May 04 15:20:38 [4343] CGATE1SP cib: notice: main: Using new config location: /var/lib/pacemaker/cib
May 04 15:20:38 [4343] CGATE1SP cib: error: crm_is_writable: /var/lib/pacemaker/cib must exist and be a directory
May 04 15:20:38 [4343] CGATE1SP cib: error: main: Bad permissions on /var/lib/pacemaker/cib. Terminating
May 04 15:20:38 [4346] CGATE1SP attrd: notice: crm_cluster_connect: Connecting to cluster infrastructure: cman
May 04 15:20:38 [4337] CGATE1SP pacemakerd: warning: pcmk_child_exit: The cib process (4343) can no longer be respawned, shutting the cluster down.
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: crm_log_init: Changed active directory to /var/lib/pacemaker/cores/root
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: pcmk_shutdown_worker: Shutting down Pacemaker
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: get_cluster_type: Verifying cluster type: 'cman'
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: stop_child: Stopping crmd: Sent -15 to process 4348
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: get_cluster_type: Assuming an active 'cman' cluster
May 04 15:20:38 [4344] CGATE1SP stonith-ng: notice: crm_cluster_connect: Connecting to cluster infrastructure: cman
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4337] CGATE1SP pacemakerd: error: pcmk_child_exit: The crmd process (4348) terminated with signal 15 (core=0)
May 04 15:20:38 [4347] CGATE1SP pengine: info: crm_log_init: Cannot change active directory to /var/lib/pacemaker/cores: Permission denied (13)
May 04 15:20:38 [4347] CGATE1SP pengine: error: crm_is_writable: /var/lib/pacemaker/pengine must exist and be a directory
May 04 15:20:38 [4347] CGATE1SP pengine: error: main: Bad permissions on /var/lib/pacemaker/pengine. Terminating
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: stop_child: Stopping pengine: Sent -15 to process 4347
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4337] CGATE1SP pacemakerd: warning: pcmk_child_exit: The pengine process (4347) can no longer be respawned, shutting the cluster down.
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: stop_child: Stopping attrd: Sent -15 to process 4346
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4346] CGATE1SP attrd: notice: crm_update_peer_state_iter: crm_update_peer_proc: Node node1[1] - state is now member (was (null))
May 04 15:20:38 [4346] CGATE1SP attrd: notice: main: Starting mainloop...
May 04 15:20:38 [4346] CGATE1SP attrd: notice: crm_signal_dispatch: Invoking handler for signal 15: Terminated
May 04 15:20:38 [4346] CGATE1SP attrd: notice: main: Exiting...
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: cman_node_name: Using CMAN node name node1 for 1
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: pcmk_child_exit: The attrd process (4346) exited: OK (0)
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: crm_get_peer: Created entry 719da6bb-59d9-4a1a-b12c-39803290168d/0x1dcd0a0 for node node1/1 (1 total)
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: crm_get_peer: Node 1 is now known as node1
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: crm_get_peer: Node 1 has uuid node1
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: crm_update_peer_proc: cluster_connect_cpg: Node node1[1] - corosync-cpg is now online
May 04 15:20:38 [4344] CGATE1SP stonith-ng: notice: crm_update_peer_state_iter: crm_update_peer_proc: Node node1[1] - state is now member (was (null))
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: init_cs_connection_once: Connection to 'cman': established
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: stop_child: Stopping lrmd: Sent -15 to process 4345
May 04 15:20:38 [4345] CGATE1SP lrmd: notice: crm_signal_dispatch: Invoking handler for signal 15: Terminated
May 04 15:20:38 [4345] CGATE1SP lrmd: info: lrmd_exit: Terminating with 0 clients
May 04 15:20:38 [4345] CGATE1SP lrmd: info: qb_ipcs_us_withdraw: withdrawing server sockets
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:38 [4345] CGATE1SP lrmd: info: crm_xml_cleanup: Cleaning up memory from libxml2
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: pcmk_child_exit: The lrmd process (4345) exited: OK (0)
May 04 15:20:38 [4344] CGATE1SP stonith-ng: info: cman_node_name: Using CMAN node name node1 for 0
May 04 15:20:38 [4337] CGATE1SP pacemakerd: notice: stop_child: Stopping stonith-ng: Sent -15 to process 4344
May 04 15:20:38 [4337] CGATE1SP pacemakerd: info: mcp_cpg_deliver: Ignoring process list sent by peer for local node
May 04 15:20:47 [4344] CGATE1SP stonith-ng: error: setup_cib: Could not connect to the CIB service: Transport endpoint is not connected (-107)
May 04 15:20:47 [4344] CGATE1SP stonith-ng: info: qb_ipcs_us_publish: server name: stonith-ng
May 04 15:20:47 [4344] CGATE1SP stonith-ng: info: main: Starting stonith-ng mainloop
May 04 15:20:47 [4344] CGATE1SP stonith-ng: notice: crm_signal_dispatch: Invoking handler for signal 15: Terminated
May 04 15:20:47 [4344] CGATE1SP stonith-ng: info: stonith_shutdown: Terminating with 0 clients
May 04 15:20:47 [4344] CGATE1SP stonith-ng: info: qb_ipcs_us_withdraw: withdrawing server sockets
May 04 15:20:47 [4344] CGATE1SP stonith-ng: info: main: Done
May 04 15:20:47 [4344] CGATE1SP stonith-ng: info: crm_xml_cleanup: Cleaning up memory from libxml2
May 04 15:20:47 [4337] CGATE1SP pacemakerd: info: pcmk_child_exit: The stonith-ng process (4344) exited: OK (0)
May 04 15:20:47 [4337] CGATE1SP pacemakerd: notice: pcmk_shutdown_worker: Shutdown complete
May 04 15:20:47 [4337] CGATE1SP pacemakerd: notice: pcmk_shutdown_worker: Attempting to inhibit respawning after fatal error
May 04 15:20:47 [4337] CGATE1SP pacemakerd: info: crm_xml_cleanup: Cleaning up memory from libxml2
 
Old 05-06-2017, 01:51 AM   #2
handra
LQ Newbie
 
Registered: May 2017
Posts: 2

Original Poster
Rep: Reputation: Disabled
Anyone has any recommendations on this?
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Pacemaker - corosync - Cluster monitors/probes only active resources sgeigenmueller Linux - Enterprise 0 12-28-2013 06:45 PM
[SOLVED] cluster on slack14 (corosync, pacemaker) ciorny Slackware 2 09-19-2013 02:17 AM
cluster (corosync, pacemaker, drbd, mysql) lost communication between nodes arrals.vl Linux - Server 2 05-10-2012 10:09 AM
Debian Corosync/Pacemaker Cluster Frustrations mpapet Linux - Server 1 05-09-2012 12:40 AM
MySQL HA-cluster with DRBD, Pacemaker and Corosync Patric.F Linux - Server 2 01-28-2012 05:27 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 07:02 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration