[Openais] [PATCH orosync] select a new sync member if the node with the lowest nodeid has left.

Angus Salkeld asalkeld at redhat.com
Thu Apr 22 15:21:13 PDT 2010


On Thu, 2010-04-22 at 16:35 -0500, David Teigland wrote:
> On Thu, Apr 22, 2010 at 11:06:19AM +1000, Angus Salkeld wrote:
> > Problem:
> > 
> > Under certain circumstances cpg does not send group leave messages.
> > 
> > With a big token timeout (tested with token == 5min).
> > 1 start all nodes
> > 2 start ./test/testcpg on all nodes
> > 2 go to the node with the lowest nodeid
> > 3 ifconfig <int> down && killall -9 corosync && /etc/init.d/corosync restart && ./testcpg
> > 4 the other nodes will not get the cpg leave event
> > 5 testcpg reports an extra cpg group (basically one was not removed)
> > 
> > Solution:
> > If a member gets removed using the new trans_list and
> > that member is the node used for syncing (lowest nodeid)
> > then the next lowest node needs to be chosen for syncing.
> > 
> > David would you mind confirming that this solves your problem?
> 
> It works great, thanks!
> Dave
> 
Thanks for testing.

-Angus




More information about the Openais mailing list