<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 TRANSITIONAL//EN">

<HTML>

<HEAD>

  <META HTTP-EQUIV="Content-Type" CONTENT="text/html; CHARSET=UTF-8">

  <META NAME="GENERATOR" CONTENT="GtkHTML/3.28.2">

</HEAD>

<BODY>

Steven<BR>

<BR>

I still do not understand why Corosync was forked off from OpenAIS.<BR>

<BR>

We now have an incompatible mess of two 80% overlapping and partially co-dependant applications (You cannot even now install OpenAIS now without first installing Corosync).<BR>

<BR>

What would it take to re-merge these two ugly duckings back into a single cohesive product again?<BR>

<BR>

Darren<BR>

<BR>

<BR>

<BR>

On Tue, 2010-04-06 at 15:57 -0700, Steven Dake wrote:

<BLOCKQUOTE TYPE=CITE>

<PRE>

On Tue, 2010-04-06 at 15:26 +0200, Filip Sakalos wrote:

&gt; Hi,

&gt; 

&gt; I am using openAIS and Pacemaker for clustering. I want to use two

&gt; rings for communication between nodes. The problem is, that one of the

&gt; rings is always marked as faulty on on one or both nodes:

&gt; 

&gt;  xen1:/home/filip # openais-cfgtool -s

&gt;  Printing ring status.

&gt;  RING ID 0

&gt;          id      = 192.168.58.124

&gt;          status  = Marking ringid 0 interface 192.168.58.124 FAULTY -

&gt; adminisrtative intervention required.

&gt;  RING ID 1

&gt;          id      = 192.168.7.1

&gt;          status  = ring 1 active with no faults

&gt; 

&gt; 

&gt; Same on the other node:

&gt; 

&gt; xen2:~ # openais-cfgtool -s

&gt; Printing ring status.

&gt; RING ID 0

&gt;         id      = 192.168.58.172

&gt;         status  = Marking seqid 12298 ringid 0 interface

&gt; 192.168.58.172 FAULTY - adminisrtative intervention required.

&gt; RING ID 1

&gt;         id      = 192.168.7.2

&gt;         status  = ring 1 active with no faults

&gt; 

&gt; This is my configuration file (/etc/ais/openais.conf):

&gt; 

&gt; # Please read the openais.conf.5 manual page

&gt; 

&gt; aisexec {

&gt;     # Run as root - this is necessary to be able to manage resources

&gt; with Pacemaker

&gt;     user:    root

&gt;     group:    root

&gt; }

&gt; 

&gt; service {

&gt;     # Load the Pacemaker Cluster Resource Manager

&gt;     ver:       0

&gt;     name:      pacemaker

&gt;     use_mgmtd: 1

&gt; }

&gt; 

&gt; totem {

&gt;     version: 2

&gt; 

&gt;     # How long before declaring a token lost (ms)

&gt;     token:          1000

&gt; 

&gt;     # How many token retransmits before forming a new configuration

&gt;     token_retransmits_before_loss_const: 10

&gt; 

&gt;     # How long to wait for join messages in the membership protocol (ms)

&gt;     join:           60

&gt; 

&gt;     # How long to wait for consensus to be achieved before starting a

&gt; new round of membership configuration (ms)

&gt;     consensus:      1500

&gt; 

&gt;     # Turn off the virtual synchrony filter

&gt;     vsftype:        none

&gt; 

&gt;     # Number of messages that may be sent by one processor on receipt

&gt; of the token

&gt;     max_messages:   20

&gt; 

&gt;     # Stagger sending the node join messages by 1..send_join ms

&gt;     send_join: 45

&gt; 

&gt;     # Limit generated nodeids to 31-bits (positive signed integers)

&gt;     clear_node_high_bit: yes

&gt; 

&gt;     # Disable encryption

&gt;     secauth:    on

&gt; 

&gt;     # How many threads to use for encryption/decryption

&gt;     threads:       0

&gt; 

&gt;     # Optionally assign a fixed node id (integer)

&gt;     # nodeid:         1234

&gt; 

&gt;     rrp_mode: passive

&gt; 

&gt;     interface {

&gt;         ringnumber: 0

&gt;         # The following values need to be set based on your environment

&gt;         bindnetaddr: 192.168.58.0

&gt;         mcastaddr: 226.94.1.1

&gt;         mcastport: 5405

&gt;     }

&gt; 

&gt;     interface {

&gt; 

&gt;         ringnumber: 1

&gt;         bindnetaddr: 192.168.7.0

&gt;         mcastaddr: 226.94.1.2

&gt;         mcastport: 5405

&gt;     }

&gt; }

&gt; 

&gt; #logging {

&gt; #    debug: off

&gt; #    fileline: off

&gt; #    to_syslog: yes

&gt; #    to_stderr: off

&gt; #    syslog_facility: daemon

&gt; #    timestamp: on

&gt; #}

&gt; 

&gt; logging {

&gt;     debug: on

&gt;     to_file: yes

&gt;     logfile: /var/log/openais.log

&gt;     to_syslog: yes

&gt;     syslog_facility: daemon

&gt;     timestamp: on

&gt; }

&gt; 

&gt; amf {

&gt;     mode: disabled

&gt; }

&gt; 

&gt; #eof

&gt; 

&gt; I can ping the other node without problem, ssh works too. Can anyone help?

&gt; 

&gt; 

I recommend using corosync instead of openais.  Corosync is much more

suitable for running pacemaker yet is nearly the same from a user

perspective (similar configuration, etc).

Provide the syslog output for the two nodes

Run ifconfig on the nodes and paste the output

Regards

-steve

&gt; 

&gt; Sincerely,

&gt; Filip Sakalos

&gt; _______________________________________________

&gt; Openais mailing list

&gt; <A HREF="mailto:Openais@lists.linux-foundation.org">Openais@lists.linux-foundation.org</A>

&gt; <A HREF="https://lists.linux-foundation.org/mailman/listinfo/openais">https://lists.linux-foundation.org/mailman/listinfo/openais</A>

_______________________________________________

Openais mailing list

<A HREF="mailto:Openais@lists.linux-foundation.org">Openais@lists.linux-foundation.org</A>

<A HREF="https://lists.linux-foundation.org/mailman/listinfo/openais">https://lists.linux-foundation.org/mailman/listinfo/openais</A>

</PRE>

</BLOCKQUOTE>

</BODY>

</HTML>