  1. Couchbase Server
  2. MB-15158

GoXDCR tries to start replication on new node before rebalance-in is complete ("Invalid configuration. No source nozzle can be constructed...")

Details

    • Bug
    • Resolution: Duplicate
    • Major
    • bug-backlog
    • 4.0.0, 5.0.0
    • XDCR
    • Security Level: Public
    • None

    Description

      Build


      4.0.0-2206

      Found during manual testing

      C1: .186 (data node)
      1. Set up replication C1.default --> C2.default.
      2. Add node .187 (data node) to C1 and rebalance.
      3. Replication error is seen: "Invalid configuration. No source nozzle can be constructed since the source kv nodes are not the master for any vbuckets"

      Note: both nodes in C1 are data nodes, and the error is seen before rebalance-in is complete. The timing of the error messages suggests that replication is being started from the new node before rebalance has completed, which results in the error.
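
The suspected condition can be illustrated with a minimal Go sketch. This is not goxdcr code: `vbucketMap`, `vbucketsOwnedBy`, the vbucket count of 1024 and the `.187` port are assumptions made for illustration. It assumes a map from each vbucket to its current master node, and shows that a node which has been added but not yet rebalanced owns no vbuckets, which is the situation the error message describes.

```go
package main

import "fmt"

// vbucketMap maps a vbucket number to the node that currently holds its
// active (master) copy. Hypothetical type, for illustration only.
type vbucketMap map[uint16]string

// vbucketsOwnedBy returns the vbuckets for which node is the master.
// The order of the result is not defined, since map iteration is unordered.
func vbucketsOwnedBy(m vbucketMap, node string) []uint16 {
	var owned []uint16
	for vb, master := range m {
		if master == node {
			owned = append(owned, vb)
		}
	}
	return owned
}

func main() {
	// Assumed state right after .187 is added: no vbucket has moved yet,
	// so .186 is still master for every vbucket.
	m := vbucketMap{}
	for vb := uint16(0); vb < 1024; vb++ {
		m[vb] = "10.3.4.186:11210"
	}

	for _, node := range []string{"10.3.4.186:11210", "10.3.4.187:11210"} {
		n := len(vbucketsOwnedBy(m, node))
		if n == 0 {
			// Starting replication on this node now would find no
			// vbuckets to stream from.
			fmt.Printf("%s: master for no vbuckets yet\n", node)
			continue
		}
		fmt.Printf("%s: master for %d vbuckets\n", node, n)
	}
}
```

Under these assumptions, a guard of this kind would let a node defer starting replication until it owns at least one vbucket, rather than reporting a configuration error.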

      Last 6 xdcr errors:

      2015-05-26 17:05:01 map[ToplogyChangeDetector:Topology has changed on source cluster]
      2015-05-26 17:04:59 map[ToplogyChangeDetector:Topology has changed on source cluster]
      2015-05-26 17:04:50 map[ToplogyChangeDetector:Topology has changed on source cluster]
      2015-05-26 17:04:49 map[ToplogyChangeDetector:Topology has changed on source cluster]
      2015-05-26 17:04:40 map[dcp_7bb576d02957c226159302cc3b684822/default/default_10.3.4.186:11210_1:dcp stream for vb=1023 is closed by producer]
      2015-05-26 17:04:38 Invalid configuration. No source nozzle can be constructed since the source kv nodes are not the master for any vbuckets.
      

      Rebalance log:

      Rebalance completed successfully.	ns_orchestrator001	ns_1@10.3.4.186	17:05:17 - Tue May 26, 2015
       
      Replication 7bb576d02957c226159302cc3b684822/default/default started running.	xdcr000	ns_1@10.3.4.186	17:05:01 - Tue May 26, 2015
       
      Replication 7bb576d02957c226159302cc3b684822/default/default started running. (repeated 1 times)	xdcr000	ns_1@10.3.4.186	17:04:53 - Tue May 26, 2015
       
      Replication 7bb576d02957c226159302cc3b684822/default/default failed. err=map[ToplogyChangeDetector:Topology has changed on source cluster]	xdcr000	ns_1@10.3.4.186	17:04:50 - Tue May 26, 2015
       
      Replication 7bb576d02957c226159302cc3b684822/default/default failed. err=map[ToplogyChangeDetector:Topology has changed on source cluster]	xdcr000	ns_1@10.3.4.187	17:04:49 - Tue May 26, 2015
       
      Replication 7bb576d02957c226159302cc3b684822/default/default started running.	xdcr000	ns_1@10.3.4.187	17:04:49 - Tue May 26, 2015
       
      Replication 7bb576d02957c226159302cc3b684822/default/default failed. err=map[dcp_7bb576d02957c226159302cc3b684822/default/default_10.3.4.186:11210_1:dcp stream for vb=1023 is closed by producer]	xdcr000	ns_1@10.3.4.186	17:04:40 - Tue May 26, 2015
       
      Bucket "default" rebalance appears to be swap rebalance	ns_vbucket_mover000	ns_1@10.3.4.186	17:04:39 - Tue May 26, 2015
      Bucket "default" loaded on node 'ns_1@10.3.4.187' in 0 seconds.	ns_memcached000	ns_1@10.3.4.187	17:04:38 - Tue May 26, 2015
       
      Started rebalancing bucket default	ns_rebalancer000	ns_1@10.3.4.186	17:04:38 - Tue May 26, 2015
       
      Starting rebalance, KeepNodes = ['ns_1@10.3.4.186','ns_1@10.3.4.187'], EjectNodes = [], Failed over and being ejected nodes = []; no delta recovery nodes	ns_orchestrator004	ns_1@10.3.4.186	17:04:38 - Tue May 26, 2015
      

      Attaching links to cbcollect from both nodes.

            People

              neil.huang Neil Huang
              apiravi Aruna Piravi (Inactive)
