  1. Couchbase Server
  2. MB-5910

xdcr - "Replication failed" while replicating between 2 clusters - unidirectional replication. Replicated 1/3rd of the items, then replication failed/stopped


Details

    Description

      Setup
      ------------

      1. Set up a 2:2 node cluster.
      2. Create bucket (west2) on the source and bucket (east2) on the destination.
      3. Set up unidirectional replication from (west2) to (east2).
      4. Load 1M+ items on the source; keep the load running.
      5. Add a node at the destination and rebalance.
      6. 598K items are replicated as expected on the destination cluster.
      7. Replication shows a "replication failed" status message. Replication is stopped.
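
For anyone scripting step 3 instead of using the UI, here is a minimal sketch of the form bodies involved. It assumes the REST endpoints `POST /pools/default/remoteClusters` and `POST /controller/createReplication` on the source cluster; the report does not say how the replication was created. The remote cluster reference name `east`, the admin port 8091, and the credentials are hypothetical placeholders. The sketch only builds the request bodies and sends nothing.

```python
from urllib.parse import urlencode


def remote_cluster_form(name, hostname, username, password):
    """Form body for POST /pools/default/remoteClusters (assumed endpoint)."""
    return urlencode({
        "name": name,          # hypothetical reference name for the destination
        "hostname": hostname,  # one destination node, admin port assumed to be 8091
        "username": username,
        "password": password,
    })


def create_replication_form(from_bucket, to_cluster, to_bucket):
    """Form body for POST /controller/createReplication (assumed endpoint)."""
    return urlencode({
        "fromBucket": from_bucket,
        "toCluster": to_cluster,
        "toBucket": to_bucket,
        "replicationType": "continuous",
    })


# Buckets come from the report; "east" and the credentials are placeholders.
print(remote_cluster_form("east", "10.3.3.20:8091", "Administrator", "password"))
print(create_replication_form("west2", "east", "east2"))
```

Both bodies would be posted to a source-cluster node with admin credentials, the remote cluster reference first and the replication second.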

      • Nodes on the destination were intermittently losing connection (this could be an environment issue), but rebalancing at the destination completed successfully.

      Error Output
      ---------------
      1. Replication "failed" at source.

      • Note: Another replication that was deleted shows up under past replications as "Replicating". It should be shown with a "Cancelled/Deleted" status, not "Replicating".

      Attaching screen-shots.
      1. Source cluster
      2. Destination cluster
      3. Replication status page on source node.

      The diags can be accessed at - https://s3.amazonaws.com/bugdb/jira/xdcr-1/xdcr.tar

      ketaki@ubu-2506:~$ /opt/couchbase/bin/cbstats 10.3.3.20:11210 -b east2 all | grep curr_
      curr_connections: 31
      curr_conns_on_port_11209: 27
      curr_conns_on_port_11210: 2
      curr_items: 168999
      curr_items_tot: 373003
      curr_temp_items: 0
      vb_active_curr_items: 168999
      vb_pending_curr_items: 0
      vb_replica_curr_items: 204004
      ketaki@ubu-2506:~$ /opt/couchbase/bin/cbstats 10.3.3.21:11210 -b east2 all | grep curr_
      curr_connections: 33
      curr_conns_on_port_11209: 27
      curr_conns_on_port_11210: 4
      curr_items: 196204
      curr_items_tot: 399743
      curr_temp_items: 0
      vb_active_curr_items: 196204
      vb_pending_curr_items: 0
      vb_replica_curr_items: 203539
      ketaki@ubu-2506:~$ /opt/couchbase/bin/cbstats 10.3.3.22:11210 -b east2 all | grep curr_
      curr_connections: 31
      curr_conns_on_port_11209: 27
      curr_conns_on_port_11210: 2
      curr_items: 232355
      curr_items_tot: 422370
      curr_temp_items: 0
      vb_active_curr_items: 232355
      vb_pending_curr_items: 0
      vb_replica_curr_items: 190015
      ketaki@ubu-2506:~$
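
The per-node numbers above can be totalled to check the 598K figure. This is a minimal sketch that parses `name: value` lines as they appear in the cbstats output pasted above. The helper name `parse_cbstats` is illustrative and not part of any Couchbase tool, and only the relevant lines from each node are reproduced.

```python
import re


def parse_cbstats(text):
    """Parse 'name: value' lines from cbstats output into a dict of ints."""
    stats = {}
    for line in text.splitlines():
        m = re.match(r"\s*(\w+):\s+(\d+)\s*$", line)
        if m:
            stats[m.group(1)] = int(m.group(2))
    return stats


# Values copied from the cbstats output in this report (bucket east2).
outputs = {
    "10.3.3.20": "vb_active_curr_items: 168999\nvb_replica_curr_items: 204004",
    "10.3.3.21": "vb_active_curr_items: 196204\nvb_replica_curr_items: 203539",
    "10.3.3.22": "vb_active_curr_items: 232355\nvb_replica_curr_items: 190015",
}

parsed = [parse_cbstats(o) for o in outputs.values()]
total_active = sum(p["vb_active_curr_items"] for p in parsed)
total_replica = sum(p["vb_replica_curr_items"] for p in parsed)

print(total_active)   # 597558
print(total_replica)  # 597558
```

The active total across the three destination nodes is 597,558, which matches the ~598K reported, and the replica total is the same number. The destination cluster therefore looks internally consistent at the point where replication stopped.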

      Let me know if you need any other information.


          People

            ketaki Ketaki Gangal (Inactive)
            ketaki Ketaki Gangal (Inactive)