Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-11621

Rebalance fails when adding node back after failover (delta recovery) due to "{badmatch, {error,{failed_nodes,['ns_1@172.23.96.14']}}}"

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • 3.0
    • 3.0
    • couchbase-bucket
    • Security Level: Public
    • Build 3.0.0-918 (Beta)

      Platform = Physical
      OS = CentOS 6.5
      CPU = Intel Xeon E5-2630 (24 vCPU)
      Memory = 64 GB
      Disk = RAID 10 HDD

    Description

      4 nodes, 1 bucket x 100M x 2KB, 10K mixed ops/sec

      Steps:
      1. Manually fail over one node (172.23.96.14)
      2. Add it back
      3. Sleep 20 minutes
      4. Trigger rebalance "in"

      Warmup completed successfully but node was automatically "failovered" shortly after that.

      Attachments

        For Gerrit Dashboard: MB-11621
        # Subject Branch Project Status CR V

        Activity

          People

            pavelpaulau Pavel Paulau (Inactive)
            pavelpaulau Pavel Paulau (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty