Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-59699

[Rebalance] : Rebalance fails with reason {{badmatch,{leader_activities_error,{default,rebalance},{quorum_lost,lease_lost,

    XMLWordPrintable

Details

    Description

      Steps to reproduce

      1. Created a 4 node kv cluster

      2. Created a couchstore bucket named 'default' with 2 replicas and loaded 100000 items into it

      3. Rebalanced out two nodes from the cluster

      4. Continuous persist to majority sync creates, updates and deletes during the rebalance

       

      Rebalance fails with this reason

      2023-11-16T10:12:36.696-08:00, ns_orchestrator:0:critical:message(ns_1@172.23.123.22) - Rebalance exited with reason {{badmatch,{leader_activities_error,{default,rebalance},{quorum_lost,lease_lost,'ns_1@172.23.123.17'}}}},                              [{ns_rebalancer,rebalance,5,                                [{file,"src/ns_rebalancer.erl"},{line,453}]},                               {proc_lib,init_p_do_apply,3,                                [{file,"proc_lib.erl"},{line,240}]}]}.Rebalance Operation Id = 6a6eea07122ece9a13b511c4afbe5f1e 

      Marking as a regression since this was not seen during our recent 7.2.3 runs

      Marking as a blocker since many tests are failing with the same rebalance failure


      TAF Script to reproduce

      guides/gradlew --refresh-dependencies testrunner -P jython=/opt/jython/bin/jython -P 'args=-i /data/workspace/debian-p0-durability-vset00-00-rebalance_out_persist_majority_6.5_P0/testexec.6669.ini num_items=100000,GROUP=P0;durability,durability=PERSIST_TO_MAJORITY,upgrade_version=7.2.4-6966,sirius_url=http://172.23.120.103:4000 -t rebalance_new.rebalance_out.RebalanceOutTests.rebalance_out_with_ops,upgrade_version=7.2.4-6966,GROUP=P0;durability,doc_ops=create:update:delete,nodes_out=2,get-cbcollect-info=True,replicas=2,durability=PERSIST_TO_MAJORITY,log_level=info,nodes_init=4,num_items=100000,sirius_url=http://172.23.120.103:4000,infra_log_level=info'

      Job name : debian-durability_rebalance_out_persist_majority_6.5_P0

      Job ref link : https://cb-logs-qe.s3-website-us-west-2.amazonaws.com/7.2.4-6966/jenkins_logs/test_suite_executor-TAF/285827/

       

      Attachments

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              raghav.sk Raghav S K
              raghav.sk Raghav S K
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty