Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-61637

Rebalance failed with reason 'quorum_lost, leader_activities_error'

    XMLWordPrintable

Details

    Description

      Build: 7.6.2-3551

      Steps:

      • 3 node cluster, magma bucket with replica=1
      • Load 100K items as initial load
      • Rebalance in new node

        +---------------+-----------------------+------+--------------+
        | Nodes         | Version               | CPU  | Status       |
        +---------------+---------------------------------+------------
        | 172.23.108.67 | 7.6.2-3551-enterprise | 4.10 | Cluster node |
        | 172.23.108.68 | 7.6.2-3551-enterprise | 3.80 | Cluster node |
        | 172.23.108.69 | 7.6.2-3551-enterprise | 3.87 | Cluster node |
        | 172.23.108.70 |                       |      | <--- IN ---  |
        +---------------+-----------------------+------+--------------+

      • Trigger compaction when reblance is running

      Observation:

      [error_logger:error,2024-04-24T05:43:15.286-07:00,ns_1@172.23.108.67:<0.10755.0>:ale_error_logger_handler:do_log:101]
      =========================CRASH REPORT=========================
        crasher:
          initial call: erlang:apply/2
          pid: <0.10755.0>
          registered_name: []
          exception error: no match of right hand side value
                           {leader_activities_error,
                               {default,rebalance},
                               {quorum_lost,{lease_lost,'ns_1@172.23.108.68'}}}
            in function  ns_rebalancer:rebalance/7 (src/ns_rebalancer.erl, line 456)
          ancestors: [<0.2349.0>,ns_orchestrator_child_sup,ns_orchestrator_sup,
                        mb_master_sup,mb_master,leader_registry_sup,
                        leader_services_sup,<0.775.0>,ns_server_sup,
                        ns_server_nodes_sup,<0.290.0>,ns_server_cluster_sup,
                        root_sup,<0.155.0>]
          message_queue_len: 0
          messages: []
          links: [<0.2349.0>]
          dictionary: []
          trap_exit: false
          status: running
          heap_size: 17731
          stack_size: 28
          reductions: 3331
        neighbours:[user:error,2024-04-24T05:43:15.286-07:00,ns_1@172.23.108.67:<0.2349.0>:ns_orchestrator:log_rebalance_completion:1661]Rebalance exited with reason {{badmatch,
                                     {leader_activities_error,
                                      {default,rebalance},
                                      {quorum_lost,
                                       {lease_lost,'ns_1@172.23.108.68'}}}},
                                    [{ns_rebalancer,rebalance,7,
                                      [{file,"src/ns_rebalancer.erl"},{line,456}]},
                                     {proc_lib,init_p_do_apply,3,
                                      [{file,"proc_lib.erl"},{line,240}]}]}.
      Rebalance Operation Id = c76bf5afdfac41ec65c2622c133de7aa
      [ns_server:debug,2024-04-24T05:43:15.286-07:00,ns_1@172.23.108.67:net_kernel<0.2139.0>:cb_dist:info_msg:1098]cb_dist: Setting up new connection to 'ns_1@172.23.108.68' using inet_tcp_dist
      [ns_server:debug,2024-04-24T05:43:15.286-07:00,ns_1@172.23.108.67:<0.2349.0>:auto_rebalance:retry_rebalance:58]Retry rebalance is not enabled. Failed Rebalance with Id c76bf5afdfac41ec65c2622c133de7aa will not be retried.
      [ns_server:debug,2024-04-24T05:43:15.286-07:00,ns_1@172.23.108.67:cb_dist<0.2137.0>:cb_dist:info_msg:1098]cb_dist: Added connection {con,#Ref<0.1485091246.127401988.144254>,
                                     inet_tcp_dist,undefined,undefined}
      [ns_server:debug,2024-04-24T05:43:15.286-07:00,ns_1@172.23.108.67:cb_dist<0.2137.0>:cb_dist:info_msg:1098]cb_dist: Updated connection: {con,#Ref<0.1485091246.127401988.144254>,
                                        inet_tcp_dist,<0.12065.0>,
                                        #Ref<0.1485091246.127401988.144257>}
      [ns_server:info,2024-04-24T05:43:15.288-07:00,ns_1@172.23.108.67:<0.11983.0>:compaction_daemon:maybe_compact_vbucket:753]Compaction of <<"default/51">> has finished with ok
      

      Attachments

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              ashwin.govindarajulu Ashwin Govindarajulu
              ashwin.govindarajulu Ashwin Govindarajulu
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty