  1. Couchbase Server
  2. MB-11975

{DCP}:: Bad Replicas lead to Rebalance Exit after add-back with recovery=delta for gracefully failed-over node


Details

    • Bug
    • Resolution: Duplicate
    • Blocker
    • 3.0
    • 3.0
    • couchbase-bucket, ns_server
    • Security Level: Public
    • None
    • 10.6.2.144-10.6.2.150

    Description

      Build 1166, CentOS 6.x

      Test Case:: /testrunner -i ubuntu_x64.ini get-cbcollect-info=False,get-logs=False,stop-on-failure=False,get-coredumps=True,force_kill_memached=False,verify_unacked_bytes=True,total_vbuckets=128,std_vbuckets_dist=5 -t failover.failovertests.FailoverTests.test_failover_then_add_back,replicas=1,num_failed_nodes=1,items=100000,withMutationOps=True,doc_ops=update,upr_check=False,recoveryType=delta,graceful=True,GROUP=P0;GRACEFUL

      1. Create a 7-node cluster
      2. Create the default bucket with 100k items
      3. Gracefully fail over 1 node with mutations running in parallel
      4. Add the node back with recovery type set to delta
      5. Rebalance the cluster with mutations running in parallel
      Starting rebalance, KeepNodes = ['ns_1@10.6.2.146','ns_1@10.6.2.144',
      'ns_1@10.6.2.145','ns_1@10.6.2.147',
      'ns_1@10.6.2.148','ns_1@10.6.2.150',
      'ns_1@10.6.2.149'], EjectNodes = [], Failed over and being ejected nodes = [], Delta recovery nodes = ['ns_1@10.6.2.146'], Delta recovery buckets = all
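
Steps 3 to 5 above can be sketched as a sequence of admin REST requests. This is a minimal illustration only: it assumes the `/controller/startGracefulFailover`, `/controller/setRecoveryType` and `/controller/rebalance` endpoints of the Couchbase admin REST API, and the helper name `delta_recovery_requests` is hypothetical. The test itself drives these steps through testrunner and may issue different calls. The sketch only builds the requests and does not send them.

```python
from urllib.parse import urlencode


def delta_recovery_requests(known_nodes, recovered_node):
    """Return (path, form-encoded body) pairs for steps 3 to 5, in order.

    Assumption: node names are otpNode values such as 'ns_1@10.6.2.146'.
    """
    return [
        # Step 3: graceful failover of a single node.
        ("/controller/startGracefulFailover",
         urlencode({"otpNode": recovered_node})),
        # Step 4: mark the failed-over node for delta recovery.
        ("/controller/setRecoveryType",
         urlencode({"otpNode": recovered_node, "recoveryType": "delta"})),
        # Step 5: rebalance with every node kept and none ejected.
        ("/controller/rebalance",
         urlencode({"knownNodes": ",".join(known_nodes), "ejectedNodes": ""})),
    ]


if __name__ == "__main__":
    nodes = ["ns_1@10.6.2.%d" % i for i in range(144, 151)]
    for path, body in delta_recovery_requests(nodes, "ns_1@10.6.2.146"):
        print(path, body)
```

Each pair would be sent as an authenticated POST to the cluster's admin port, with the mutation workload kept running while steps 3 and 5 execute.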

      Step 5 causes the rebalance to exit with a bad replicators error.

      Bad replicators after rebalance:
      Missing = [{'ns_1@10.6.2.147','ns_1@10.6.2.146',64}]
      Extras = []
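
When comparing runs, it can help to pull the missing replicator entries out of the message programmatically. The sketch below is illustrative and assumes each entry is an Erlang tuple of two quoted node names followed by an integer, as in the output above. The report does not say what the fields mean; reading the integer as a vBucket ID is an assumption, and the name `parse_replicator_tuples` is hypothetical.

```python
import re

# Matches tuples such as {'ns_1@10.6.2.147','ns_1@10.6.2.146',64}.
TUPLE_RE = re.compile(r"\{\s*'([^']+)'\s*,\s*'([^']+)'\s*,\s*(\d+)\s*\}")


def parse_replicator_tuples(text):
    """Return a list of (first_node, second_node, number) from the message text."""
    return [(a, b, int(n)) for a, b, n in TUPLE_RE.findall(text)]


if __name__ == "__main__":
    message = "Missing = [{'ns_1@10.6.2.147','ns_1@10.6.2.146',64}]"
    print(parse_replicator_tuples(message))
```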

      ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 2, u'text': u'Rebalance exited with reason {unexpected_exit,\n {\'EXIT\',<0.16515.3>,\n {dcp_wait_for_data_move_failed,"default",83,\n \'ns_1@172.23.107.153\',\n [\'ns_1@172.23.107.156\'],\n {error,no_stats_for_this_vbucket}}}}\n', u'shortText': u'message', u'serverTime': u'2014-08-15T19:43:54.062Z', u'module': u'ns_orchestrator', u'tstamp': 1408157034062, u'type': u'info'}
      [2014-08-15 19:39:05,783] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 0, u'text': u'<0.16503.3> exited with {unexpected_exit,\n {\'EXIT\',<0.16515.3>,\n {dcp_wait_for_data_move_failed,"default",83,\n \'ns_1@172.23.107.153\',\n [\'ns_1@172.23.107.156\'],\n {error,no_stats_for_this_vbucket}}}}', u'shortText': u'message', u'serverTime': u'2014-08-15T19:43:54.060Z', u'module': u'ns_vbucket_mover', u'tstamp': 1408157034060, u'type': u'critical'}
      [2014-08-15 19:39:05,783] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 0, u'text': u'Bucket "default" rebalance does not seem to be swap rebalance', u'shortText': u'message', u'serverTime': u'2014-08-15T19:43:53.999Z', u'module': u'ns_vbucket_mover', u'tstamp': 1408157033999, u'type': u'info'}
      [2014-08-15 19:39:05,783] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 0, u'text': u'Started rebalancing bucket default', u'shortText': u'message', u'serverTime': u'2014-08-15T19:43:53.398Z', u'module': u'ns_rebalancer', u'tstamp': 1408157033398, u'type': u'info'}
      [2014-08-15 19:39:05,783] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.156', u'code': 0, u'text': u'Bucket "default" loaded on node \'ns_1@172.23.107.156\' in 0 seconds.', u'shortText': u'message', u'serverTime': u'2014-08-15T19:43:52.890Z', u'module': u'ns_memcached', u'tstamp': 1408157032890, u'type': u'info'}
      [2014-08-15 19:39:05,784] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 4, u'text': u"Starting rebalance, KeepNodes = ['ns_1@172.23.107.153','ns_1@172.23.107.156',\n 'ns_1@172.23.107.154','ns_1@172.23.107.157',\n 'ns_1@172.23.107.155'], EjectNodes = [], Failed over and being ejected nodes = [], Delta recovery nodes = ['ns_1@172.23.107.156'], Delta recovery buckets = all", u'shortText': u'message', u'serverTime': u'2014-08-15T19:43:52.317Z', u'module': u'ns_orchestrator', u'tstamp': 1408157032317, u'type': u'info'}
      [2014-08-15 19:39:05,784] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.156', u'code': 0, u'text': u'Shutting down bucket "default" on \'ns_1@172.23.107.156\' for deletion', u'shortText': u'message', u'serverTime': u'2014-08-15T19:42:03.021Z', u'module': u'ns_memcached', u'tstamp': 1408156923021, u'type': u'info'}
      [2014-08-15 19:39:05,784] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 0, u'text': u"Failed over 'ns_1@172.23.107.156': ok", u'shortText': u'message', u'serverTime': u'2014-08-15T19:42:02.728Z', u'module': u'ns_rebalancer', u'tstamp': 1408156922728, u'type': u'info'}
      [2014-08-15 19:39:05,784] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 0, u'text': u"Starting failing over 'ns_1@172.23.107.156'", u'shortText': u'message', u'serverTime': u'2014-08-15T19:42:02.701Z', u'module': u'ns_rebalancer', u'tstamp': 1408156922701, u'type': u'info'}
      [2014-08-15 19:39:05,784] - [rest_client:2011] ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 0, u'text': u'Bucket "default" rebalance does not seem to be swap rebalance', u'shortText': u'message', u'serverTime': u'2014-08-15T19:42:01.975Z', u'module': u'ns_vbucket_mover', u'tstamp': 1408156921975, u'type': u'info'}
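
The entries above are Python dict reprs, so the failing bucket and the number that follows it can be extracted with the standard library. This is a rough sketch: the sample line is abbreviated from the `ns_vbucket_mover` entry above (some keys dropped), the names `parse_entry` and `failed_move` are hypothetical, and treating the integer after the bucket name as the vBucket ID is an assumption.

```python
import ast
import re

# Bucket name and the integer that follows it inside the failure reason.
FAIL_RE = re.compile(r'dcp_wait_for_data_move_failed,\s*"([^"]+)",\s*(\d+)')


def parse_entry(line):
    """Evaluate the dict literal embedded in a rest_client log line."""
    start = line.index("{u'")  # first dict key; u'' prefixes are valid in Python 3
    return ast.literal_eval(line[start:])


def failed_move(entry):
    """Return (bucket, number) if the entry reports a failed data move, else None."""
    match = FAIL_RE.search(entry.get("text", ""))
    return (match.group(1), int(match.group(2))) if match else None


# Abbreviated copy of one entry; the raw string keeps the \n and \' escapes as logged.
SAMPLE = r"""ERROR - {u'node': u'ns_1@172.23.107.153', u'code': 0, u'text': u'<0.16503.3> exited with {unexpected_exit,\n {\'EXIT\',<0.16515.3>,\n {dcp_wait_for_data_move_failed,"default",83,\n \'ns_1@172.23.107.153\',\n [\'ns_1@172.23.107.156\'],\n {error,no_stats_for_this_vbucket}}}}', u'module': u'ns_vbucket_mover', u'type': u'critical'}"""

if __name__ == "__main__":
    entry = parse_entry(SAMPLE)
    print(entry["module"], failed_move(entry))
```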

      ERROR

      This issue is recent; it did not occur when last tested with build 1160.

          People

            chiyoung Chiyoung Seo (Inactive)
            parag Parag Agarwal (Inactive)