Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-30327

[System Test] Index node rebalance hung for ~20 hrs

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • 5.5.0
    • 5.5.0
    • secondary-index
    • centos-launcher-2

    Description

      Build : 5.5.0-2938
      Test : -test tests/2i/test_idx_rebalance_replica_vulcan_kv_opt.yml -scope tests/2i/scope_idx_rebalance_replica_vulcan_new.yml
      Scale : 2
      Iteration : 1

      The test has a step to rebalance out 2 indexer nodes - 172.23.96.48 & 172.23.96.254. This rebalance operation is stuck for almost 20 hrs now because on node 172.23.96.122, index building activity for other-1.#primary is not getting completed.

      From the stats, another discrepancy is observed. When I collected indexer stats first, following were the numbers for other-1.#primary :
      "other-1:#primary:num_docs_indexed":96656563,
      "other-1:#primary:num_docs_pending":50,
      "other-1:#primary:num_docs_processed":157305700,
      "other-1:#primary:num_docs_queued":56,
      Then, after 1 min, when I collected stats again, the same stats show totally different numbers:
      "other-1:#primary:num_docs_indexed":82702150,
      "other-1:#primary:num_docs_pending":4419513,
      "other-1:#primary:num_docs_processed":152954092,
      "other-1:#primary:num_docs_queued":3,

      How can the value for num_docs_processed go down? Also the ops rate for other-1 bucket is around 1700-1800 ops / sec, and the total item count is around 1.1M

      The environment is available for debugging : http://172.23.96.206:8091

      Attachments

        1. patch-MB-30327.1.diff
          0.9 kB
        2. patch-MB-30327.diff
          0.8 kB
        3. patch-MB-30327.fix.diff
          0.5 kB
        4. verbose
          124 kB
        For Gerrit Dashboard: MB-30327
        # Subject Branch Project Status CR V

        Activity

          People

            prataprc Pratap Chakravarthy (Inactive)
            mihir.kamdar Mihir Kamdar (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty