Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-60493

[System-test] Index rebalance failed with error "EquivIndexViolation"

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Critical
    • 7.6.0
    • 7.2.3
    • secondary-index
    • 7.2.3-6710
    • Untriaged
    • 0
    • Unknown

    Description

      Index rebalance failed with below error - 

       

      Rebalance exited with reason {service_rebalance_failed,index,
                                    {worker_died,
                                     {'EXIT',<0.1540.153>,
                                      {rebalance_failed,
                                       {service_error,
                                        <<"\nMemoryQuota: 58993934336\nCpuQuota: 36\n--- Violations for index <idx11_gGYNoR5Q 1 (replica 1), default8, scope_0, coll_8> (mem 179.297K, cpu 0) at node svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091 \n\tCannot move to svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 35.2258G, free cpu 30.57924303201806)\n\tCannot move to svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ServerGroupViolation (free mem 43.6656G, free cpu 33.58705070115184)\n\tCannot move to svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ReplicaViolation (free mem 41.0809G, free cpu 32.97896175774347)\n\tCannot move to svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 33.56G, free cpu 31.54383883791578)\n--- Violations for index <idx9_EGV0CTRWM3_idxprefix 2, default8, scope_0, coll_7> (mem 0, cpu 0) at node svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091 \n\tCannot move to svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 35.2258G, free cpu 30.57924303201806)\n\tCannot move to svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ServerGroupViolation (free mem 43.6656G, free cpu 33.58705070115184)\n\tCannot move to svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ReplicaViolation (free mem 41.0809G, free cpu 32.97896175774347)\n\tCannot move to svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 33.56G, free cpu 31.54383883791578)\n--- Violations for index <idx9_EGV0CTRWM3_idxprefix 4, default8, scope_0, coll_7> (mem 0, cpu 0) at node svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091 \n\tCannot move to svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 35.2258G, free cpu 30.57924303201806)\n\tCannot move to svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ServerGroupViolation (free mem 43.6656G, free cpu 33.58705070115184)\n\tCannot move to svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ReplicaViolation (free mem 41.0809G, free cpu 32.97896175774347)\n\tCannot move to svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 33.56G, free cpu 31.54383883791578)\n--- Violations for index <idx9_EGV0CTRWM3_idxprefix 3, default8, scope_0, coll_7> (mem 0, cpu 0) at node svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091 \n\tCannot move to svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 35.2258G, free cpu 30.57924303201806)\n\tCannot move to svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ServerGroupViolation (free mem 43.6656G, free cpu 33.58705070115184)\n\tCannot move to svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ReplicaViolation (free mem 41.0809G, free cpu 32.97896175774347)\n\tCannot move to svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 33.56G, free cpu 31.54383883791578)\n--- Violations for index <idx11_TKBCG4JS0O_idxprefix 2, default8, scope_0, coll_8> (mem 219.313K, cpu 0) at node svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091 \n\tCannot move to svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 35.2258G, free cpu 30.57924303201806)\n\tCannot move to svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ServerGroupViolation (free mem 43.6656G, free cpu 33.58705070115184)\n\tCannot move to svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ReplicaViolation (free mem 41.0809G, free cpu 32.97896175774347)\n\tCannot move to svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 33.56G, free cpu 31.54383883791578)\n--- Violations for index <idx11_TKBCG4JS0O_idxprefix 3 (replica 1), default8, scope_0, coll_8> (mem 66.2422K, cpu 0) at node svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091 \n\tCannot move to svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 35.2258G, free cpu 30.57924303201806)\n\tCannot move to svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ServerGroupViolation (free mem 43.6656G, free cpu 33.58705070115184)\n\tCannot move to svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ReplicaViolation (free mem 41.0809G, free cpu 32.97896175774347)\n\tCannot move to svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 33.56G, free cpu 31.54383883791578)\n--- Violations for index <idx7_DUQMJL2M21_idxprefix 1, default8, scope_0, coll_7> (mem 18.6533K, cpu 0) at node svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091 \n\tCannot move to svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 35.2258G, free cpu 30.57924303201806)\n\tCannot move to svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ServerGroupViolation (free mem 43.6656G, free cpu 33.58705070115184)\n\tCannot move to svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ReplicaViolation (free mem 41.0809G, free cpu 32.97896175774347)\n\tCannot move to svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 33.56G, free cpu 31.54383883791578)\n--- Violations for index <idx7_DUQMJL2M21_idxprefix 2 (replica 1), default8, scope_0, coll_7> (mem 18.2334K, cpu 0) at node svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091 \n\tCannot move to svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 35.2258G, free cpu 30.57924303201806)\n\tCannot move to svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ServerGroupViolation (free mem 43.6656G, free cpu 33.58705070115184)\n\tCannot move to svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: ReplicaViolation (free mem 41.0809G, free cpu 32.97896175774347)\n\tCannot move to svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com:18091: EquivIndexViolation (free mem 33.56G, free cpu 31.54383883791578)\n">>}}}}}.
      Rebalance Operation Id = 78b52f295098dfb231faac5f00cd7acc

      Logs are available here - 

       https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-d-node-013.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-d-node-014.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-d-node-015.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-d-node-016.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-i-node-018.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-i-node-019.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-i-node-020.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-i-node-021.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-i-node-023.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-q-node-017.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-q-node-022.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip
      https://cb-engineering.s3.amazonaws.com/SysTestCapella/collectinfo-2024-01-23T054456-ns_1%40svc-q-node-024.fj0yg7wxliw5k7y9.sandbox.nonprod-project-avengers.com.zip

       

      This is part of system-test run running here - 

      http://qe-jenkins1.sc.couchbase.com/job/cp-cli-gsi-system-test-2/29/console

      Cluster logs are collected via this job for every 1hr interval - 

      http://qe-jenkins1.sc.couchbase.com/job/cp-cli-gsi-system-test-log-analysis-2/4/console

       

      This rebalance is a part of scale down of indexer service configuration from 5 node to 4 nodes. Below was the configuration of Index nodes on the cluster

      {
        "size": 5, 
        "services": ["index"],
        "aws": {
          "instanceSize": "c5.9xlarge",
          "ebsSizeGib": 400
        },
        "azure": {
          "instanceSize": "Standard_F16s_v2",
          "diskSize": "P50"
        },
        "gcp": {
          "instanceSize": "n2-custom-36-73728",
          "storageSizeGib": 450
        }
      }, 

       

       

      Attachments

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              shivansh.rustagi Shivansh Rustagi
              hemant.rajput Hemant Rajput
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty