Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-30750

Cluster getting into un-autofailoverable state because it's stuck in warmup

    XMLWordPrintable

Details

    • Bug
    • Resolution: Won't Fix
    • Major
    • 6.5.0
    • 5.5.0
    • ns_server
    • Untriaged
    • Unknown

    Description

      Linked to https://issues.couchbase.com/browse/K8S-497

      The linked issue has logs and stuff attached.

      QE's test basically starts with a 1 node cluster and an empty bucket with 1 replica.  It then adds 2 new nodes and kills one as soon as we communicate a rebalance has started.  This is inherently racy in that sometimes the node is reported as failed-add (which is tested for), much rarer sometimes goes down then fails over and sometimes refuses to auto fail over and needs manual intervention.

      It is this last case that we're interested in specifically as it requires user intervention.  In general QE will need to learn to handle non-deterministic behaviour in their test cases.

      Attachments

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              mikew Mike Wiederhold [X] (Inactive)
              simon.murray Simon Murray
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty