Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-60008

Inconsistent NC behavior when CC is failed-over

    XMLWordPrintable

Details

    • Untriaged
    • 0
    • Unknown

    Description

      When the CC is failed over, the behavior varies depending on the state of the NC:

      1. for NCs which were connected to the CC at the time of failover, they will start reporting 503s for requests.
      2. While the CC is failed over, the NCs will constantly attempt to reestablish a connection to the CC in vain- or worse if the failed-over CC is network partitioned from the ns_server orchestrator, but otherwise reachable by the NC. This will cause the NC to send requests to a CC which should be failed over, leading to inconsistent results.
      3. for NCs whose java or cbas process restarts for any reason, they do not start the driver, resulting in connection refused for any requests

      It seems to be the correct behavior should be:

      1. 503 for requests when CC is failed over
        • this should be the case irrespective of the when the NC process started
      2. NCs should not keep spamming failed-over CCs in vain- they should wait for a topology update indicating a restored or replacement CC.

      Attachments

        No reviews matched the request. Check your Options in the drop-down menu of this sections header.

        Activity

          People

            michael.blow Michael Blow
            michael.blow Michael Blow
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty