Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-5115

New late bucket database files deletion logic may fail to delete bucket db files after interrupted failover

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • 1.8.1
    • 1.8.1-release-candidate
    • ns_server
    • Security Level: Public
    • None

    Description

      Here's what I'm found in code while trying to address MB-5062 (second instance):

      • on cluster with multiple buckets somebody starts failover
      • our failover is sequential. Imagine orchestrator crashes after failing over node for one bucket and before failing over another bucket
      • then orchestrator is restarted and user decides to add node back instead of continuing failover
      • our code that detects if database files need to be deleted, now incorrectly does not deletes any db files on that partly failed over node, because one of buckets still resides on this node. Thus first bucket for which failover was completed is reusing old data files on that node.

      Attachments

        No reviews matched the request. Check your Options in the drop-down menu of this sections header.

        Activity

          People

            alkondratenko Aleksey Kondratenko (Inactive)
            alkondratenko Aleksey Kondratenko (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty