Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-3164

Need to fence failed nodes

    XMLWordPrintable

Details

    • Improvement
    • Resolution: Incomplete
    • Major
    • feature-backlog
    • None
    • ns_server
    • Security Level: Public
    • None

    Description

      Right now, failed nodes are kept in the cluster until rebalance. Because of the "all or nothing" nature of Erlang access control, this leaves us open to all sorts of Byzantine failures, and it means we'd have to add complexity to prevent, say, the orchestrator from running on the failed node. It would be easier, simpler, and far less bug-prone to kick the node out of the cluster and implement the "add back" functionality in a different way, or completely remove the "add back" functionality.

      Attachments

        No reviews matched the request. Check your Options in the drop-down menu of this sections header.

        Activity

          People

            alkondratenko Aleksey Kondratenko (Inactive)
            sean Sean Lynch (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty