Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Fixed
Priority: Critical
Fix Version/s: 7.0.0
Affects Version/s: Cheshire-Cat
Component/s: tools
Labels:
None

Triage:
Untriaged
Story Points:
1
Is this a Regression?:
Unknown

Description

Problem
The fix for ~~MB-45110~~ in ns_server allows nodes that already regularly failed over to be subsequently unsafely failed over. We should permit this also in the CLI. Essentially if the --force option is passed to the CLI when failover is invoked, we should permit it to be applied to nodes that are in the inactiveFailed state. This is needed so clusters don't get stuck as they can do as described in ~~MB-45110~~.

Steps to reproduce
1. Create a 5 node cluster: .137, .138, .139, .140, .142
2. Stop-server on .140 and when the node becomes unresponsive, fail it over, but don't rebalance it out yet.
3. Now stop server on .138, .139 nodes.
4. --force failover of node .138, .139 and .140

Attachments

Issue Links

relates to

MB-45110 [Chronicle] Cluster can get potentially stuck such that we may not be able to remove failed nodes out of the cluster

Closed

Activity

People

Assignee:: Carlos Gonzalez Betancort (Inactive)

Reporter:: Dave Finlay

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 04/Apr/21 8:37 PM

Updated:: 17/Jun/21 3:30 PM

Resolved:: 07/Apr/21 9:21 AM

CLI should allow failing over inactive nodes when allowUnsafe is true