Details
-
Bug
-
Resolution: Fixed
-
Critical
-
6.5.1, 6.6.1, 6.6.2, 6.5.2, 6.5.0, 6.6.3
-
Untriaged
-
1
-
Unknown
Description
If a node fails to join the cluster because of connectivity problems, it is possible that it connects to the cluster later (when connectivity is fixed) and destroys the cluster's config by wining all ns_config merge conflics.
To repro:
0) To emulate connectivity problems modify the code (cb_dist:setup), so that dist connections can't be established if a specific key in ets table is set
1) Start single node cluster, create sample bucket (node1) and disable dist connections establishment using #0
2) Start another node (node2), and disable dist connections establishment for it as well (using #0)
3) Try adding node2 to node1, addition will fail with the same reason as in the logs above
4) Enable dist conneciton establishment on node2
5) In few seconds sample bucket from node1 disappears
So broken connectivity and a little bit of bad luck is all that is needed for the problem to happen
Attachments
Issue Links
- is a backport of
-
MB-46153 A node crashing during completeJoin goes into an infinite restart loop
- Closed