Details
-
Bug
-
Resolution: Fixed
-
Critical
-
7.1.0
-
Enterprise Edition 7.1.0 build 2179
-
Untriaged
-
-
1
-
Unknown
Description
Steps:
- 111 node cluster running multiple services
- Two Couchbase buckets
- One KV node failed over
- Updated travel-sample bucket to replica=3
- Performed full add-back of the failed node
- Triggered rebalance
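The steps above can be sketched as a sequence of cluster REST calls. This is a minimal sketch, not the script used for this run: the node names are hypothetical, the failover is assumed to be a hard failover (the report does not say which kind was used), and the bucket update is assumed to accept `replicaNumber` on its own. The function only builds the requests so the order and parameters can be reviewed before sending anything.

```python
from urllib.parse import urlencode


def reproduction_requests(failed_node, known_nodes,
                          bucket="travel-sample", replicas=3):
    """Return ordered (method, path, form_body) tuples for the scenario.

    failed_node and known_nodes are otpNode names such as 'ns_1@10.0.0.2'
    (hypothetical values; substitute the real cluster members).
    """
    return [
        # 1. Fail over one KV node (assumed: hard failover).
        ("POST", "/controller/failOver",
         urlencode({"otpNode": failed_node})),
        # 2. Raise the replica count on the bucket.
        ("POST", f"/pools/default/buckets/{bucket}",
         urlencode({"replicaNumber": replicas})),
        # 3. Mark the failed node for full recovery (full add-back).
        ("POST", "/controller/setRecoveryType",
         urlencode({"otpNode": failed_node, "recoveryType": "full"})),
        # 4. Trigger the rebalance across all known nodes.
        ("POST", "/controller/rebalance",
         urlencode({"knownNodes": ",".join(known_nodes),
                    "ejectedNodes": ""})),
    ]


if __name__ == "__main__":
    nodes = ["ns_1@10.0.0.1", "ns_1@10.0.0.2"]  # hypothetical
    for method, path, body in reproduction_requests(nodes[1], nodes):
        print(method, path, body)
```

Each tuple would be sent to the cluster manager port of any node with administrator credentials and a form-encoded body.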
Observation:
Rebalance failed with the following reason:
Rebalance exited with reason {service_rebalance_failed,n1ql,
{agent_died,<33863.30199.52>,
{linked_process_died,<33863.14699.864>,
{'ns_1@172.23.122.178',
{timeout,
{gen_server,call,
[<33863.3087.53>,
{call,"ServiceAPI.PrepareTopologyChange",
#Fun<json_rpc_connection.0.86436583>,
#{timeout => 60000}},
60000]}}}}}}.
Rebalance Operation Id = f821e08ef755342990a10bc0f2641714
After the failure, the following issues were observed:
- The Rebalance button is not enabled in the UI
- The REST endpoint reports that the cluster is balanced as per ns_server (but the auto-failover counter is not reset)
"balanced": true,
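The inconsistency described above (cluster reported as balanced while the auto-failover counter stays non-zero) could be checked with something like the following. This is a sketch under assumptions: it reads the `balanced` and `rebalanceStatus` fields from `/pools/default` and the `count` field from `/settings/autoFailover`; the host, port, and credentials are placeholders, and `stale_state_symptoms` is a hypothetical helper, not part of any Couchbase tooling.

```python
import base64
import json
import urllib.request


def fetch_json(base_url, path, user, password):
    """GET a cluster REST path with basic auth and decode the JSON body."""
    req = urllib.request.Request(base_url + path)
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    req.add_header("Authorization", "Basic " + token)
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def stale_state_symptoms(pool_info, auto_failover):
    """Return a list of human-readable inconsistencies, empty if none.

    pool_info:     decoded /pools/default response
    auto_failover: decoded /settings/autoFailover response
    """
    symptoms = []
    balanced = bool(pool_info.get("balanced"))
    count = auto_failover.get("count", 0)
    if balanced and count > 0:
        symptoms.append(
            f"cluster reports balanced=true but auto-failover count is {count}")
    if balanced and pool_info.get("rebalanceStatus", "none") != "none":
        symptoms.append("cluster reports balanced=true while a rebalance is running")
    return symptoms


if __name__ == "__main__":
    base = "http://127.0.0.1:8091"  # placeholder host and port
    pool = fetch_json(base, "/pools/default", "Administrator", "password")
    afo = fetch_json(base, "/settings/autoFailover", "Administrator", "password")
    for line in stale_state_symptoms(pool, afo):
        print(line)
```

Keeping the comparison separate from the HTTP fetch makes it possible to run the check against responses captured in the collected logs.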
Attachments
For Gerrit Dashboard: MB-50716

| # | Subject | Branch | Project | Status | CR | V |
|---|---|---|---|---|---|---|
| 169928,1 | MB-50716 Add diagnostics. | master | query | ABANDONED | 0 | 0 |
| 170091,4 | MB-50716 Reuse prepared admin operations where possible. | master | query | MERGED | +2 | +1 |