Details
- Type: Bug
- Resolution: Fixed
- Priority: Critical
- Affects Version/s: 6.6.5, Cheshire-Cat
- Environment: Kubernetes 1.19, Operator 2.1
- Triage: Untriaged
- 1
- Is this a Regression?: Unknown
Description
What the test does
Spins up a 3-node cluster, kills a pod, and waits for recovery. Does this N times (roughly the loop sketched below).
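For context, the loop amounts to something like this sketch. It is illustrative Python against the kubernetes client, not the actual test code; the namespace, label selector, and value of N are assumptions.

```python
# Sketch of the kill/recover test loop, assuming the kubernetes Python client.
# Namespace, label selector, and N are illustrative, not the real test code.
import time
from kubernetes import client, config

config.load_kube_config()
v1 = client.CoreV1Api()

NAMESPACE = "default"
SELECTOR = "app=couchbase"
CLUSTER_SIZE = 3
N = 10  # number of kill/recover iterations

def ready_pods():
    """Running pods matching the selector with all containers ready."""
    pods = v1.list_namespaced_pod(NAMESPACE, label_selector=SELECTOR).items
    return [p for p in pods
            if p.status.phase == "Running"
            and all(c.ready for c in (p.status.container_statuses or []))]

for _ in range(N):
    # Kill one pod; the Operator should fail it over and scale back up.
    victim = ready_pods()[0]
    v1.delete_namespaced_pod(victim.metadata.name, NAMESPACE)

    # Wait for the pod to actually go down...
    while victim.metadata.name in {p.metadata.name for p in ready_pods()}:
        time.sleep(5)

    # ...then wait until the cluster is back at full strength.
    while len(ready_pods()) < CLUSTER_SIZE:
        time.sleep(5)
```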
What happened
The first pod is killed, the Operator sees it go down, fails it over, and we scale back up to 3 nodes. The same happens for the second pod. On the third attempt, the rebalance of the new node fails, and continues to fail indefinitely: the cluster keeps reporting an unbalanced status.
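The unbalanced status is what the Server itself reports, so it can be polled directly. A minimal sketch of that check, assuming the standard /pools/default endpoint (whose JSON includes a balanced flag) and illustrative host/credentials:

```python
# Sketch of checking the cluster's own balance report via the Couchbase
# REST API. Host and credentials are illustrative; the assumption is that
# /pools/default returns a JSON body containing a "balanced" boolean.
import time
import requests

def is_balanced(host="http://127.0.0.1:8091",
                auth=("Administrator", "password")):
    info = requests.get(f"{host}/pools/default", auth=auth, timeout=10).json()
    return info.get("balanced", False)

# In the failing run this loop never terminates: the rebalance of the
# third replacement node keeps failing and "balanced" stays false.
while not is_balanced():
    time.sleep(5)
```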
Expectation
At the very least, when the cluster reports as balanced, it should be safe to kill a pod and the cluster should recover. As it stands, this is a deadlock situation for the Operator and Couchbase Cloud.