Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Major
Fix Version/s: 2.8.0
Affects Version/s: 2.7.0
Component/s: operator
Labels:
- Kubernetes
- k8s
- kubernetes
- operator
Environment:
Couchbase Version : 7.6.0-2176
Kubernetes Version : v1.30.0
CAO and operator : 2.7.0 built locally
Environment : Kind cluster

Cluster Setup

Created a cluster
On one pod, ran a bash script to kill memcached in a loop
The node fails over in the cluster and delta recovery rebalances continuously fail as expected.
Stopped the memcached kill loop
The rebalance post this fails again and again due to a problem with eventing service.

The couchbase server issues are tracked under - ~~MB-62725~~, MB-62724

The rebalance fails due to timeouts with eventing service continuously in a loop for 2+ hours
When rebalance is failing continuously with the same error, there should be a break point to stop the rebalance loop and operator should not attempt to retry rebalance again and again.

The cao tool and operator images were built locally on this commit

commit 127d1f23932294386bf0375be927758a8dee282c (HEAD -> master, origin/master, origin/HEAD)

Author: usamah jassat <usamah.jassat@couchbase.com>

Date:   Mon Jul 1 18:24:20 2024 +0100    K8S-3417: Allow rescheduling to different AZ

    Change-Id: I4194d211dabd7bb680a61930b5ac4d63ab4996f1

    Reviewed-on: https://review.couchbase.org/c/couchbase-operator/+/212115

    Reviewed-by: Justin Ashworth <justin.ashworth@couchbase.com>

    Tested-by: Build Bot <build@couchbase.com>

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

There are no open Gerrit changes