[BP 7.2.1] - XDCR - repeated spec del on non-KV node will hang because handler is not running

Description

Issue

Resolution

When a replication spec change was made to a non-Data Service node, delete replication hung and caused the node to return an incorrect replication configuration.

XDCR now checks that the node is running the Data Service and handles it correctly.

 

  1. Create a 2-node source cluster, 1-node target cluster. 1 source node is KV and 1 source node is back up (anything but KV)

  2. Create replication, delete replication, repeat 6x, which will fill up the channel.

  3. Capture go-routine of the non-KV source node, and we can see the lock contention:

Components

Fix versions

Labels

Environment

None

Link to Log File, atop/blg, CBCollectInfo, Core dump

None

Release Notes Description

None

Activity

Show:

Ayush Nayyar July 27, 2023 at 6:58 AM

Verified on 7.2.1-5861.

CB robot June 15, 2023 at 9:29 PM

Build couchbase-server-7.2.1-5796 contains goxdcr commit b340360 with commit message:
: backfillMgr to not hang on spec callback for non-KV nodes

Fixed
Pinned fields
Click on the next to a field label to start pinning.

Details

Assignee

Reporter

Is this a Regression?

Yes

Triage

Untriaged

Story Points

Priority

Instabug

Open Instabug

PagerDuty

Sentry

Zendesk Support

Created June 2, 2023 at 5:40 PM
Updated February 6, 2025 at 6:43 PM
Resolved June 15, 2023 at 9:17 PM
Instabug