Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Fixed
Priority: Major
Fix Version/s: 5.1.0, 5.5.0
Affects Version/s: master
Component/s: cbdatasource, fts
Labels:
None

Description

Steps to repro:

Start a 3 node couchbase server cluster using Enterprise Edition 5.0.0 build 3456
- Replicas: 1
- Autofailover: enabled (max timeout: 30s)
Create cbdatasource.NewBucketDataSource() and pass in an array of urls: ["http://host1:8091", "http://host2:8091", "http://host3:8091"]
On host2, run sudo systemctl stop couchbase-server to abruptly stop the node
Add docs to couchbase server via GoCB SDK

It appears that the cbdatasource client (Sync Gateway) is not receiving DCP DataUpdate messages for docs that hash to vbuckets owned by host2, the one that's abruptly killed. The expected behavior is that the cluster map would be refreshed somehow, and it would be noticed that another node is now the active node for those vbuckets, and events for those vbuckets would be received over the DCP feed for that node.

If the cluster is rebalanced to remove host2 from the serverlist, the cbdatasource client still is not receiving the DCP DataUpdate messages.

If host2 is restarted, and then joins the cluster and a rebalance add operation is run, now the cbddatasource client receives the DCP DataUpdate messages for those vbuckets.

Adam Fraser and I spent some time digging through the cbdatasource code to try to understand the mechanism how the cluster map is reloaded in this scenario, and couldn't find it. (but maybe it's there and we just couldn't spot it). Is there anything different about the way we need to call cbdatasource in order to make this work? The Sync Gateway code that interfaces with cbdatasource is in dcp_feed.go

Related Sync Gateway issue: SG Issue 2197

go-couchbase version: 6c44a8829958bfe71283ed9fec2c28d722a3be27

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

couchbase_ui_after_couchbase_node_shutdown.png
128 kB
01/Sep/17 4:59 PM

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

Activity

People

Assignee:: Sridevi Saragadam (Inactive)

Reporter:: Traun Leyden (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 01/Sep/17 4:42 PM

Updated:: 18/Jan/18 9:57 PM

Resolved:: 05/Sep/17 11:48 AM

Gerrit Reviews

There are no open Gerrit changes

Show There is 1 closed Gerrit change

Hide There is 1 closed Gerrit change

MB-25912 - cbdatasource worker that can't connect should kick: Gerrit Review:

Not receiving DCP messages for vbuckets owned by a Couchbase Server node that is abruptly stopped

Details

Description

Attachments

Attachments

Gerrit Reviews

Activity

People

Dates

Gerrit Reviews

PagerDuty