Details
- Bug
- Status: Closed
- Critical
- Resolution: Fixed
- Cheshire-Cat
- Enterprise Edition 7.0.0 build 5095
- Untriaged
- Centos 64-bit
- Snapshot: https://supportal.couchbase.com/snapshot/ff8cc1ad0a5b1fdc2c01fdd52cec4165::0
- cbcollect logs:
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.105.175.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.106.233.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.106.236.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.106.238.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.106.250.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.106.251.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.107.43.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.107.44.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.107.45.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.107.58.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.121.74.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-04T181638-ns_1%40172.23.121.78.zip
- 1
- Unknown
Description
Build: 7.0.0-5095
Scenario:
- Cluster with index, eventing, fts, and backup services actively running
- 3 FTS indexes created (2 with indexPartition=1 and 1 index with indexPartition=6)
- Rebalance an extra FTS node into the cluster, along with a few other nodes running other services
Observation:
FTS rebalance failed because the fts worker died:
Rebalance exited with reason {service_rebalance_failed,fts,
                              {worker_died,
                               {'EXIT',<0.12316.37>,
                                {rebalance_failed,inactivity_timeout}}}}.
Rebalance Operation Id = b5ce1e972169e0377441008a937cb498
..
..
Starting rebalance, KeepNodes = ['ns_1@172.23.105.175','ns_1@172.23.106.233',
                                 'ns_1@172.23.106.236','ns_1@172.23.106.238',
                                 'ns_1@172.23.106.250','ns_1@172.23.106.251',
                                 'ns_1@172.23.107.43','ns_1@172.23.107.44',
                                 'ns_1@172.23.107.45','ns_1@172.23.107.58',
                                 'ns_1@172.23.121.74','ns_1@172.23.121.78'], EjectNodes = [], Failed over and being ejected nodes = []; no delta recovery nodes; Operation Id = b5ce1e972169e0377441008a937cb49
Snapshot: https://supportal.couchbase.com/snapshot/ff8cc1ad0a5b1fdc2c01fdd52cec4165::0
Attachments
- cluster_overview.png
- 481 kB
- fts_indexes.png
- 464 kB
Issue Links
Activity
Per the screenshot, 172.23.106.251 and 172.23.107.58 appear to be the nodes hosting FTS in this cluster.
The rebalancer seems to be stuck waiting on a stats sample grab for one of the pindexes, `hotels_b48a9b6f1e7e824d_54820232`, for about 10 minutes (see the sketch after the log excerpt below):
2021-05-04T11:15:25.323-07:00 [INFO] rebalance: waitAssignPIndexDone, awaiting a stats sample grab for pindex hotels_b48a9b6f1e7e824d_54820232, formerPrimaryNode f2a3da64a4d6793b6ca03605e6f601fe
2021-05-04T11:15:26.321-07:00 [INFO] rebalance: waitAssignPIndexDone, awaiting a stats sample grab for pindex hotels_b48a9b6f1e7e824d_54820232, formerPrimaryNode f2a3da64a4d6793b6ca03605e6f601fe
2021-05-04T11:15:26.323-07:00 [INFO] rebalance: waitAssignPIndexDone, awaiting a stats sample grab for pindex hotels_b48a9b6f1e7e824d_54820232, formerPrimaryNode f2a3da64a4d6793b6ca03605e6f601fe
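As a rough illustration of the failure mode, here is a hypothetical Go sketch (not the actual cbgt rebalance code; grabStatsSample and the one-second retry interval are assumptions) of the poll-with-deadline loop the messages above imply: the orchestrator keeps asking for a stats sample, and if none arrives within the inactivity window, the worker gives up and ns_server reports {rebalance_failed,inactivity_timeout}.

```go
// Hypothetical sketch, not the actual cbgt rebalance code.
package sketch

import (
	"errors"
	"log"
	"time"
)

var errInactivityTimeout = errors.New("rebalance_failed: inactivity_timeout")

// grabStatsSample stands in for the call that fetches a current
// stats sample for a pindex from the node hosting it (assumption).
type grabStatsSample func(pindex string) bool

func waitForStatsSample(pindex string, grab grabStatsSample,
	inactivityTimeout time.Duration) error {
	deadline := time.Now().Add(inactivityTimeout)
	for time.Now().Before(deadline) {
		if grab(pindex) {
			return nil // a sample arrived; the pindex is making progress
		}
		log.Printf("rebalance: awaiting a stats sample grab for pindex %s", pindex)
		time.Sleep(time.Second) // retry roughly once a second, as in the logs
	}
	// No sample for the whole window: the rebalance worker gives up,
	// which ns_server surfaces as {rebalance_failed,inactivity_timeout}.
	return errInactivityTimeout
}
```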
This ticket looks quite similar to MB-44725 (which has been marked as "cannot reproduce").
The last commit for MB-44725 was to increase logging in the area ..
commit 82a3dd83e5182a6dc682c17ad661342c68b779d5
Author: Sreekanth Sivasankaran <sreekanth.sivasankaran@couchbase.com>
Date:   Wed Apr 7 13:37:03 2021 +0530

    MB-44725 - FTS rebalance_failed,inactivity_timeout

    Adding more logs and err handling in the
    pindex restart paths.

    Change-Id: Iaff4419685623cd258ccca71d973b92263bce8d7
    Reviewed-on: http://review.couchbase.org/c/cbgt/+/150572
    Well-Formed: Build Bot <build@couchbase.com>
    Reviewed-by: Abhinav Dangeti <abhinav@couchbase.com>
    Tested-by: Sreekanth Sivasankaran <sreekanth.sivasankaran@couchbase.com>
The log messages now do confirm that rollback is indeed restarting the pindexes - so that's good.
2021-05-04T11:05:13.444-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_f4e0a48a
2021-05-04T11:05:13.448-07:00 [INFO] pindex: restartPIndex starts for pindex: routes_46d6c63966e9ce0b_4c1c5584
2021-05-04T11:05:13.448-07:00 [INFO] pindex: restartPIndex starts for pindex: flights_24af63d4d95b77ef_4c1c5584
2021-05-04T11:05:13.483-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_6ddbfb54
2021-05-04T11:05:13.483-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_54820232
2021-05-04T11:05:13.486-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_18572d87
2021-05-04T11:05:17.421-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_54820232
2021-05-04T11:05:17.476-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_6ddbfb54
2021-05-04T11:05:17.853-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_54820232
2021-05-04T11:05:18.209-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_6ddbfb54
2021-05-04T11:05:18.250-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_54820232
2021-05-04T11:05:18.721-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_6ddbfb54
2021-05-04T11:05:18.863-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_54820232
2021-05-04T11:05:19.110-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_6ddbfb54
2021-05-04T11:05:19.246-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_54820232
2021-05-04T11:05:19.367-07:00 [INFO] pindex: restartPIndex starts for pindex: flights_24af63d4d95b77ef_4c1c5584
2021-05-04T11:05:19.370-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_18572d87
2021-05-04T11:05:19.413-07:00 [INFO] pindex: restartPIndex starts for pindex: routes_46d6c63966e9ce0b_4c1c5584
2021-05-04T11:05:19.414-07:00 [INFO] pindex: restartPIndex starts for pindex: routes_46d6c63966e9ce0b_4c1c5584
2021-05-04T11:05:19.610-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_6ddbfb54
2021-05-04T11:05:20.650-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_f4e0a48a
2021-05-04T11:05:20.680-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_54820232
2021-05-04T11:05:21.201-07:00 [INFO] pindex: restartPIndex starts for pindex: flights_24af63d4d95b77ef_4c1c5584
2021-05-04T11:05:21.753-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_6ddbfb54
2021-05-04T11:05:22.277-07:00 [INFO] pindex: restartPIndex starts for pindex: routes_46d6c63966e9ce0b_4c1c5584
As for the rebalancer that was stuck between 2021-05-04T11:05:25.400-07:00 and 2021-05-04T11:15:26.323-07:00 while obtaining the stats sample for pindex `hotels_b48a9b6f1e7e824d_54820232`: Sreekanth Sivasankaran, would you take a look at the logs from node 172.23.106.251?
Taking a closer look at the erring partition hotels_b48a9b6f1e7e824d_54820232 at timestamp 2021-05-04T11:05:20:
the partition was rolled back and closed, and all subsequent rollback requests were failing with the "bleve: BleveDest already closed" error, as seen in the logs below (a sketch of this behavior follows the log excerpt).
2021-05-04T11:05:20.679-07:00 [INFO] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_54820232] Received rollback, for vb: 442, seqno requested: 583
2021-05-04T11:05:20.679-07:00 [INFO] bleve: dest rollback, partition: 442, key: "<ud></ud>", seq: 0, err: <nil>
2021-05-04T11:05:20.679-07:00 [INFO] pindex_bleve: batchWorker stopped for `hotels_b48a9b6f1e7e824d_54820232`
2021-05-04T11:05:20.679-07:00 [INFO] pindex_bleve: batchWorker stopped for `hotels_b48a9b6f1e7e824d_54820232`
2021-05-04T11:05:20.679-07:00 [INFO] pindex_bleve: batchWorker stopped for `hotels_b48a9b6f1e7e824d_54820232`
2021-05-04T11:05:20.679-07:00 [INFO] pindex_bleve: batchWorker stopped for `hotels_b48a9b6f1e7e824d_54820232`

2021-05-04T11:05:20.679-07:00 [INFO] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_f4e0a48a] Received rollback, for vb: 233, seqno requested: 597
2021-05-04T11:05:20.679-07:00 [INFO] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_54820232] Received rollback, for vb: 443, seqno requested: 658
2021-05-04T11:05:20.679-07:00 [INFO] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_f4e0a48a] Received rollback, for vb: 230, seqno requested: 607

2021-05-04T11:05:20.679-07:00 [INFO] pindex_bleve_scorch_rollback: path: /opt/couchbase/var/lib/couchbase/data/@fts/hotels_b48a9b6f1e7e824d_54820232.pindex, totSnapshotExamined: 0
2021-05-04T11:05:20.679-07:00 [INFO] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_f4e0a48a] Received rollback, for vb: 232, seqno requested: 571
2021-05-04T11:05:20.679-07:00 [INFO] pindex_bleve_rollback: path: /opt/couchbase/var/lib/couchbase/data/@fts/hotels_b48a9b6f1e7e824d_54820232.pindex, wasClosed: true, wasPartial: false, err: <nil>

2021-05-04T11:05:20.679-07:00 [INFO] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_f4e0a48a] Received rollback, for vb: 225, seqno requested: 535
2021-05-04T11:05:20.680-07:00 [INFO] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_f4e0a48a] Received rollback, for vb: 229, seqno requested: 256
2021-05-04T11:05:20.680-07:00 [INFO] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_54820232] Received rollback, for vb: 441, seqno requested: 317
2021-05-04T11:05:20.680-07:00 [WARN] feed_dcp_gocbcore: [hotels_b48a9b6f1e7e824d_f4e0a48a] Rollback to seqno: 0, vbuuid: 0 for vb: 233, failed with err: bleve: BleveDest already closed -- cbgt.(*GocbcoreDCPFeed).rollback() at feed_dcp_gocbcore.go:1524
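For clarity, here is a simplified, hypothetical sketch (not the actual cbft BleveDest implementation) of a destination guarded by a closed flag: once Close() has run, every later Rollback() returns the "already closed" error seen in the warnings above, until the pindex is restarted with a fresh destination.

```go
// Hypothetical, simplified sketch; not the actual cbft BleveDest code.
package sketch

import (
	"errors"
	"sync"
)

var errAlreadyClosed = errors.New("bleve: BleveDest already closed")

type dest struct {
	m      sync.Mutex
	closed bool
}

// Close marks the destination unusable; in the scenario above this
// happens when the partition is rolled back and torn down.
func (d *dest) Close() {
	d.m.Lock()
	defer d.m.Unlock()
	d.closed = true
}

// Rollback refuses to do any work once the destination is closed;
// only a successful pindex restart creates a usable destination again.
func (d *dest) Rollback(vb uint16, toSeq uint64) error {
	d.m.Lock()
	defer d.m.Unlock()
	if d.closed {
		return errAlreadyClosed
	}
	// ... roll the partition's index data back to toSeq ...
	return nil
}
```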
And it subsequently tries to restart the partition.
2021-05-04T11:05:20.680-07:00 [INFO] pindex: restartPIndex starts for pindex: hotels_b48a9b6f1e7e824d_54820232
But the restart request for this partition never made it into the janitor work loop, as the janitor was busy processing other ongoing requests. This is confirmed by the logs as well.
The last restart attempt that successfully reached the janitor was at:
2021-05-04T11:05:20.633-07:00 [INFO] janitor: awakes, op: janitor_close_pindex, msg: api-ClosePIndex:hotels_b48a9b6f1e7e824d_6ddbfb54
2021-05-04T11:05:20.678-07:00 [INFO] janitor: awakes, op: kick, msg: restart-pindex
2021-05-04T11:05:21.500-07:00 [INFO] janitor: awakes, op: kick, msg: restart-pindex
So the bleve partition is closed, but the pindex is still registered with the manager due to the unsuccessful restart attempt.
And the situation persists until the rebalance, which starts at:
2021-05-04T11:05:25.284-07:00 [INFO] ctl/manager: StartTopologyChange, started
This causes the rebalance stats monitoring endpoint to fail later, and eventually the rebalance failure as well.
Checking further ..
These sorts of issues ought to self-heal once the janitor advances through its task list. But the janitor work loop is deadlocked here, as we don't see any further kick wake-up messages in the logs.
The last janitor kick processed was at:
2021-05-04T11:05:21.500-07:00 [INFO] janitor: awakes, op: kick, msg: restart-pindex
2021-05-04T11:05:21.583-07:00 [INFO] janitor: pindexes to remove: 0
2021-05-04T11:05:21.583-07:00 [INFO] janitor: pindexes to add: 0
2021-05-04T11:05:21.583-07:00 [INFO] janitor: pindexes to restart: 0
2021-05-04T11:05:21.583-07:00 [INFO] janitor: feeds to remove: 0
2021-05-04T11:05:21.583-07:00 [INFO] janitor: feeds to add: 1
2021-05-04T11:05:21.583-07:00 [INFO] flights_24af63d4d95b77ef_4c1c5584
A deadlocked janitor was blocking all further progress here.
My analysis is that the deadlock recently introduced with the fix for MB-45989 is causing this; see the sketch below for the general pattern.
Let me merge the latest fix and get this retested before more tickets pile up.
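For reference, a minimal hypothetical sketch of the self-kick deadlock pattern in question (the channel and method names here are assumptions, not cbgt's actual ones):

```go
// Hypothetical illustration, not the actual cbgt janitor: a single
// work-loop goroutine that services kicks from an unbuffered channel
// must never send ("kick") into that same channel from inside the
// loop. The send can only complete when the loop reads again, but the
// loop is the goroutine doing the send, so it blocks forever and
// every later kick queues up behind it.
package sketch

type janitor struct {
	kickCh chan string // unbuffered in this sketch
}

func (j *janitor) loop() {
	for msg := range j.kickCh {
		j.handle(msg)
	}
}

func (j *janitor) handle(msg string) {
	// ... compare planned vs. current pindexes/feeds, add/remove/restart ...

	// BUG (self-deadlock): kicking ourselves from within the loop.
	j.kickCh <- "restart-pindex"
}
```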
Thanks Sreekanth..
Just an update: able to hit the same issue while rebalancing out the FTS node.
+----------------+-------------+-----------------------+---------------+--------------+
| Nodes          | Services    | Version               | CPU           | Status       |
+----------------+-------------+-----------------------+---------------+--------------+
| 172.23.107.58  | eventing    | 7.0.0-5095-enterprise | 68.0274805126 | Cluster node |
| 172.23.107.44  | fts         | 7.0.0-5095-enterprise | 2.35664776307 | Cluster node |
| 172.23.105.175 | kv          | 7.0.0-5095-enterprise | 83.7956961369 | Cluster node |
| 172.23.107.45  | fts         | 7.0.0-5095-enterprise | 2.54600453743 | --- OUT ---> |
| 172.23.107.47  | kv          | 7.0.0-5095-enterprise | 84.332764729  | Cluster node |
| 172.23.107.84  | eventing    | 7.0.0-5095-enterprise | 59.6821976838 | Cluster node |
| 172.23.121.78  | cbas        | 7.0.0-5095-enterprise | 35.1969676459 | Cluster node |
| 172.23.106.250 | index, n1ql | 7.0.0-5095-enterprise | 18.1203515889 | Cluster node |
| 172.23.107.88  | index, n1ql | 7.0.0-5095-enterprise | 3.36781029263 | Cluster node |
| 172.23.107.43  | eventing    | 7.0.0-5095-enterprise | 63.9989329065 | Cluster node |
| 172.23.106.236 | kv          | 7.0.0-5095-enterprise | 86.4183158712 | Cluster node |
| 172.23.107.85  | index, n1ql | 7.0.0-5095-enterprise | 26.646389844  | Cluster node |
| 172.23.106.251 | backup      | 7.0.0-5095-enterprise | 2.22697533971 | Cluster node |
| 172.23.106.233 | kv          | 7.0.0-5095-enterprise | 87.605560608  | Cluster node |
| 172.23.106.238 | index, n1ql | 7.0.0-5095-enterprise | 3.05016385178 | Cluster node |
| 172.23.107.54  | kv          | 7.0.0-5095-enterprise | 85.4177545692 | Cluster node |
| 172.23.107.78  | cbas        | 7.0.0-5095-enterprise | 33.6209220814 | Cluster node |
| 172.23.121.74  | cbas        | 7.0.0-5095-enterprise | 38.1596571122 | Cluster node |
| 172.23.107.107 | fts         | 7.0.0-5095-enterprise | 2.31650509883 | Cluster node |
| 172.23.107.91  | kv          | 7.0.0-5095-enterprise | 73.7971478075 | Cluster node |
+----------------+-------------+-----------------------+---------------+--------------+
Snapshot: http://supportal.couchbase.com/snapshot/73a3f4bbca65dfdfd545c1c60852951b::0
cbcollect logs:
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.105.175.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.106.233.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.106.236.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.106.238.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.106.250.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.106.251.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.107.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.43.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.44.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.45.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.47.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.54.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.58.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.78.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.84.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.85.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.88.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.107.91.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.121.74.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_failed/collectinfo-2021-05-05T073833-ns_1%40172.23.121.78.zip
Builds 7.0.0-5103+ have the fix for MB-45989 ..
Build couchbase-server-7.0.0-5103 contains cbgt commit a8cb699 with commit message:
MB-45989: Kicking the janitor from within the JanitorLoop is prohibited
Still seeing the issue on 7.0.0-5103 during rebalance-out of FTS node (172.23.107.91)
+----------------+-------------+-----------------------+---------------+--------------+
| Nodes          | Services    | Version               | CPU           | Status       |
+----------------+-------------+-----------------------+---------------+--------------+
| 172.23.107.58  | eventing    | 7.0.0-5103-enterprise | 66.2251655629 | Cluster node |
| 172.23.107.44  | eventing    | 7.0.0-5103-enterprise | 67.2144182348 | Cluster node |
| 172.23.105.175 | kv          | 7.0.0-5103-enterprise | 85.5087627518 | Cluster node |
| 172.23.107.45  | index, n1ql | 7.0.0-5103-enterprise | 2.42401406682 | Cluster node |
| 172.23.107.47  | index, n1ql | 7.0.0-5103-enterprise | 23.6120312707 | Cluster node |
| 172.23.107.84  | kv          | 7.0.0-5103-enterprise | 83.7305699482 | Cluster node |
| 172.23.121.78  | index, n1ql | 7.0.0-5103-enterprise | 20.5553413035 | Cluster node |
| 172.23.106.250 | cbas        | 7.0.0-5103-enterprise | 33.8083075682 | Cluster node |
| 172.23.107.88  | kv          | 7.0.0-5103-enterprise | 82.7855008445 | Cluster node |
| 172.23.107.43  | cbas        | 7.0.0-5103-enterprise | 33.1234088168 | Cluster node |
| 172.23.106.236 | kv          | 7.0.0-5103-enterprise | 86.6442345024 | Cluster node |
| 172.23.107.85  | fts         | 7.0.0-5103-enterprise | 2.82627810577 | Cluster node |
| 172.23.106.251 | backup      | 7.0.0-5103-enterprise | 2.52006018054 | Cluster node |
| 172.23.106.233 | kv          | 7.0.0-5103-enterprise | 85.7810673938 | Cluster node |
| 172.23.106.238 | cbas        | 7.0.0-5103-enterprise | 40.3115348462 | Cluster node |
| 172.23.107.54  | index, n1ql | 7.0.0-5103-enterprise | 2.32499685811 | Cluster node |
| 172.23.107.78  | kv          | 7.0.0-5103-enterprise | 85.9544749095 | Cluster node |
| 172.23.121.74  | eventing    | 7.0.0-5103-enterprise | 70.3625576796 | Cluster node |
| 172.23.107.107 | fts         | 7.0.0-5103-enterprise | 3.03916581892 | Cluster node |
| 172.23.107.91  | fts         | 7.0.0-5103-enterprise | 16.6752114842 | --- OUT ---> |
+----------------+-------------+-----------------------+---------------+--------------+
Rebalance exited with reason {service_rebalance_failed,fts,
                              {worker_died, {'EXIT',<0.25658.83>, {rebalance_failed,inactivity_timeout}}}}.
Rebalance Operation Id = 36814e67c68dbb5851868791a2c40c4f
Snapshot: http://supportal.couchbase.com/snapshot/b8190367e0e24aa99eff3e294d23a2be::0
Cbcollect logs:
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.106.233.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.106.236.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.106.238.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.106.250.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.106.251.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.107.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.43.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.44.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.45.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.47.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.54.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.58.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.78.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.84.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.85.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.88.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.107.91.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.121.74.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.121.78.zip
https://cb-engineering.s3.amazonaws.com/fts_rebalance_stuck_5103/collectinfo-2021-05-05T140951-ns_1%40172.23.105.175.zip
Abhinav Dangeti, the notify-manager call is making the feed establishment fall into a recursive infinite loop here. We need to revisit the manager-kick fix (reverting would also be an option, else it may trigger a lot of other issues).
The FTS nodes are 85, 91 and 107, and the error is seen on 91 (it's the same rollback followed by the feed-erring loop); see the sketch after the log excerpt below.
2021-05-05T05:54:06.285-07:00 [WARN] feed_dcp_gocbcore: [a1_users_377bff4da3dcab77_4c1c5584] Rollback to seqno: 0, vbuuid: 0 for vb: 1023, failed with err: bleve: BleveDest already closed -- cbgt.(*GocbcoreDCPFeed).rollback() at feed_dcp_gocbcore.go:1533
2021-05-05T05:54:06.286-07:00 [INFO] feed_dcp_gocbcore: releaseAgent, ref count decremented for DCPagent (key: travel-sample:969c10b537adcc0a20800cf3d619a7c0, agent: 0xc008b7f500, ref count: 3, number of agents for key: 1)
2021-05-05T05:54:06.286-07:00 [INFO] feed_dcp_gocbcore: Close, name: a4_users_562107192a6809c6_4c1c5584, notify manager
2021-05-05T05:54:06.289-07:00 [INFO] feed_dcp_gocbcore: releaseAgent, ref count decremented for DCPagent (key: travel-sample:969c10b537adcc0a20800cf3d619a7c0, agent: 0xc008b7f500, ref count: 2, number of agents for key: 1)
2021-05-05T05:54:06.289-07:00 [INFO] feed_dcp_gocbcore: Close, name: a1_users_377bff4da3dcab77_4c1c5584, notify manager
2021-05-05T05:54:06.320-07:00 [INFO] janitor: pindexes to remove: 0
2021-05-05T05:54:06.320-07:00 [INFO] janitor: pindexes to add: 0
2021-05-05T05:54:06.320-07:00 [INFO] janitor: pindexes to restart: 0
2021-05-05T05:54:06.320-07:00 [INFO] janitor: feeds to remove: 0
2021-05-05T05:54:06.320-07:00 [INFO] janitor: feeds to add: 2
2021-05-05T05:54:06.320-07:00 [INFO] a1_users_377bff4da3dcab77_4c1c5584
2021-05-05T05:54:06.320-07:00 [INFO] a4_users_562107192a6809c6_4c1c5584
2021-05-05T05:54:06.321-07:00 [WARN] janitor: JanitorOnce, err: janitor: JanitorOnce errors: 2, []string{"#0: janitor: adding feed, err: feed_dcp_gocbcore: StartGocbcoreDCPFeed, could not prepare DCP feed, server: http://127.0.0.1:8091, bucketName: travel-sample, indexName: a1_users, err: bleve: BleveDest already closed", "#1: janitor: adding feed, err: feed_dcp_gocbcore: StartGocbcoreDCPFeed, could not prepare DCP feed, server: http://127.0.0.1:8091, bucketName: travel-sample, indexName: a4_users, err: bleve: BleveDest already closed"} -- cbgt.(*Manager).JanitorLoop() at manager_janitor.go:97
2021-05-05T05:54:06.321-07:00 [INFO] janitor: awakes, op: kick, msg: gocbcore-feed
2021-05-05T05:54:06.332-07:00 [INFO] pindex: restartPIndex starts for pindex: a2_users_4389e9ba8dc93303_4c1c5584
2021-05-05T05:54:06.332-07:00 [WARN] feed_dcp_gocbcore: [a2_users_4389e9ba8dc93303_4c1c5584] Rollback to seqno: 0, vbuuid: 0 for vb: 1023, failed with err: bleve: BleveDest already closed -- cbgt.(*GocbcoreDCPFeed).rollback() at feed_dcp_gocbcore.go:1533
2021-05-05T05:54:06.402-07:00 [INFO] feed_dcp_gocbcore: [a2_users_4389e9ba8dc93303_4c1c5584] DCP stream [118] for vb: 257, closed due to `state changed`, reconnecting ...
2021-05-05T05:54:06.402-07:00 [INFO] feed_dcp_gocbcore: [routes_6fc715a37b5980d2_4c1c5584] DCP stream [116] for vb: 257, closed due to `state changed`, reconnecting ...
2021-05-05T05:54:06.410-07:00 [INFO] feed_dcp_gocbcore: [a2_users_4389e9ba8dc93303_4c1c5584] DCP stream [118] for vb: 339, closed due to `state changed`, reconnecting ...
2021-05-05T05:54:06.410-07:00 [INFO] feed_dcp_gocbcore: [routes_6fc715a37b5980d2_4c1c5584] DCP stream [116] for vb: 339, closed due to `state changed`, reconnecting ...
2021-05-05T05:54:06.410-07:00 [INFO] feed_dcp_gocbcore: [routes_6fc715a37b5980d2_4c1c5584] Received rollback, for vb: 257, seqno requested: 222457
2021-05-05T05:54:06.410-07:00 [WARN] feed_dcp_gocbcore: [routes_6fc715a37b5980d2_4c1c5584] Rollback to seqno: 0, vbuuid: 0 for vb: 257, failed with err: bleve: BleveDest already closed -- cbgt.(*GocbcoreDCPFeed).rollback() at feed_dcp_gocbcore.go:1533
2021-05-05T05:54:06.414-07:00 [INFO] feed_dcp_gocbcore: [a2_users_4389e9ba8dc93303_4c1c5584] Received rollback, for vb: 257, seqno requested: 234028
2021-05-05T05:54:06.414-07:00 [WARN] feed_dcp_gocbcore: [a2_users_4389e9ba8dc93303_4c1c5584] Rollback to seqno: 0, vbuuid: 0 for vb: 257, failed with err: bleve: BleveDest already closed -- cbgt.(*GocbcoreDCPFeed).rollback() at feed_dcp_gocbcore.go:1533
2021-05-05T05:54:06.438-07:00 [INFO] janitor: pindexes to remove: 0
2021-05-05T05:54:06.438-07:00 [INFO] janitor: pindexes to add: 0
2021-05-05T05:54:06.438-07:00 [INFO] janitor: pindexes to restart: 0
2021-05-05T05:54:06.438-07:00 [INFO] janitor: feeds to remove: 0
2021-05-05T05:54:06.438-07:00 [INFO] janitor: feeds to add: 2
2021-05-05T05:54:06.438-07:00 [INFO] a4_users_562107192a6809c6_4c1c5584
2021-05-05T05:54:06.438-07:00 [INFO] a1_users_377bff4da3dcab77_4c1c5584
2021-05-05T05:54:06.439-07:00 [WARN] janitor: JanitorOnce, err: janitor: JanitorOnce errors: 2, []string{"#0: janitor: adding feed, err: feed_dcp_gocbcore: StartGocbcoreDCPFeed, could not prepare DCP feed, server: http://127.0.0.1:8091, bucketName: travel-sample, indexName: a4_users, err: bleve: BleveDest already closed", "#1: janitor: adding feed, err: feed_dcp_gocbcore: StartGocbcoreDCPFeed, could not prepare DCP feed, server: http://127.0.0.1:8091, bucketName: travel-sample, indexName: a1_users, err: bleve: BleveDest already closed"} -- cbgt.(*Manager).JanitorLoop() at manager_janitor.go:97
2021-05-05T05:54:06.439-07:00 [INFO] janitor: awakes, op: kick, msg: gocbcore-feed
2021-05-05T05:54:06.486-07:00 [INFO] feed_dcp_gocbcore: [routes_6fc715a37b5980d2_4c1c5584] Received rollback, for vb: 339, seqno requested: 201006
2021-05-05T05:54:06.486-07:00 [WARN] feed_dcp_gocbcore: [routes_6fc715a37b5980d2_4c1c5584] Rollback to seqno: 0, vbuuid: 0 for vb: 339, failed with err: bleve: BleveDest already closed -- cbgt.(*GocbcoreDCPFeed).rollback() at feed_dcp_gocbcore.go:1533
2021-05-05T05:54:06.487-07:00 [INFO] feed_dcp_gocbcore: [a2_users_4389e9ba8dc93303_4c1c5584] Received rollback, for vb: 339, seqno requested: 211545
2021-05-05T05:54:06.487-07:00 [WARN] feed_dcp_gocbcore: [a2_users_4389e9ba8dc93303_4c1c5584] Rollback to seqno: 0, vbuuid: 0 for vb: 339, failed with err: bleve: BleveDest already closed -- cbgt.(*GocbcoreDCPFeed).rollback() at feed_dcp_gocbcore.go:1533
2021-05-05T05:54:06.544-07:00 [INFO] feed_dcp_gocbcore: [a2_users_4389e9ba8dc93303_4c1c5584] DCP stream [118] for vb: 1021, closed due to `state changed`, reconnecting ...
2021-05-05T05:54:06.544-07:00 [INFO] feed_dcp_gocbcore: [routes_6fc715a37b5980d2_4c1c5584] DCP stream [116] for vb: 1021, closed due to `state changed`, reconnecting ...
2021-05-05T05:54:06.546-07:00 [INFO] janitor: pindexes to remove: 0
2021-05-05T05:54:06.546-07:00 [INFO] janitor: pindexes to add: 0
2021-05-05T05:54:06.546-07:00 [INFO] janitor: pindexes to restart: 0
2021-05-05T05:54:06.547-07:00 [INFO] janitor: feeds to remove: 0
2021-05-05T05:54:06.547-07:00 [INFO] janitor: feeds to add: 2
2021-05-05T05:54:06.547-07:00 [INFO] a4_users_562107192a6809c6_4c1c5584
2021-05-05T05:54:06.547-07:00 [INFO] a1_users_377bff4da3dcab77_4c1c5584
2021-05-05T05:54:06.548-07:00 [WARN] janitor: JanitorOnce, err: janitor: Janit
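To make the loop easier to see, here is a hypothetical, self-contained sketch of the feedback cycle described above (the function and channel names are assumptions, not the actual cbgt code): every feed-setup failure notifies the manager, the resulting kick retries the same setup against a destination that stays closed until the pindex restart completes, and the janitor keeps cycling through add-feed, fail, notify, kick.

```go
// Hypothetical sketch of the feedback loop; not the actual cbgt code.
package main

import (
	"errors"
	"fmt"
)

var errDestClosed = errors.New("bleve: BleveDest already closed")

// startFeed stands in for the feed-setup step; it keeps failing
// while the pindex's destination is closed awaiting a restart.
func startFeed(indexName string) error {
	return errDestClosed
}

func main() {
	kick := make(chan string, 1)
	kick <- "gocbcore-feed"
	for i := 0; i < 5; i++ { // capped only so this sketch terminates
		msg := <-kick
		fmt.Println("janitor: awakes, op: kick, msg:", msg)
		if err := startFeed("a1_users"); err != nil {
			fmt.Println("janitor: adding feed, err:", err)
			kick <- "gocbcore-feed" // unconditional notify re-queues the same work
		}
	}
}
```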
Sreekanth Sivasankaran, accurate findings on this issue.
I've uploaded a couple of changes for this:
- http://review.couchbase.org/c/cbgt/+/152939 - improves logging in the area
- http://review.couchbase.org/c/cbgt/+/152942 - conditionally notify the manager when feed initiation fails
The idea is to not notify the manager always, but only on an agent setup failure, which is essentially the only situation where feed initiation should be re-attempted. This fix will ensure that MB-45989 doesn't occur as well.
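Roughly, the gist of the change (a hedged sketch with hypothetical names, not the actual cbgt API):

```go
// Hedged sketch; names are hypothetical, not the actual cbgt API.
package sketch

import "errors"

type agentSetupError struct{ err error }

func (e *agentSetupError) Error() string { return "agent setup: " + e.err.Error() }

type manager interface {
	Kick(reason string)
}

func startDCPFeed(mgr manager, setup func() error) error {
	if err := setup(); err != nil {
		var ae *agentSetupError
		if errors.As(err, &ae) {
			// Transient agent problem: worth a retry via the manager.
			mgr.Kick("gocbcore-feed")
		}
		// Anything else (e.g. a destination closed while awaiting a
		// pindex restart): report it without re-kicking, and let the
		// restart path recover.
		return err
	}
	return nil
}
```
With this shape, a "BleveDest already closed" failure no longer re-kicks the janitor, so the restart path can catch up instead of racing an endless stream of feed-setup retries.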
Hoping the above fixes will address this issue. A build should become available soon with them.
Build couchbase-server-7.0.0-5109 contains cbgt commit 52db00a with commit message:
MB-46112: Notify manager only on agent error on feed initiation
Build couchbase-server-7.0.0-5109 contains cbgt commit fa82fd7 with commit message:
MB-46112: Log feed name when StartGocbcoreDCPFeed fails
Not seeing this issue on the latest run using 7.0.0-5112-enterprise.
The test has run for ~8 hrs so far (and will continue to run for some more time).
Closing this ticket.
Ashwin Govindarajulu, in tests conducted on multiple nodes, it's necessary that you point out which nodes host the service of interest!