Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Duplicate
Priority: Major
Fix Version/s: 5.5.0
Affects Version/s: 5.5.0
Component/s: couchbase-bucket
Labels:
- performance
Environment:
https://raw.githubusercontent.com/couchbase/perfrunner/master/clusters/titan.spec

Triage:
Untriaged
Operating System:
Centos 64-bit
Link to Log File, atop/blg, CBCollectInfo, Core dump:

Hide
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.100.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.101.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.102.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.103.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.104.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.105.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.106.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.107.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.108.zip
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.109.zip

Show
https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.100.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.101.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.102.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.103.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.104.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.105.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.106.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.107.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.108.zip https://s3-us-west-2.amazonaws.com/perf-artifacts/jenkins-titan-reb-227/172.23.96.109.zip
Is this a Regression?:
Yes

Description

A separate ticket for rebalance failures observed in ~~MB-29217~~ and ~~MB-26791~~.

Test scenario:

9 nodes
1 bucket, 1 replica, full eviction
1B items (~1KB), 5-10% resident ratio
15K ops/sec (90% read, 10% update), 10% cache miss ratio (before rebalance)
Swap rebalance of one node (172.23.96.108 -> 172.23.96.109)

[user:error,2018-05-06T09:34:17.933-07:00,ns_1@172.23.96.100:<0.2270.0>:ns_orchestrator:do_log_rebalance_completion:1122]Rebalance exited with reason {child_died,bad_replicas}

[user:info,2018-05-06T09:34:17.932-07:00,ns_1@172.23.96.100:<0.241.36>:ns_rebalancer:verify_replication:985]Bad replicators after rebalance:

Missing = [{'ns_1@172.23.96.109','ns_1@172.23.96.107',1023}]

Previously:

2018-05-06T08:07:22.818569Z INFO (bucket-1) DCP (Consumer) eq_dcpq:replication:ns_1@172.23.96.109->ns_1@172.23.96.107:bucket-1 - Disconnecting because a message has not been received for 360s. lastMessageTime:361

Attachments

Issue Links

relates to

MB-29217 9 node rebalance is 5-8x slower in vulcan

Closed

MB-26791 Latency of GET operations exceeds minutes during swap rebalance (9 nodes, 1TB)

Closed

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

Activity

People

Assignee:: David Haikney (Inactive)

Reporter:: Pavel Paulau (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 15/May/18 11:44 AM

Updated:: 17/May/18 9:46 AM

Resolved:: 17/May/18 6:59 AM

Gerrit Reviews

There are no open Gerrit changes

Details

Description

Attachments

Issue Links

Gerrit Reviews

Activity

People

Dates

Gerrit Reviews

PagerDuty