Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Duplicate
Priority: Major
Fix Version/s: 5.0.0
Affects Version/s: 5.0.0
Component/s: couchbase-bucket
Labels:
None

Triage:
Untriaged
Is this a Regression?:
Unknown

Description

3 node Spock cluster built from master 3rd March 2017. Running a heavy pillowfight workload and rebalance to do some profiling of the threads. Noticed that rebalance completely halted, even after killing the client workload, there was no DCP traffic (many successive DCP backoffs) and no disk activity, despite all 4 writer threads being pegged at 100% CPU.
Logs captured while rebalance was running:
https://cb-engineering.s3.amazonaws.com/davidH/collectinfo-2017-03-03T162931-ns_1%40dhaikney-server-1.c.cb-googbench-101.internal.zip
https://cb-engineering.s3.amazonaws.com/davidH/collectinfo-2017-03-03T162931-ns_1%40dhaikney-server-2.c.cb-googbench-101.internal.zip
https://cb-engineering.s3.amazonaws.com/davidH/collectinfo-2017-03-03T162931-ns_1%40dhaikney-server-3.c.cb-googbench-101.internal.zip
Also attached a perf trace of a writer thread at the time.

Couple of questions:
why the (perceived) rebalance hang / why was there no forward progress?
(ii) what were the writer threads doing using 100% CPU whilst there was no write traffic)

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

perf-report
06/Mar/17 5:36 AM
732 kB
David Haikney

Issue Links

duplicates

MB-22451 Rebalance occasionally gets stuck when adding a new node to the destination cluster

Closed

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

Activity

People

Assignee:: David Haikney (Inactive)

Reporter:: David Haikney (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 06/Mar/17 5:34 AM

Updated:: 17/Mar/17 8:07 PM

Resolved:: 10/Mar/17 7:59 PM

Gerrit Reviews

There are no open Gerrit changes

Show There is 1 closed Gerrit change

Hide There is 1 closed Gerrit change

MB-23163: Fix log messsage in ActiveStream: Gerrit Review:

Rebalance deadlock with busy writer threads and no traffic

Details

Description

Attachments

Attachments

Issue Links

Gerrit Reviews

Activity

People

Dates

Gerrit Reviews

PagerDuty