  Couchbase Server
  MB-42780

[Upgrade] Rebalance_in failed with reason "bulk_set_vbucket_state_failed :: sync_shutdown_many_i_am_trapping_exits"


Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version: 6.6.0
    • Fix Version: 6.6.1
    • Component: couchbase-bucket
    • Environment: Enterprise Edition 6.0.1 build 2037 (existing 4-node cluster);
      Enterprise Edition 6.6.1 build 9182 (new node coming in)

    Description

       Scenario:

      1. 4 Node cluster (6.0.1 build 2037)
      2. Create a couchbase bucket with replica=1, size=100M
      3. Load the bucket into DGM using cbworkloadgen:

        ./cbworkloadgen -n 172.23.105.155:8091 -b default -u Administrator -p password --max-items=1200000 -r .95 -l

      4. Rebalance-in a 6.6.1-9182 node (172.23.105.244) into the cluster

      Observation:

      Rebalance failed with the following reason:

      Worker <0.3486.0> (for action {move,{577,
      ['ns_1@172.23.105.212',
      'ns_1@172.23.105.155'],
      ['ns_1@172.23.105.155',
      'ns_1@172.23.105.244'],
      []}}) exited with reason {unexpected_exit,
      {'EXIT', <0.3512.0>,
      {{bulk_set_vbucket_state_failed,
      [{'ns_1@172.23.105.244',
      {'EXIT',
      {{{{{badmatch,
      [{<0.3659.0>,
      {done, exit, {socket_closed,
      {gen_server, call, [<0.2333.0>,
      {setup_streams, [83, 339]}, infinity]}},
      [{gen_server, call, 3,
      [{file, "gen_server.erl"}, {line, 214}]},
      {dcp_replicator, '-spawn_and_wait/1-fun-0-', 1,
      [{file, "src/dcp_replicator.erl"}, {line, 243}]}]}}]},
      [{misc, sync_shutdown_many_i_am_trapping_exits, 1, 
      [{file, "src/misc.erl"}, {line, 1374}]},
      {dcp_replicator, spawn_and_wait, 1,
      [{file, "src/dcp_replicator.erl"}, {line, 265}]},
      {dcp_replicator, handle_call, 3,
      [{file, "src/dcp_replicator.erl"}, {line, 121}]},
      {gen_server, try_handle_call, 4,
      [{file, "gen_server.erl"}, {line, 636}]},
      {gen_server, handle_msg, 6,
      [{file, "gen_server.erl"}, {line, 665}]},
      {proc_lib, init_p_do_apply, 3,
      [{file, "proc_lib.erl"}, {line, 247}]}]},
      {gen_server, call,
      [<0.2332.0>, get_partitions, infinity]}},
      {gen_server, call,
      ['dcp_replication_manager-default',
      {get_replicator_pid, 83}, infinity]}},
      {gen_server, call,
      [{'janitor_agent-default', 'ns_1@172.23.105.244'},
      {if_rebalance, <0.2246.0>,
      {update_vbucket_state, 577, replica,
      undefined, 'ns_1@172.23.105.212'}}, infinity]}}}}]},
      [{janitor_agent, bulk_set_vbucket_state, 4,
      [{file, "src/janitor_agent.erl"}, {line, 403}]},
      {proc_lib, init_p,3,
      [{file, "proc_lib.erl"}, {line, 232}]}]}}} 

      Note: Hit this while trying to validate MB-41283

      CC Richard deMellow

       

      Attachments

        Issue Links


          Activity

            Paolo Cocchi added a comment - edited

            ns_server.info.log:

            [ns_server:info,2020-11-16T22:39:20.252-08:00,ns_1@172.23.105.155:janitor_agent-default<0.14362.0>:janitor_agent:handle_info:801]Rebalancer <13742.2246.0> died with reason {unexpected_
            exit,
                                                        {'EXIT',<13742.3512.0>,
                                                         {{bulk_set_vbucket_state_failed,
                                                           [{'ns_1@172.23.105.244',
                                                             {'EXIT',
                                                              {{{{{badmatch,
                                                                   [{<13742.3659.0>,
                                                                     {done,exit,
                                                                      {socket_closed,
            

            memcached error on node .244 (the only 6.6.1 node), with the replica connection being closed:

            2020-11-16T22:39:19.828037-08:00 INFO 53: (default) DCP (Consumer) eq_dcpq:replication:ns_1@172.23.105.155->ns_1@172.23.105.244:default - (vb:83) Attempting to add stream: opaque_:19, start_seqno_:0, end_seqno_:18446744073709551615, vb_uuid:267978355066003, snap_start_seqno_:0, snap_end_seqno_:0, last_seqno:0, stream_req_value:{"uid":"0"}
            ..
            2020-11-16T22:39:19.858135-08:00 ERROR 53: exception occurred in runloop during packet execution. Cookie info: [{"aiostat":"success","connection":"[ {\"ip\":\"127.0.0.1\",\"port\":43312} - {\"ip\":\"127.0.0.1\",\"port\":11209} (<ud>@ns_server</ud>) ]","engine_storage":"0x00007f4283feac10","ewouldblock":false,"packet":{"bodylen":52,"cas":1605595159858380800,"datatype":"raw","extlen":31,"key":"<ud>pymc1033629</ud>","keylen":11,"magic":"ClientRequest","opaque":19,"opcode":"DCP_MUTATION","vbucket":83},"refcount":1}] - closing connection ([ {"ip":"127.0.0.1","port":43312} - {"ip":"127.0.0.1","port":11209} (<ud>@ns_server</ud>) ]): Checkpoint::queueDirty: Unable to find key in keyIndex with op:mutation seqno:11190for cursor:persistence in current checkpoint.
            

            The producer is node .155 (6.0.1), which is sending a disk snapshot:

            2020-11-16T22:39:19.831717-08:00 INFO 59: (default) DCP (Producer) eq_dcpq:replication:ns_1@172.23.105.155->ns_1@172.23.105.244:default - (vb 83) Creating stream with start seqno 0 and end seqno 18446744073709551615; requested end seqno was 18446744073709551615
            2020-11-16T22:39:19.831763-08:00 INFO 59: (default) DCP (Producer) eq_dcpq:replication:ns_1@172.23.105.155->ns_1@172.23.105.244:default - (vb 83) Scheduling backfill from 1 to 12024, reschedule flag : False
            2020-11-16T22:39:19.832021-08:00 INFO 59: (default) DCP (Producer) eq_dcpq:replication:ns_1@172.23.105.155->ns_1@172.23.105.244:default - (vb 83) Sending disk snapshot with start seqno 0 and end seqno 12106
            

            Paolo Cocchi added a comment - edited

            I am keeping an eye on the motivation behind MB-41283. As per the fix description at http://review.couchbase.org/c/kv_engine/+/135630, the issue involves checkpoint marker flags. It is worth highlighting that between 6.0.x and 6.5.x we fixed/changed something in the area of marker flags sent from active to replica, which may relate to this issue.

            Paolo Cocchi added a comment - edited

            We fail in:

            int64_t Checkpoint::getMutationId(const CheckpointCursor& cursor) const {
            ..
                auto& keyIndex = (*cursor.currentPos)->isCommitted() ? committedKeyIndex
                                                                     : preparedKeyIndex;
                auto cursor_item_idx = keyIndex.find(makeIndexKey(*cursor.currentPos));
                if (cursor_item_idx == keyIndex.end()) {
                    throw std::logic_error(
                            "Checkpoint::queueDirty: Unable "
                            "to find key in keyIndex with op:" +
                            to_string((*cursor.currentPos)->getOperation()) +
                            " seqno:" + std::to_string((*cursor.currentPos)->getBySeqno()) +
                            " for cursor:" + cursor.name + " in current checkpoint.");
                }
                return cursor_item_idx->second.mutation_id;
            }
            

            Called by:

            QueueDirtyStatus Checkpoint::queueDirty(const queued_item& qi,
                                                    CheckpointManager* checkpointManager) {
            ..
                if (qi->isCheckPointMetaItem()) {
                    ..
                } else {
                    // Check in the appropriate key index if an item already exists.
                    auto& keyIndex =
                            qi->isCommitted() ? committedKeyIndex : preparedKeyIndex;
                    auto it = keyIndex.find(makeIndexKey(qi));
                    ..
                    // Check if this checkpoint already has an item for the same key
                    // and the item has not been expelled.
                    if (it != keyIndex.end()) {
                        if (it->second.mutation_id > highestExpelledSeqno) {
                            // Normal path - we haven't expelled the item. We have a valid
                            // cursor position to read the item and make our de-dupe checks.
                            const auto currPos = it->second.position;
                            if (!(canDedup(*currPos, qi))) {
                                return QueueDirtyStatus::FailureDuplicateItem;
                            }
             
                            rv = QueueDirtyStatus::SuccessExistingItem;
                            const int64_t currMutationId{it->second.mutation_id};
             
                            // Given the key already exists, need to check all cursors in
                            // this Checkpoint and see if the existing item for this key is
                            // to the "left" of the cursor (i.e. has already been
                            // processed).
                            for (auto& cursor : checkpointManager->cursors) {
                                if ((*(cursor.second->currentCheckpoint)).get() == this) {
                                    if (cursor.second->name ==
                                        CheckpointManager::pCursorName) {
                                        int64_t cursor_mutation_id =
                                                getMutationId(*cursor.second);         <------- !!
                                        queued_item& cursor_item =
                                                *(cursor.second->currentPos);
            ..
            

            Not related to the issue, but a comment on Checkpoint::getMutationId() and its usage.
            Essentially, in Checkpoint::queueDirty() we need to compare 'it->second.mutation_id' to the seqno of the item currently pointed to by each cursor placed in the checkpoint (I'll call it cursor-seqno from now on).
            To get cursor-seqno we call Checkpoint::getMutationId(cursor). The logic there is: (1) look up cursor_item_idx in the keyIndex and (2) return cursor_item_idx->second.mutation_id.
            Couldn't we just do 'cursor-seqno = (*(cursor.second->currentPos))->getBySeqno()' in queueDirty() and avoid the call to getMutationId()?

            Note that, given how Checkpoint::queueDirty() and Checkpoint::getMutationId() work, the mutation dumped in the logs is just the mutation pointed to by the persistence cursor at the time of the failure (i.e. not the mutation that the node is processing when we fail).
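            A minimal sketch of that suggested simplification, written against the queueDirty() excerpt above (identifiers are the ones quoted there; this is an untested fragment, not a proposed patch):

            for (auto& cursor : checkpointManager->cursors) {
                if ((*(cursor.second->currentCheckpoint)).get() == this) {
                    if (cursor.second->name == CheckpointManager::pCursorName) {
                        // Take the cursor's seqno straight from the item it points
                        // to, avoiding the keyIndex lookup in getMutationId() (and
                        // hence the throw when that item's key is not indexed).
                        const int64_t cursor_mutation_id =
                                (*(cursor.second->currentPos))->getBySeqno();
                        // ... continue with the existing comparison against
                        // currMutationId as in the excerpt above ...
                    }
                }
            }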

            Paolo Cocchi added a comment - edited

            While in MB-41283 we hit an issue in the Memory->Disk transition, here we are hitting the following in the opposite Disk->Memory transition (note that the Active is 6.0.1 and the Replica is 6.6.1):

            1. Active sends a Disk snapshot [X, Y]
            2. Replica receives it successfully, but since MB-35889 (6.6.0) the replica doesn't add keys to Checkpoint indexes for Disk snapshots -> no entry for any key in the Disk checkpoint at the replica
            3. Active switches to in-memory -> it sends a SnapMarker with MARKER_FLAG_CHK NOT set (possible in 6.0.1, e.g. MB-32862) + it starts sending mutations within the new Memory snapshot
            4. Replica receives the SnapMarker for the Memory snapshot and turns the existing Disk checkpoint into a Memory checkpoint
            5. Replica receives mutations for the Memory snapshot and queues them into the Checkpoint that already contains mutations from the original Disk snapshot
            6. At some point the Replica receives a mutation for a key that already exists in the Checkpoint -> the deduplication code path above is executed, but there is no index entry for the item the persistence cursor points to (it came from the Disk snapshot), so we fail

            In the code above we need to compare the mutation-seqno with the cursor-seqno for every cursor in the Checkpoint.
            If we find a cursor that points to a mutation received as part of the original Disk snapshot, then we try to look up the key of that mutation in the keyIndex. Given that in 6.6.1 we don't add keys to the index for Disk snapshots, we don't find that key and we throw, as seen in this MB (a toy model below makes the sequence concrete).

            Note: I expect this issue to also affect any upgrade from pre-6.5 (i.e. 6.0.x, where we had issues like MB-32862) to 6.6.0, as MB-35889 was introduced in 6.6.0.
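            To make that sequence concrete, here is a small self-contained toy model. It is not kv_engine code: ToyCheckpoint and ToyItem are hypothetical stand-ins that only mimic the keyIndex/cursor interaction described above (Disk-snapshot items queued but not indexed, the same checkpoint later extended with Memory-snapshot mutations, the persistence cursor still sitting on a Disk-snapshot item).

            // toy_checkpoint.cc - illustrative model only, not kv_engine code
            #include <cstdint>
            #include <iostream>
            #include <stdexcept>
            #include <string>
            #include <unordered_map>
            #include <vector>

            struct ToyItem {
                std::string key;
                int64_t seqno;
            };

            struct ToyCheckpoint {
                std::vector<ToyItem> items;
                std::unordered_map<std::string, int64_t> keyIndex; // key -> seqno
                size_t persistenceCursor = 0; // index of the item the cursor points at

                void receiveDiskSnapshot(const std::vector<ToyItem>& snap) {
                    // Mirrors MB-35889: Disk-snapshot items are queued but NOT indexed.
                    for (const auto& item : snap) {
                        items.push_back(item);
                    }
                }

                void receiveMemoryMutation(const ToyItem& item) {
                    auto it = keyIndex.find(item.key);
                    if (it != keyIndex.end()) {
                        // De-dup path: queueDirty() needs the seqno of the item the
                        // persistence cursor points at, and obtains it via a keyIndex
                        // lookup on that item's key (getMutationId()).
                        const auto& cursorItem = items.at(persistenceCursor);
                        if (keyIndex.find(cursorItem.key) == keyIndex.end()) {
                            // The cursor item came from the Disk snapshot and was never
                            // indexed -> the logic_error seen in this MB.
                            throw std::logic_error(
                                    "Unable to find key in keyIndex for cursor item " +
                                    cursorItem.key);
                        }
                    }
                    items.push_back(item);
                    keyIndex[item.key] = item.seqno; // Memory-snapshot items are indexed
                }
            };

            int main() {
                ToyCheckpoint cp;
                cp.receiveDiskSnapshot({{"a", 1}, {"b", 2}}); // steps 1-2: Disk snapshot, no index entries
                // Steps 3-5: SnapMarker without MARKER_FLAG_CHK, so the same checkpoint
                // is extended with Memory-snapshot mutations (which are indexed).
                cp.receiveMemoryMutation({"c", 3});
                try {
                    cp.receiveMemoryMutation({"c", 4}); // step 6: duplicate key -> throws
                } catch (const std::logic_error& e) {
                    std::cout << "throws: " << e.what() << '\n';
                }
                return 0;
            }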

            Paolo Cocchi added a comment -

            Test at http://review.couchbase.org/c/kv_engine/+/140532 encodes the failure scenario described in my previous comment. Fix in progress.

            Paolo Cocchi added a comment -

            On a separate thread, MB-42805.

            Dave Rigby added a comment -

            Thanks for the details.

            Paolo Cocchi One concern I have with reverting MB-35889: it has already shipped in 6.6.0, so we will have to deal with code which doesn't maintain the keyIndex for Disk checkpoints in upgrade paths going forward (i.e. 6.6.0 -> newer). What is the consequence of that - i.e. are there any scenarios where, during a cluster upgrade, a 6.6.0 node could have problems?

            Dave Rigby added a comment -

            Also - assuming we use the above patch, what's the consequence of just using Item::getBySeqno for the mutation_id - that we always wake the flusher to re-persist even when strictly not necessary?

            Paolo Cocchi added a comment -

            Hi Ashwin Govindarajulu, the fix is submitted; assigning back to you for verification, thanks.


            Couchbase Build Team added a comment -

            Build couchbase-server-6.6.1-9193 contains kv_engine commit c4454a5 with commit message:
            MB-42780: Make replica resilient to missing MARKER_FLAG_CHK

            Couchbase Build Team added a comment -

            Build couchbase-server-6.6.1-9194 contains kv_engine commit 675007e with commit message:
            MB-42780: Expand tests and improve comments

            Ashwin Govindarajulu added a comment -

            Not seeing this crash while trying to rebalance-in a 6.6.1-9194 node on top of a 6.0.1-2037 cluster.

            Closing this ticket.

            Ashwin Govindarajulu added a comment -

            Sure Richard deMellow. Already started working on that test.

            Couchbase Build Team added a comment -

            Build couchbase-server-7.0.0-4069 contains kv_engine commit 675007e with commit message:
            MB-42780: Expand tests and improve comments

            Couchbase Build Team added a comment -

            Build couchbase-server-7.0.0-4069 contains kv_engine commit c4454a5 with commit message:
            MB-42780: Make replica resilient to missing MARKER_FLAG_CHK

            Couchbase Build Team added a comment -

            Build couchbase-server-7.0.0-4127 contains kv_engine commit eba2ffd with commit message:
            MB-42780: Logically revert MB-41283

            Couchbase Build Team added a comment -

            Build couchbase-server-7.0.0-4129 contains kv_engine commit cc90dc9 with commit message:
            MB-42780: CM allows extending only Memory checkpoints

            People

              Assignee: Ashwin Govindarajulu
              Reporter: Ashwin Govindarajulu