Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Fixed
Priority: Major
Fix Version/s: 6.0.0
Affects Version/s: 5.5.0
Component/s: eventing
Labels:
- eventing-ga
Environment:
5.5.0-2497

Triage:
Untriaged
Operating System:
Centos 64-bit
Is this a Regression?:
No

Description

Script to Repro

./testrunner -i /tmp/testexec.8098.ini -p get-cbcollect-info=True,GROUP=bucket_op -t eventing.eventing_rebalance.EventingRebalance.test_memcache_crash_on_kv_and_eventing_node_during_eventing_rebalance,doc-per-day=10,dataset=default,nodes_init=5,services_init=kv-kv-eventing-eventing-index:n1ql,groups=simple,reset_services=True,GROUP=bucket_op

Steps
1) Create a 5 node cluster of kv-kv-eventing-eventing-index:n1ql
2) Deployed a eventing function.
3) Start loading docs on source bucket.
4) When 3 is in progress rebalance in an eventing node.
5) After rebalance reaches 30% or so kill memcached on 1 kv(172.23.108.91) and 1 eventing(172.23.109.137) node.

Rebalance hangs. Logs attached.

Cluster details
172.23.107.67 - kv
172.23.108.91 - kv
172.23.109.137 - eventing
172.23.109.152 - eventing
172.23.109.153 - index:n1ql
172.23.98.165 - eventing (Rebalancing in)

cbcollect_info : https://s3.amazonaws.com/bugdb/jira/memcache_crash_hang/test_24.zip

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

172.23.107.187-20180903-0225-diag.zip
35.69 MB
04/Sep/18 1:18 AM
172.23.107.196-20180903-0227-diag.zip
18.57 MB
04/Sep/18 1:18 AM
172.23.107.201-20180903-0229-diag.zip
22.59 MB
04/Sep/18 1:18 AM
172.23.107.210-20180903-0230-diag.zip
21.74 MB
04/Sep/18 1:18 AM
172.23.107.211-20180903-0232-diag.zip
18.91 MB
04/Sep/18 1:18 AM
172.23.107.212-20180903-0234-diag.zip
15.71 MB
04/Sep/18 1:17 AM
Screen Shot 2018-08-09 at 4.32.35 PM.png
363 kB
09/Aug/18 4:02 AM

Issue Links

blocks

MB-30782 Janitor should be run when service rebalance is in progress

Reopened

relates to

MB-30908 Rebalance exited with reason {service_rebalance_failed,eventing,{rebalance_failed

Closed

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

For Gerrit Dashboard: MB-29271
#	Subject	Branch	Project	Status	CR	V
97500,4	MB-29271 Bail out Eventing rebalance if it's struck for 600s	unstable	eventing	Status: MERGED	+2	+1

Activity

People

Assignee:: Abhishek Singh (Inactive)

Reporter:: Balakumaran Gopal

Votes:: 0 Vote for this issue

Watchers:: 10 Start watching this issue

Dates

Created:: 15/Apr/18 11:34 PM

Updated:: 04/Sep/18 1:26 AM

Resolved:: 04/Sep/18 1:26 AM

Gerrit Reviews

There are no open Gerrit changes

Show There is 1 closed Gerrit change

Hide There is 1 closed Gerrit change

MB-29271 Bail out Eventing rebalance if it's struck for 600s: Gerrit Review:

Eventing Rebalance in hangs when memcached is killed on kv and eventing nodes

Details

Description

Attachments

Attachments

Issue Links

Gerrit Reviews

Activity

People

Dates

Gerrit Reviews

PagerDuty