Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Duplicate
Priority: Critical
Fix Version/s: 6.0.0
Affects Version/s: 6.0.0
Component/s: eventing
Labels:
None
Environment:
6.0.0-1494

Triage:
Untriaged
Link to Log File, atop/blg, CBCollectInfo, Core dump:
https://s3.amazonaws.com/bugdb/jira/eventing_rebalance_exit/test_25.zip
Is this a Regression?:
Unknown

Description

Script to Repro

./testrunner -i /tmp/testexec.2350.ini -p get-cbcollect-info=True,GROUP=n1ql_op_without_timers -t eventing.eventing_rebalance.EventingRebalance.test_erl_crash_on_kv_and_eventing_node_during_eventing_rebalance,doc-per-day=20,dataset=default,nodes_init=5,services_init=kv-kv-eventing-eventing-index:n1ql,groups=simple,reset_services=True,handler_code=n1ql_op_without_timers,replicas=1,GROUP=n1ql_op_without_timers

I am aware there is a bug ~~MB-30897~~ in the system test. However since the following error message is a generic error for all rebalance hang, I am tracking it as a new bug. Also this failure is only seen in this scenario where retry of the failed rebalance(because of killing erlang) fails.

[2018-08-13 11:24:00,280] - [rest_client:1598] ERROR - {u'status': u'none', u'errorMessage': u'Rebalance failed. See logs for detailed reason. You can try again.'} - rebalance failed

[2018-08-13 11:24:00,308] - [rest_client:3134] INFO - Latest logs from UI on 172.23.104.109:

[2018-08-13 11:24:00,308] - [rest_client:3135] ERROR - {u'node': u'ns_1@172.23.104.109', u'code': 0, u'text': u'Rebalance exited with reason {service_rebalance_failed,eventing,\n                              {rebalance_failed,\n                               {service_error,\n                                <<"eventing rebalance hasn\'t made progress for past 600 secs">>}}}', u'shortText': u'message', u'serverTime': u'2018-08-13T11:23:51.051Z', u'module': u'ns_orchestrator', u'tstamp': 1534184631051, u'type': u'critical'}

Logs attached.

I don't this is specific to n1ql operations from eventing, However if you think it is feel free to decrease the priority.

Attachments

Issue Links

relates to

MB-29271 Eventing Rebalance in hangs when memcached is killed on kv and eventing nodes

Closed

MB-30782 Janitor should be run when service rebalance is in progress

Reopened

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

Activity

People

Assignee:: Satya Nand (Inactive)

Reporter:: Balakumaran Gopal

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Due:: 17/Aug/18

Created:: 14/Aug/18 1:43 AM

Updated:: 17/Aug/18 12:16 AM

Resolved:: 16/Aug/18 10:07 PM

Gerrit Reviews

There are no open Gerrit changes

Rebalance exited with reason {service_rebalance_failed,eventing,{rebalance_failed

Details

Description

Attachments

Issue Links

Gerrit Reviews

Activity

People

Dates

Gerrit Reviews

PagerDuty