Details
Description
Setup
1. Setup a unidirectional replication on a 2:2 node , source and destination cluster.
2. Load 2M items at source, seeing 2M items replicated to destination cluster.
3. Load 2M items on source, and Start mutating data at source.
4. Add 1 node on destination cluster, rebalance
5. Expect 4M items in total on destination.
Error
1. Seeing only 2.31M items replicated destination cluster and replication appears to have stopped.
Logs show mutliple Crash reports, stating "worked died.. timeout error". while connecting to the destination nodes.
-
- Reason for termination ==
- {worker_died,<0.14149.70>,
{http_request_failed,"POST",
"http://Administrator:*****@10.3.121.33:8092/default%2f843%3b6faefb8ab82b9b2b9f2ecb8381ce4a94/_revs_diff",
{error,Unknown macro: {error,timeout}}}}
[xdcr:info,2012-08-15T15:53:54.544,ns_1@10.3.121.38:xdc_rep_manager:xdc_rep_manager:handle_info:244]9a3d22328fbbcd0a28e91164dc000b21: replication of vbucket 843 failed due to reason: {worker_died,
<0.14149.70>,
{http_request_failed,
"POST",
"http://Administrator:*****@10.3.121.33:8092/default%2f843%3b6faefb8ab82b9b2b9f2ecb8381ce4a94/_revs_diff",
{error,
{error,
timeout}}}}
[xdcr:info,2012-08-15T15:53:54.544,ns_1@10.3.121.38:xdc_rep_manager:xdc_rep_manager:max_concurrent_reps:604]MAX_CONCURRENT_REPS_PER_DOC set to 8
[error_logger:error,2012-08-15T15:53:54.549,ns_1@10.3.121.38:error_logger:ale_error_logger_handler:log_report:72]
=========================CRASH REPORT=========================
crasher:
initial call: xdc_replicator:init/1
pid: <0.14138.70>
registered_name: []
exception exit: {worker_died,<0.14149.70>,
{http_request_failed,"POST",
"http://Administrator:*****@10.3.121.33:8092/default%2f843%3b6faefb8ab82b9b2b9f2ecb8381ce4a94/_revs_diff",
{error,{error,timeout}}}}
in function gen_server:terminate/6
ancestors: [xdc_rep_sup,ns_server_sup,ns_server_cluster_sup,<0.60.0>]
messages: [
]
links: [<0.14148.70>,<0.14151.70>,<0.14153.70>,<0.14150.70>,
<0.408.0>,<0.14146.70>]
dictionary: [{task_status_props,
[
,
,
,
,
,
,
,
,
,
,
,
,
,
,
]},
{task_status_update,1345,71234,539370},1000000]
trap_exit: true
status: running
heap_size: 4181
stack_size: 24
reductions: 58230
neighbours:
neighbour: [
,
,
{initial_call,{lhttpc_client,request,9}},
{current_function,{prim_inet,recv0,3}},
,
,
,
,
,
,
,
,
]
neighbour: [
,
Attaching logs from the nodes https://s3.amazonaws.com/bugdb/jira/rebalance_1/temp.tar
Rebalance completed successfully. 12:05:23
Started rebalance : 11:56:22