Details
-
Bug
-
Resolution: Duplicate
-
Major
-
2.0.1
-
Security Level: Public
-
Windows 2008 R2 64bit
Description
Environment:
- 9 windows 2008 R2 64bit.
- Each server has 4 CPU, 8GB RAM and SSD disk
- Cluster has 2 buckets, default and sasl bucket with consistent view enable.
- Load 26 million items to default bucket and 16 million items to sasl bucket. Each key has size from 128 to 512 bytes
- Each bucket has one doc and 2 views for each doc.
- Rebalance out 2 nodes 10.3.121.173 and 10.3.121.243
- Rebalance failed when rebalance done with default bucket and move to second sasl bucket
- File bug
MB-7590
- Rebalance again. Rebalance done.
- Add node 10.3.121.243 back to cluster and rebalane. Rebalance failed again with error "Resetting rebalance status since it's not really running"
- Filed bug
MB-7595
Check diags, see a lot crash in memsup
=========================CRASH REPORT=========================
crasher:
initial call: memsup:init/1
pid: <0.23937.236>
registered_name: memsup
exception exit: {timeout,{gen_server,call,[os_mon_sysinfo,get_mem_info]}}
in function gen_server:terminate/6
ancestors: [os_mon_sup,<0.31883.45>]
messages: []
links: [<0.31884.45>]
dictionary: []
trap_exit: true
status: running
heap_size: 377
stack_size: 24
reductions: 199
neighbours:
Link to manifest file http://builds.hq.northscale.net/latestbuilds/couchbase-server-enterprise_x86_64_2.0.1-140-rel.setup.exe.manifest.xml
Link to collect_info of all nodes https://s3.amazonaws.com/packages.couchbase/collect_info/2_0_1/201301/9nodes-col-201-140-rebalance-failed-not-really-running-20130124-141814.tgz
Attachments
Issue Links
- depends on
-
MB-7658 [Windows] severe timeouts in windows cluster with 2 buckets and 1 ddoc per bucket causes rebalance and other failures
- Closed