Couchbase Server / MB-5176

Rebalancing out a node from a cluster causes very high ejection to disk.


Details

    • Type: Bug
    • Resolution: Incomplete
    • Priority: Major
    • 2.0
    • None
    • Component/s: couchbase-bucket
    • Security Level: Public
    • None
    • Environment: 4 node cluster on Ubuntu
    • 2

    Description

      -------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Setup
      -------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Bucket size: 3GB, Nodes: 4, keys inserted: 64 - 256 bytes

      -------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Repro Steps
      -------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      1. Load a high volume of data on the cluster (4 nodes); note: data inserted > low water mark threshold (a hedged cbstats check for this is sketched after this list).
      2. Remove a server
      3. Rebalance
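
      As an illustrative aside (not part of the original report), the low-water-mark check referenced in step 1 can be scripted against cbstats before kicking off the rebalance; the node address and the cbstats path below are taken from the sample output further down, and the helper itself is hypothetical.

      # Hypothetical helper: confirm mem_used has crossed ep_mem_low_wat on a node
      # before starting the rebalance (parses "cbstats ... raw memory" output).
      import subprocess

      def memory_stats(host_port):
          out = subprocess.check_output(
              ["/opt/couchbase/bin/cbstats", host_port, "raw", "memory"])
          stats = {}
          for line in out.decode().splitlines():
              key, _, value = line.partition(":")
              stats[key.strip()] = value.strip()
          return stats

      s = memory_stats("10.1.3.67:11210")
      print("mem_used=%s ep_mem_low_wat=%s above_low_wat=%s" % (
          s["mem_used"], s["ep_mem_low_wat"],
          int(s["mem_used"]) > int(s["ep_mem_low_wat"])))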

      -------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Error
      -------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      cbstats reports an unexpectedly low resident memory ratio and an unexpectedly high ejection rate.

      sample :

      /opt/couchbase/bin/cbstats 10.1.3.67:11210 raw memory
      ep_kv_size: 1616332569
      ep_max_data_size: 3250585600
      ep_mem_high_wat: 2437939200
      ep_mem_low_wat: 1950351360
      ep_oom_errors: 0
      ep_overhead: 86679858
      ep_tmp_oom_errors: 0
      ep_value_size: 267558416
      mem_used: 2421153021
      tcmalloc_current_thread_cache_bytes: 7941064
      tcmalloc_max_thread_cache_bytes: 33554432
      tcmalloc_unmapped_bytes: 163840
      total_allocated_bytes: 1662925672
      total_fragmentation_bytes: 762578072
      total_free_bytes: 153034752
      total_heap_bytes: 2578538496
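
      Reading the figures above (an illustrative calculation, not part of the original report): mem_used is at roughly 99% of ep_mem_high_wat, the threshold at which the item pager starts ejecting resident items down toward the low water mark, and tcmalloc fragmentation accounts for roughly 30% of the heap.

      # Quick arithmetic on the stats pasted above (illustration only).
      mem_used   = 2421153021
      high_wat   = 2437939200
      heap_bytes = 2578538496
      frag_bytes = 762578072
      print("mem_used vs high water mark: %.1f%%" % (100.0 * mem_used / high_wat))      # ~99.3%
      print("fragmentation vs heap:       %.1f%%" % (100.0 * frag_bytes / heap_bytes))  # ~29.6%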

      The diags from all the nodes are attached below, and the cbstats output monitored during the rebalance is attached as high_load.log.


      May 6: Re-ran this test with the following setup; seeing a drop in resident ratio (<50%) on one node, which also shows higher fragmentation.
      With 1024 vbuckets, the resident ratio after rebalancing out one node stays fairly OK (~80-90%) on two nodes but drops sharply (<50%) on the master node (10.1.3.92).

      Build: 1.8.1-802rel

      Setup: 3GB bucket size, 4 nodes (10.1.3.73, 10.1.3.70, 10.1.3.71, 10.1.3.92), number of replicas = 2
      vbuckets = 1024, key-size = 512-1k
      mcsoda: pytests/performance/mcsoda.py membase://10.1.3.92:8091 vbuckets=1024 doc-gen=0 doc-cache=0 ratio-creates=1 ratio-sets=1 min-value-size=512,1024 max-items=4000000 exit-after-creates=1 prefix=test_1_

      • Load stopped before rebalancing out
      • Total items inserted = 2.28M
      • Remove a node and rebalance it out.
      • Smaller drop in resident ratio.

      Diags attached as 10.1.3.70-8091-diag.txt.gz 10.1.3.72-8091-diag.txt.gz 10.1.3.92-8091-diag.txt.gz
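
      A minimal monitoring sketch (not part of the original report), assuming the nodes expose the curr_items and ep_num_non_resident counters via cbstats, so that the per-node resident ratio is (curr_items - ep_num_non_resident) / curr_items; the node addresses come from the setup above.

      # Hypothetical watcher: print the per-node resident ratio every 10 seconds
      # while the rebalance runs.
      import subprocess, time

      NODES = ["10.1.3.73:11210", "10.1.3.70:11210",
               "10.1.3.71:11210", "10.1.3.92:11210"]

      def all_stats(host_port):
          out = subprocess.check_output(
              ["/opt/couchbase/bin/cbstats", host_port, "all"])
          stats = {}
          for line in out.decode().splitlines():
              k, _, v = line.partition(":")
              stats[k.strip()] = v.strip()
          return stats

      while True:
          for node in NODES:
              s = all_stats(node)
              curr = int(s.get("curr_items", 0))
              non_resident = int(s.get("ep_num_non_resident", 0))
              if curr:
                  print("%s resident ratio: %.1f%%"
                        % (node, 100.0 * (curr - non_resident) / curr))
          time.sleep(10)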

      Attachments

        1. 10.1.3.67-8091-diag.txt.gz
          6.73 MB
        2. 10.1.3.69-8091-diag.txt.gz
          7.24 MB
        3. 10.1.3.70-8091-diag.txt.gz
          13.24 MB
        4. 10.1.3.72-8091-diag.txt.gz
          5.80 MB
        5. 10.1.3.72-8091-diag.txt.gz
          2.91 MB
        6. 10.1.3.73-8091-diag.txt.gz
          2.74 MB
        7. 10.1.3.92-8091-diag.txt.gz
          16.61 MB
        8. data.tar
          8.54 MB
        9. monitor.log
          1.93 MB
        10. residentMem_02.tar
          19.44 MB
        11. residentMem_03
          7.10 MB


          People

            mikew Mike Wiederhold [X] (Inactive)
            ketaki Ketaki Gangal (Inactive)
