Couchbase Server: MB-48419

Incorrect num_items/mem_used is shown on the dashboard during KV rebalance IN at 15% DGM

Details

    Description

      Sorry for leaving it to Engineering to figure out the problem; elaborating here:

      Steps:
      1. Create a cluster with 4 KV nodes and 2 index/N1QL nodes
      2. Create a magma bucket with 50 collections under the default scope
      3. Load 125M items and upsert them
      4. Load another 125M items and upsert them as well
      5. Create 50 indexes on the 50 collections and build them; start 50 QPS
      6. Rebalance in 1 node with doc_ops=create:update:delete:read running in parallel
      7. During the rebalance, various stats intermittently go empty. Observe the disk used and the item count in the image below:

      Bucket Stats:

      Expected = 250M, Actual = 0
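      The zero item count above can be detected without the dashboard by polling ns_server's bucket-stats REST endpoint during the rebalance. A minimal sketch of that check follows; the inline sample body is fabricated to mirror the dropout seen here, and in the real test the JSON would come from an authenticated GET against a node's port 8091 (bucket name GleamBook as in the test config):

      ```python
      import json

      def zero_item_samples(stats):
          """Return indices of samples where curr_items dropped to 0.

          `stats` is the JSON body returned by ns_server's
          GET /pools/default/buckets/<bucket>/stats endpoint, whose
          "op" -> "samples" map holds one list per stat name.
          """
          samples = stats["op"]["samples"]
          return [i for i, n in enumerate(samples["curr_items"]) if n == 0]

      # Fabricated sample mirroring the reported dropout: the middle sample
      # shows the bogus 0 while the cluster actually holds 250M items.
      sample_body = json.loads(
          '{"op": {"samples": {"curr_items": [250000000, 0, 250000000],'
          '"mem_used": [85000000000, 800000000, 85000000000]}}}'
      )
      print(zero_item_samples(sample_body))  # → [1]
      ```

      During a healthy rebalance this list should stay empty; any non-empty result reproduces the dashboard symptom without involving the UI.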

      The cluster stats here show mem_used = 800MB, while it should be much higher: with 250M items in the cluster and ~85GB of RAM available, how can mem_used be only 800MB?

      Cluster Stats:
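      A rough sanity check using only the test's own parameters (num_items, doc_size=1024, key_size=18) shows why 800MB is implausible; the ~85GB quota and the 800MB reading are taken from the screenshots described above, and the arithmetic deliberately ignores per-item metadata overhead, so the real figure should be even larger:

      ```python
      NUM_ITEMS = 250_000_000   # total items loaded in steps 3-4
      DOC_SIZE = 1024           # value bytes per item (test parameter doc_size=1024)
      KEY_SIZE = 18             # key bytes per item (test parameter key_size=18)
      MEM_USED = 800 * 1024**2  # the ~800MB shown on the dashboard
      RAM_QUOTA = 85 * 1024**3  # the ~85GB of bucket RAM available

      # Raw keys+values alone, before any metadata or index overhead.
      dataset_bytes = NUM_ITEMS * (DOC_SIZE + KEY_SIZE)
      resident_ratio = MEM_USED / dataset_bytes

      print(f"dataset ≈ {dataset_bytes / 1024**3:.0f} GiB")
      print(f"implied resident ratio ≈ {resident_ratio:.4%}")
      ```

      The dataset is roughly 240 GiB, so an 800MB mem_used implies a resident ratio well under 1%, whereas this test runs at ~15% DGM, which would put mem_used in the tens of GB.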

      Finally, the nodes turn amber at random, as shown below, which is unexpected. The attached video demonstrates this better.
      Servers:

      QE Test

      git fetch "http://review.couchbase.org/TAF" refs/changes/59/161059/9 && git checkout FETCH_HEAD
      guides/gradlew --refresh-dependencies testrunner -P jython=/opt/jython/bin/jython -P 'args=-i /tmp/magma_temp_job1.ini -p bucket_storage=magma,bucket_eviction_policy=fullEviction,rerun=False -t aGoodDoctor.Hospital.Murphy.test_rebalance,nodes_init=4,graceful=True,skip_cleanup=True,num_items=2500000,num_buckets=1,bucket_names=GleamBook,doc_size=1024,bucket_type=membase,eviction_policy=fullEviction,iterations=2,batch_size=1000,sdk_timeout=60,log_level=debug,infra_log_level=debug,rerun=False,skip_cleanup=True,key_size=18,randomize_doc_size=False,randomize_value=True,assert_crashes_on_load=True,num_collections=50,maxttl=10,num_indexes=50,pc=25,index_nodes=2,cbas_nodes=0,fts_nodes=0,ops_rate=80000,ramQuota=17000,doc_ops=create:update:delete:read,rebl_ops_rate=10000,key_type=RandomKey -m rest'
      

      Nodes are going down randomly. Check out the attached video.


            People

              ritesh.agarwal Ritesh Agarwal
              Votes: 0
              Watchers: 4
