Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-57358

Analytics keeps hitting OOM due to query with multiple FILTER() clauses

    XMLWordPrintable

Details

    Description

      Test steps

      1. Deploy an GCP cluster having 3 KV, GSI-Query and FTS, CBAS nodes each separately.
      2. Create a Magma bucket having single replica and 1 scope + 2 collections in addition to _default._default keyspace.
      3. Load 5M docs in each of the 2 collections.
      4. Create GSI Indexes, wait for the Indexes to come online and run queries against them.
      5. Increase disk size by 5G for all service groups.
      6. Online scaling operation goes through fine without any issues.
      7. Decrease the disk size by 5G for all the service groups. This triggers a swap rebalance for all the nodes one at a time. When it comes to cbas, swap rebalance for cbas failing:

      Rebalance exited with reason {service_rebalance_failed,cbas,
      {worker_died,
      {'EXIT',<0.15712.37>,
      {rebalance_failed,
      {service_error,
      <<"Rebalance 04e84f10c3df260ab582180f9b550230 failed: CBAS0001: Analytics collections in different partitions have different DCP states. Mutations needed to catch up = 11404155. User action: Try again later">>}}}}}.
      Rebalance Operation Id = 04d96440e4b72ca88bf6344484fb940a
      

      QE Test

      sudo guides/gradlew --refresh-dependencies testrunner -P jython=/opt/jython/bin/jython -P 'args=-i /tmp/capella.ini -p bucket_storage=magma,bucket_eviction_policy=fullEviction,rerun=False -t aGoodDoctor.hostedHospital.Murphy.test_rebalance,graceful=True,skip_cleanup=True,num_buckets=1,bucket_names=GleamBook,bucket_type=membase,eviction_policy=fullEviction,iterations=10,batch_size=1000,sdk_timeout=60,log_level=debug,infra_log_level=debug,rerun=False,skip_cleanup=True,key_size=24,randomize_doc_size=False,randomize_value=True,maxttl=10,pc=20,gsi_nodes=3,cbas_nodes=3,fts_nodes=3,kv_nodes=3,n1ql_nodes=3,mutation_perc=100,key_type=RandomKey,capella_run=true,services=data-query:index-search-analytics,max_rebl_nodes=27,kv_compute=n2-standard-4,gsi_compute=n2-standard-4,n1ql_compute=n2-standard-4,fts_compute=n2-standard-4,cbas_compute=n2-standard-4,kv_disk=500,n1ql_disk=50,gsi_disk=500,cbas_disk=500,provider=GCP,region=us-central1,type=PD-SSD,skip_teardown_cleanup=true,wait_timeout=14400,index_timeout=28800,runtype=dedicated,track_failures=True,skip_init=False,key_type=CircularKey,rebalance_type=disk,clients_per_db=1 -m rest'
      

      Attachments

        No reviews matched the request. Check your Options in the drop-down menu of this sections header.

        Activity

          People

            ali.alsuliman Ali Alsuliman
            ritesh.agarwal Ritesh Agarwal
            Votes:
            0 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

              Created:
              Updated:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty