Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-62635

Revise node lifecycle order of operations

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Columnar 1.0.0
    • Columnar 1.0.0
    • analytics
    • Untriaged
    • 0
    • Unknown
    • Analytics Sprint 46

    Description

      1:

      2024-07-04T10:50:24.980+00:00 INFO CBAS.messaging.NCMessageBroker [Worker:e93b0b898d0f9bf4898662653f6ff875] Received message: StorageCleanupRequestMessage 

       

      2:

      2024-07-04T10:50:58.248+00:00 INFO CBAS.rebalance.TopologyMonitor [Executor-4286:e93b0b898d0f9bf4898662653f6ff875] got topology from cc:  ...

       

      3:

      2024-07-04T10:50:58.248+00:00 INFO CBAS.rebalance.TopologyMonitor [Topology Monitor] cleaning up storage and bootstrapping partitions 

       

      4:

      2024-07-04T10:51:01.612+00:00 INFO CBAS.messaging.NCMessageBroker [Executor-4334:e93b0b898d0f9bf4898662653f6ff875] Received message: ActiveManagerMultiMessage{kind=GENERIC_EVENT, desc=DCP state, runtimeIds=[(remote_HsRVj/default0_4SIVO)[32]:BO, (remote_HsRVj/default0_4SIVO)[33]:BO, (remote_HsRVj/default0_4SIVO)[34]:BO, (remote_HsRVj/default0_4SIVO)[35]:BO, (remote_HsRVj/default0_4SIVO)[36]:BO, (remote_HsRVj/default0_4SIVO)[37]:BO, (remote_HsRVj/default0_4SIVO)[38]:BO, (remote_HsRVj/default0_4SIVO)[39]:BO, (remote_HsRVj/default0_4SIVO)[40]:BO, (remote_HsRVj/default0_4SIVO)[41]:BO, (remote_HsRVj/default0_4SIVO)[42]:BO, (remote_HsRVj/default0_4SIVO)[43]:BO, (remote_HsRVj/default0_4SIVO)[44]:BO, (remote_HsRVj/default0_4SIVO)[45]:BO, (remote_HsRVj/default0_4SIVO)[46]:BO, (remote_HsRVj/default0_4SIVO)[47]:BO]}2024-07-04T10:51:01.722+00:00 INFO CBAS.dataflow.FeedRecordDataFlowController [SAO:JID:0.55:TAID:TID:ANID:ODID:48:0:32:0:(remote_HsRVj/default0_4SIVO)[32]:BO] controller is being set from CREATED to STARTED  

       

      5:

      2024-07-04T10:51:11.825+00:00 INFO CBAS.cloud.AbstractCloudIOManager [Topology Monitor] Cleaning node partitions...2024-07-04T10:51:11.828+00:00 INFO CBAS.util.CloudFileUtil [Topology Monitor] Cleaning partition storage/partition_2.2024-07-04T10:51:11.846+00:00 INFO CBAS.util.CloudFileUtil [Topology Monitor] Cleaning partition storage/partition_34.2024-07-04T10:51:11.863+00:00 INFO CBAS.util.CloudFileUtil [Topology Monitor] Cleaning partition storage/partition_6.2024-07-04T10:51:11.879+00:00 INFO CBAS.util.CloudFileUtil [Topology Monitor] Cleaning partition storage/partition_38.2024-07-04T10:51:11.895+00:00 INFO CBAS.util.CloudFileUtil [Topology Monitor] Cleaning partition storage/partition_10.2024-07-04T10:51:11.895+00:00 INFO CBAS.util.CloudFileUtil [Topology Monitor] Deleting /var/cb-cache/@analytics/v_iodevice_10/storage/partition_10/Default/Default/remote_HsRVj_volCollection_2_munmq/0/remote_HsRVj_volCollection_2_munmq/.metadata from the local cache as storage/partition_10/Default/Default/remote_HsRVj_volCollection_2_munmq/0/remote_HsRVj_volCollection_2_munmq/.metadata doesn't exist in the cloud 

       

      6: Not finding a checkpoint

      2024-07-04T10:51:20.824+00:00 ERRO CBAS.impls.LSMHarness [Executor-4360:e93b0b898d0f9bf4898662653f6ff875] FLUSH operation.afterFinalize failed on {"dir" : "/var/cb-cache/@analytics/v_iodevice_10/storage/partition_10/Default/Default/remote_HsRVj_volCollection_2_munmq/0/remote_HsRVj_volCollection_2_munmq", "memory" : [{"state":"READABLE_UNWRITABLE_FLUSHING", "writers":0, "readers":1, "pendingFlushes":0, "id":"[1,1]", "index":{"class": "BTree", "file": "storage/partition_10/Default/Default/remote_HsRVj_volCollection_2_munmq/0/remote_HsRVj_volCollection_2_munmq_virtual_0"}}, {"state":"READABLE_WRITABLE", "writers":0, "readers":0, "pendingFlushes":0, "id":"[2,2]", "index":{"class": "BTree", "file": "storage/partition_10/Default/Default/remote_HsRVj_volCollection_2_munmq/0/remote_HsRVj_volCollection_2_munmq_virtual_1"}}], "disk" : 0, "num-scheduled-flushes":1, "current-memory-component":1}java.lang.IllegalStateException: Couldn't find any checkpoints for resource: /var/cb-cache/@analytics/v_iodevice_10/storage/partition_10/Default/Default/remote_HsRVj_volCollection_2_munmq/0/remote_HsRVj_volCollection_2_munmq 

       

      At (4), the link is reconnected before bootstrapping the partitions (5)

      Logs from MB-62614

      Attachments

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              ali.alsuliman Ali Alsuliman
              wail.alkowaileet Wail Alkowaileet (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty