Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-53900

[Backport MB-52790 to 7.1.2] - perf tests stuck due to failed cbindex

    XMLWordPrintable

Details

    • Untriaged
    • 1
    • Unknown

    Description

      While running experiments on aether cluster for MB-51755, I noticed that cbindex fails to execute build index and the test gets stuck.

      For example, in aether/2081:

      12:10:25 2022-06-28T23:40:25 [INFO] Running: /opt/couchbase/bin/cbindex -auth=Administrator:password -server 172.23.110.72:8091 -type build -indexes bucket-1:myindex
      12:10:25 
      12:10:25 [172.23.110.53] run: /opt/couchbase/bin/cbindex -auth=Administrator:password -server 172.23.110.72:8091 -type build -indexes bucket-1:myindex
      12:10:25 [172.23.110.53] out: 2022-06-28T23:40:25.726-07:00 [Error] PeerPipe.doRecieve() : ecounter error when received mesasage from Peer 172.23.110.72:9100.  Error = EOF. Kill Pipe.
      12:10:25 [172.23.110.53] out: 2022-06-28T23:40:25.727-07:00 [Error] FollowerSyncProxy.receiveAndUpdateAcceptedEpoch(): Error encountered = Server Error : SyncProxy.listen(): channel closed. Terminate
      12:10:25 [172.23.110.53] out: 2022-06-28T23:40:25.727-07:00 [Error] WatcherServer.runOnce() : Watcher fail to synchronized with peer 172.23.110.72:9100
      12:10:25 [172.23.110.53] out: Index building for: []
      12:10:26 [172.23.110.53] out:

      In indexer log at the same timestamp:

      2022-06-28T23:40:25.724-07:00 [Error] PeerListener.handleConnection error in authfn Protocol Error : IndexManager:ServerAuth: Expect message Request, Receive message FollowerInfo for conn 172.23.110.72:9100:172.23.110.53:46254

       

      aether runs that got stuck:

      1. aether/2181 - logs: http://supportal.couchbase.com/snapshot/00f716993c837f142ce7be66d795d3a6::0
        1. I was able to ssh into the machine and successfully execute the cbindex build. Logs are from before this.
      2. aether/2080 - logs: no logs
      3. aether/2179 - logs: http://supportal.couchbase.com/snapshot/8a7321512e5e867a7b0fc1499f37d232::0

       

      These experiments were with various toy builds built on top of 7.1.1-3097

      Attachments

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              pavan.pb Pavan PB
              dhruvil.ketanshah Dhruvil Shah
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty