Couchbase Server / MB-31997

Deployment of Function stuck because of bucket ops throwing temp failures

Details

    Description

      Script to Repro

      ./testrunner -i /tmp/win10-bucket-ops.ini -p get-cbcollect-info=True,skip_cleanup=True -t eventing.eventing_rqg.EventingRQG.test_random_n1ql,nodes_init=4,services_init=kv-eventing-index-n1ql,template_file=b/resources/rqg/simple_table_db/query_tests_using_templates/query_10000_fields.txt.zip,dataset=default,groups=simple,reset_services=True,skip_cleanup=True,number_of_handler=3,number_of_queries=50
      

      Logs are attached. I left the setup running for 12+ hours, and undeployment still had not completed.

      I see a lot of subdoc failure messages in the eventing log, which could be why undeployment is not completing.

      2018-11-12T22:45:39.087-08:00 [Error] Consumer::periodicCheckpointCallback [worker_Function_281451933_test_random_n1ql111218031046555450_0:/tmp/127.0.0.1:8091_0_2470993291.sock:26495] Key: <ud>eventing::2470993291::Function_281451933_test_random_n1ql111218031046555450::vb::700</ud>, subdoc operation failed while performing periodic checkpoint update, err: temporary failure occurred, try again later
      2018-11-12T22:45:39.884-08:00 [Error] Consumer::periodicCheckpointCallback [worker_Function_281451933_test_random_n1ql111218031046557543_0:/tmp/127.0.0.1:8091_0_2514504035.sock:26494] Key: <ud>eventing::2514504035::Function_281451933_test_random_n1ql111218031046557543::vb::646</ud>, subdoc operation failed while performing periodic checkpoint update, err: temporary failure occurred, try again later
      2018-11-12T22:45:40.094-08:00 [Error] Consumer::periodicCheckpointCallback [worker_Function_281451933_test_random_n1ql111218031046555450_0:/tmp/127.0.0.1:8091_0_2470993291.sock:26495] Key: <ud>eventing::2470993291::Function_281451933_test_random_n1ql111218031046555450::vb::700</ud>, subdoc operation failed while performing periodic checkpoint update, err: temporary failure occurred, try again later
      2018-11-12T22:45:40.825-08:00 [Info] [gocb] Threshold Log:
      2018-11-12T22:45:40.887-08:00 [Error] Consumer::periodicCheckpointCallback [worker_Function_281451933_test_random_n1ql111218031046557543_0:/tmp/127.0.0.1:8091_0_2514504035.sock:26494] Key: <ud>eventing::2514504035::Function_281451933_test_random_n1ql111218031046557543::vb::646</ud>, subdoc operation failed while performing periodic checkpoint update, err: temporary failure occurred, try again later
      2018-11-12T22:45:40.907-08:00 [Info] [gocb] Threshold Log:
      2018-11-12T22:45:41.081-08:00 [Info] [gocb] Threshold Log:
      2018-11-12T22:45:41.102-08:00 [Error] Consumer::periodicCheckpointCallback [worker_Function_281451933_test_random_n1ql111218031046555450_0:/tmp/127.0.0.1:8091_0_2470993291.sock:26495] Key: <ud>eventing::2470993291::Function_281451933_test_random_n1ql111218031046555450::vb::700</ud>, subdoc operation failed while performing periodic checkpoint update, err: temporary failure occurred, try again later
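
      To gauge how many checkpoint keys are affected and how often, the failure lines above could be tallied with a small script. This is only a rough sketch: the regular expression is inferred from the sample lines quoted in this ticket, not from any documented eventing log format, and the names `tally_checkpoint_failures` and `print_summary` are hypothetical helpers, not part of Couchbase tooling.

```python
import re
from collections import Counter

# Assumption: failure lines look like the samples quoted in this ticket.
# The pattern captures the checkpoint key inside <ud>...</ud> and the error text.
CHECKPOINT_FAILURE = re.compile(
    r"\[Error\] Consumer::periodicCheckpointCallback .*?"
    r"Key: <ud>(?P<key>[^<]+)</ud>, subdoc operation failed .*?"
    r"err: (?P<err>.+)$"
)


def tally_checkpoint_failures(lines):
    """Count periodic-checkpoint subdoc failures per (key, error) pair."""
    counts = Counter()
    for line in lines:
        match = CHECKPOINT_FAILURE.search(line)
        if match:
            counts[(match.group("key"), match.group("err").strip())] += 1
    return counts


def print_summary(path):
    """Print failure counts, most frequent first, for a log file at `path`."""
    with open(path, encoding="utf-8", errors="replace") as log:
        for (key, err), n in tally_checkpoint_failures(log).most_common():
            print(f"{n:6d}  {key}  ({err})")
```

      Calling `print_summary` on the eventing log extracted from the cbcollect_info archive would show whether the temporary failures are confined to a few vbuckets (such as vb 700 and vb 646 in the excerpt) or spread across many.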
      

      Automation Log : http://qa.sc.couchbase.com/job/temp_rebalance_even/619/consoleText

      cbcollect_info of eventing node is attached.

      Abhishek Singh was working on this, so I am assigning it to him.

          People

            vikas.chaudhary Vikas Chaudhary
            Balakumaran.Gopal Balakumaran Gopal