Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-30213

[Eventing on Windows] Unexpected server error seen when trying to failover a kv node

    XMLWordPrintable

Details

    • Untriaged
    • Windows 64-bit
    • Unknown

    Description

      Build : 5.5.0-2907
      Test :
      ./testrunner -i /tmp/testexec.23576.ini -p get-cbcollect-info=True,GROUP=bucket_op -t eventing.eventing_rebalance.EventingRebalance.test_kv_eventing_failover_and_kv_eventing_rebalance_simultaneously,nodes_init=6,services_init=kv-kv-eventing-eventing-eventing-index:n1ql,dataset=default,groups=simple,reset_services=True,doc-per-day=10,server_failed_over=1,services_in=kv,server_out=4,GROUP=bucket_op

      Issue :
      In the above test, after deploying function, the next step is to failover a KV node. When trying to do so, the failover API returns an Unexpected server error.

      2018-06-21 04:52:02 | INFO | MainProcess | Cluster_Thread | [task._failover_nodes] Failing over 172.23.104.133:8091 with graceful=False
      2018-06-21 04:53:05 | ERROR | MainProcess | Cluster_Thread | [rest_client._http_request] POST http://172.23.104.130:8091/controller/failOver body: otpNode=ns_1%40172.23.104.133 headers:

      {'Content-Type': 'application/x-www-form-urlencoded', 'Accept': '*/*', 'Authorization': 'Basic QWRtaW5pc3RyYXRvcjpwYXNzd29yZA==\n'}

      error: 500 reason: unknown ["Unexpected server error, request logged."] auth: Administrator:password
      2018-06-21 04:53:05 | ERROR | MainProcess | Cluster_Thread | [rest_client.fail_over] fail_over node ns_1@172.23.104.133 error : ["Unexpected server error, request logged."]

      The following error could be seen in the debug log for 172.23.104.130 node:

      [ns_server:error,2018-06-21T04:53:05.937-07:00,ns_1@172.23.104.130:<0.11781.0>:menelaus_web:loop:143]Server error during processing: ["web request failed",
                                       {path,"/controller/failOver"},
                                       {method,'POST'},
                                       {type,exit},
                                       {what,
                                        {killed,
                                         {gen_fsm,sync_send_event,
                                          [{via,leader_registry,ns_orchestrator},
                                           {failover,['ns_1@172.23.104.133'],false},
                                           infinity]}}},
                                       {trace,
                                        [{gen_fsm,sync_send_event,3,
                                          [{file,
                                            "c:/cygwin64/home/vagrant/OTP_SR~1/lib/stdlib/src/gen_fsm.erl"},
                                           {line,214}]},
                                         {menelaus_web_cluster,handle_failover,1,
                                          [{file,"src/menelaus_web_cluster.erl"},
                                           {line,646}]},
                                         {request_throttler,do_request,3,
                                          [{file,"src/request_throttler.erl"},
                                           {line,59}]},
                                         {menelaus_web,loop,2,
                                          [{file,"src/menelaus_web.erl"},
                                           {line,121}]},
                                         {mochiweb_http,headers,5,
                                          [{file,
                                            "c:/Jenkins/workspace/couchbase-server-windows/couchdb/src/mochiweb/mochiweb_http.erl"},
                                           {line,94}]},
                                         {proc_lib,init_p_do_apply,3,
                                          [{file,
                                            "c:/cygwin64/home/vagrant/OTP_SR~1/lib/stdlib/src/proc_lib.erl"},
                                           {line,239}]}]}]
      
      

      The same test passes on CentOS.

      Logs attached.

      Attachments

        No reviews matched the request. Check your Options in the drop-down menu of this sections header.

        Activity

          People

            mihir.kamdar Mihir Kamdar (Inactive)
            mihir.kamdar Mihir Kamdar (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty