Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-50442

MultiNodeFailover: Failover failed with reason 'Failed to get failover info for bucket "default"'

    XMLWordPrintable

Details

    Description

       

      Steps:

      • Five node cluster

        +----------------+-------------+-----------------+-----------+----------+-----------------------+-------------------+
        | Node           | Services    | CPU_utilization | Mem_total | Mem_free | Swap_mem_used         | Active / Replica  |
        +----------------+-------------+-----------------+-----------+----------+-----------------------+-------------------+
        | 172.23.105.212 | kv          | 0               | 0.0 Byte  | 0.0 Byte | 0.0 Byte / 0.0 Byte   | 0 / 0             |
        | 172.23.105.244 | index       | 0.426599749059  | 3.91 GiB  | 3.29 GiB | 92.50 MiB / 3.50 GiB  | 0 / 0             |
        | 172.23.105.155 | kv          | 4.7800661073    | 3.91 GiB  | 2.98 GiB | 115.79 MiB / 3.50 GiB | 0 / 0             |
        | 172.23.105.213 | index, n1ql | 1.28237364848   | 3.91 GiB  | 3.17 GiB | 64.89 MiB / 3.50 GiB  | 0 / 0             |
        | 172.23.105.211 | kv          | 4.60642875221   | 3.91 GiB  | 3.20 GiB | 146.50 MiB / 3.50 GiB | 0 / 0             |
        +----------------+-------------+-----------------+-----------+----------+-----------------------+-------------------+

      • Couchbase bucket with replicas=2

        +---------+-----------+-----------------+----------+------------+-----+-------+-----------+-----------+-----------+-----+
        | Bucket  | Type      | Storage Backend | Replicas | Durability | TTL | Items | RAM Quota | RAM Used  | Disk Used | ARR |
        +---------+-----------+-----------------+----------+------------+-----+-------+-----------+-----------+-----------+-----+
        | default | couchbase | couchstore      | 2        | none       | 0   | 3     | 5.09 GiB  | 81.97 MiB | 28.36 MiB | 100 |
        +---------+-----------+-----------------+----------+------------+-----+-------+-----------+-----------+-----------+-----+

      • Auto Failover maxCount=5 and timeout=60
      • Perform swap rebalance

        +----------------+-------------+-----------------------+----------------+--------------+
        | Nodes          | Services    | Version               | CPU            | Status       |
        +----------------+-------------+-----------------------+----------------+--------------+
        | 172.23.105.212 | kv          | 7.1.0-2073-enterprise | 13.9653414883  | Cluster node |
        | 172.23.105.244 | index       | 7.1.0-2073-enterprise | 0.853842290306 | --- OUT ---> |
        | 172.23.105.155 | kv          | 7.1.0-2073-enterprise | 6.73957273652  | --- OUT ---> |
        | 172.23.105.213 | index, n1ql | 7.1.0-2073-enterprise | 4.83666751076  | Cluster node |
        | 172.23.105.211 | kv          | 7.1.0-2073-enterprise | 5.13274336283  | Cluster node |
        | 172.23.105.245 | kv          |                       |                | <--- IN ---  |
        | 172.23.100.22  | kv          |                       |                | <--- IN ---  |
        +----------------+-------------+-----------------------+----------------+--------------+

      • Introduce failures on following nodes

        +----------------+----------+-------------+----------------+
        | Node           | Services | Node status | Failover type  |
        +----------------+----------+-------------+----------------+
        | 172.23.105.244 | index    | active      | stop_couchbase |
        | 172.23.105.211 | kv       | active      | stop_memcached |
        | 172.23.105.155 | kv       | active      | stop_memcached |
        +----------------+----------+-------------+----------------+

      Failover failed with following reason:

      Failover exited with reason {failover_failed,"default",
      "Failed to get failover info for bucket \"default\": ['ns_1@172.23.100.22']"}.
      Rebalance Operation Id = 565989613b752ac3f171b34eea50c86b

       

       

      Attachments

        For Gerrit Dashboard: MB-50442
        # Subject Branch Project Status CR V

        Activity

          People

            ashwin.govindarajulu Ashwin Govindarajulu
            ashwin.govindarajulu Ashwin Govindarajulu
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty