Uploaded image for project: 'Couchbase Monitoring and Observability Stack'
  1. Couchbase Monitoring and Observability Stack
  2. CMOS-465

multimanager crashes when deleting a cluster

    XMLWordPrintable

Details

    • Bug
    • Resolution: Done
    • Major
    • None
    • None
    • cluster-monitor
    • None

    Description

      deleting a cluster can cause a segfault, this seems to be because heartbeat loops are buffered into a queue and maps aren't for concurrent read/write

      [cbmultimanager.sh]panic: runtime error: invalid memory address or nil pointer dereference
      [cbmultimanager.sh][signal SIGSEGV: segmentation violation code=0x1 addr=0x20 pc=0xa7556a]
      [cbmultimanager.sh]
      [cbmultimanager.sh]goroutine 104 [running]:
      [cbmultimanager.sh]github.com/couchbaselabs/cbmultimanager/cluster-monitor/pkg/manager.(*SingleClusterManager).heartLoop(0xc000466000)
      [cbmultimanager.sh]	/tmp/cmos-build/cluster-monitor/pkg/manager/cluster_manager.go:125 +0x10a
      [cbmultimanager.sh]created by github.com/couchbaselabs/cbmultimanager/cluster-monitor/pkg/manager.(*SingleClusterManager).Start
      [cbmultimanager.sh]	/tmp/cmos-build/cluster-monitor/pkg/manager/cluster_manager.go:79 +0xd8
      [webserver.sh]2022/06/23 13:45:37 [error] 82#82: *160 upstream prematurely closed connection while reading response header from upstream, client: 127.0.0.1, server: , request: "DELETE /api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd HTTP/1.1", upstream: "http://127.0.0.1:7196/api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd", host: "localhost:8080", referrer: "http://localhost:8080/monitor/clusters?statusFilter=%5B%22warn%22,%22alert%22,%22info%22,%22good%22,%22missing%22%5D"
      [webserver.sh]2022/06/23 13:45:37 [warn] 82#82: *160 upstream server temporarily disabled while reading response header from upstream, client: 127.0.0.1, server: , request: "DELETE /api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd HTTP/1.1", upstream: "http://127.0.0.1:7196/api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd", host: "localhost:8080", referrer: "http://localhost:8080/monitor/clusters?statusFilter=%5B%22warn%22,%22alert%22,%22info%22,%22good%22,%22missing%22%5D"
      [webserver.sh]2022/06/23 13:45:37 [error] 82#82: *160 connect() failed (111: Connection refused) while connecting to upstream, client: 127.0.0.1, server: , request: "DELETE /api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd HTTP/1.1", upstream: "http://[::1]:7196/api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd", host: "localhost:8080", referrer: "http://localhost:8080/monitor/clusters?statusFilter=%5B%22warn%22,%22alert%22,%22info%22,%22good%22,%22missing%22%5D"
      [webserver.sh]2022/06/23 13:45:37 [warn] 82#82: *160 upstream server temporarily disabled while connecting to upstream, client: 127.0.0.1, server: , request: "DELETE /api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd HTTP/1.1", upstream: "http://[::1]:7196/api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd", host: "localhost:8080", referrer: "http://localhost:8080/monitor/clusters?statusFilter=%5B%22warn%22,%22alert%22,%22info%22,%22good%22,%22missing%22%5D"
      [webserver.sh]127.0.0.1 - - [23/Jun/2022:13:45:37 +0000] "DELETE /api/v1/clusters/cfe0e7f8058c1afd458ccdf81ffdcffd HTTP/1.1" 502 157 "http://localhost:8080/monitor/clusters?statusFilter=%5B%22warn%22,%22alert%22,%22info%22,%22good%22,%22missing%22%5D" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/15.4 Safari/605.1.15"
      

      Attachments

        No reviews matched the request. Check your Options in the drop-down menu of this sections header.

        Activity

          People

            Alex.emery Alex Emery (Inactive)
            Alex.emery Alex Emery (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty