Couchbase Monitoring and Observability Stack / CMOS-216

Output heartbeat failures as Prometheus metrics

Details

    • Improvement
    • Resolution: Unresolved
    • Minor
    • 1.0
    • None
    • cluster-monitor
    • None

    Description

      Currently there's no way for outside users to know that a cluster heartbeat failed, except by querying the REST API.

      We should expose heartbeat-related stats to Prometheus. I'm considering:

      • Counter of failures (can alert on increase)
      • Current status of each cluster as a gauge (can show in Grafana)

      Each metric would, of course, be labelled with the cluster it refers to.

          People

            Assignee: Unassigned
            Reporter: Marks Polakovs (marks.polakovs, Inactive)
            Votes: 0
            Watchers: 2