Details
-
Improvement
-
Resolution: Unresolved
-
Major
-
7.0.0, 7.0.1, 7.0.2, 7.0.3, 7.1.0
-
1
Description
To effectively implement alerting around XDCR the status of a replication needs to be known. The ns_server REST APIs return a status of "running", "paused", "error". If this value is exposed, I can create an alert on whether or not the replication has been paused as an example a customer may wish to alert on the fact a replication has been paused for more than 1 days time.
Prometheus metrics must be a number:
xdcr_status{status="running"} 1
xdcr_status{status="paused"} 0
xdcr_status{status="error"} 0
or
xdcr_status{status="running"} 0
xdcr_status{status="paused"} 1
xdcr_status{status="error"} 2
Attachments
Issue Links
Gerrit Reviews
For Gerrit Dashboard: MB-50973 | ||||||
---|---|---|---|---|---|---|
# | Subject | Branch | Project | Status | CR | V |
190807,2 | Revert "MB-50973: prometheus pipeline status to be a binary gauge" | master | goxdcr | Status: MERGED | +2 | +1 |
190990,5 | MB-50973: Add prometheus stats for pipeline running status | master | goxdcr | Status: MERGED | +2 | +1 |
191134,2 | MB-50973: prometheus pipeline status to be a binary gauge | master | goxdcr | Status: MERGED | +2 | +1 |
191169,5 | MB-50973: Introduce custom labels to prometheusExporter | master | goxdcr | Status: MERGED | +2 | +1 |