P0 |
Dashboard should be organized in hierarchical order:
- Universe of Couchbase - Provides a high-level snapshot of health across multiple Couchbase clusters deployed (configured)
- Cluster names and their health status
- Couchbase cluster - Provides a high-level snapshot of entire cluster-level health
- Individual node health
- Individual services health (Data, Query, Index, Search, Analytics, Eventing, XDCR, Backup)
- System health (CPU, Memory, Disk, Network)
- System Information
- Breakdown of CPU % by Couchbase Process
- Breakdown of Memory by Couchbase Server Process
- Network / Disk Utilization by Couchbase Server Process
- Couchbase services
- Data - Bucket overview/ops, the client connected, users connected, response times/mctiming, resource configuration info, [7.0 and above] scopes overview, and count of collections
- Query - Overall query performance, slowest queries, most common queries, most impactful queries, prepared statements and their performance
- Index - Overall index performance, scan times, number of requests/sec, avg. item size, item count, never been scanned, index disk size
- Search - Overall search performance, scan times, number of requests/sec, avg. item size, item count, never been scanned, index disk size
- Eventing - Overall eventing performance, processing metrics, error counts/rates
- XDCR - Aggregated metrics across all replications, source, and destination replication names (not UUID), individual replication with filter, % complete, replication state, docs replicated, replication backlog, network bandwidth used, changes left
|
Some of the system metrics breakdown of CPU % by process, network utilization will require the node-exporter to be installed to gather those metrics |