Skip to main content

WEKA

Monitor WEKA storage instances

Info

For accessing CoreWeave Grafana Dashboards instructions, see Access CoreWeave Grafana Dashboards.

The WEKA dashboard provides metrics related to WEKA storage instances. It contains the following panel groups and panels.

Cluster-wide info

PanelDescription
Alerts CounterDisplays the total count of active alerts.
Failure domainDisplays information about the failure domain configuration or status.
Protection (Stripe width + Redundancy level)Shows details about the data protection scheme, including stripe width and redundancy level.
StatusShows the overall operational status.
Protection StatusShows the current data protection status.
Protection Status Over TimeShows data protection status.
ContainersShows metrics related to running containers, such as their count or state.
Cluster CapacitiesDisplays the various capacity metrics of the cluster.
ProcessesDisplays metrics about running processes on the cluster Nodes.
Cluster drivesDisplays information or status about the drives in the cluster.
Max Cpu UtilizationTracks the maximum CPU utilization across the cluster.
IopsShows the total Input/Output Operations Per Second (IOPS) of the cluster.
ThroughputShows the total data throughput of the cluster.
Drive Read/Write Requests/SecShows th rate of read and write requests per second to the storage drives.
NIC Data rate per container typeShows the Network Interface Card (NIC) data rate, categorized by the type of container.
AVG Drive Read/Write LatencyShows the average latency for drive read and write operations.
Drive Read/Write ThroughputShows the data throughput for drive read and write operations.

Per FS info

PanelDescription
Average Read Latency by FSShows the average latency of read operations, broken down by individual filesystem (FS).
Average Write Latency by FSShows the average latency of write operations, broken down by individual filesystem (FS).
AVG Read Requests/Sec by FSShows the average number of read requests per second, with data grouped by filesystem (FS).
AVG Write Requests/Sec by FSShows the average number of write requests per second, with data grouped by filesystem (FS).
Read Throughput by FSShows the data read throughput, broken down by filesystem (FS).
Write Throughput by FSShows the data write throughput, broken down by filesystem (FS).

Top Drives info

PanelDescription
Drive Smart Media ErrorsShows the count of media-related errors (such as bad sectors) reported by the drive's S.M.A.R.T. (Self-Monitoring, Analysis, and Reporting Technology) system.
Drive Smart Critical WarningsShows any critical warnings issued by the drive's S.M.A.R.T. system, which can be indicators of impending drive failure.
Drive Smart Composite TempShows the composite temperature of the drives, a key health metric reported by S.M.A.R.T.
Failed / Inactive DrivesShows any drives that are currently in a failed state or are otherwise inactive in the system.

Top Processes info

PanelDescription
Network ReceivedDisplays the amount of incoming network traffic.
Network TransmittedDisplays the amount of outgoing network traffic.
Drive Read RequestsDisplays the number or rate of read requests being made to the storage drives.
Drive Write RequestsDisplays the number or rate of write requests being made to the storage drives.