Skip to main content

Revision Overview

Monitor overall Knative revisions

Info

For accessing CoreWeave Grafana Dashboards instructions, see Access CoreWeave Grafana Dashboards.

The Knative Revision Overview dashboard provides a comprehensive look into the performance and health of a specific service revision. It visualizes key operational metrics like request volume, response times, and error rates. It also offers insights into the autoscaler's behavior.

PanelDescription
Request VolumeShow the overall volume of requests.
Success RateShows the success rate of requests.
Inference TimeShows the average inference time.
Response TimeShows the average response time.
PodsShows the total number of Pods.
Pods Actively Serving (2m)Shows the number of Pods actively serving in the last 2 minutes.
Request Volume by RevisionShows the volume of requests, broken down by service revision.
Request Volume by PodShows the volume of requests, broken down by individual Pod.
5xx Errors by PodTracks the count of 5xx server errors, grouped by Pod.
Request Volume inter Pod StdDevShows the standard deviation of request volume between Pods.
Response Time by Revision (Successful with Queue)Shows the response time (including time spent in queue) for successful requests, grouped by revision.
Response Time by PodDisplays the response time, broken down by individual Pod.
Response Time by Response Code ClassShows response times, categorized by the response code class (e.g., 2xx, 4xx, 5xx).
Response Time inter Pod StdDevShows the standard deviation of response times between Pods.
Observed ConcurrencyShows the observed level of concurrent requests per Pod.
Revision Pod CountsShows the number of Pods for each service revision over time.
Activator Request ConcurrencyShows the concurrent requests being handled by the activator.
Pod Queue DepthShows the number of requests waiting in the queue for each Pod.
Activator Request CountShows the total number of requests handled by the activator.
Total Pod Queue DepthShows the total number of requests waiting in queues across all Pods.
Error Code BreakdownShows requests by their resulting error codes.
Unique Pod count returning errors in same minuteShows the count of unique Pods that have returned an error within the same minute.
Pods Actively Serving (2m)Shows the number of Pods actively serving requests over a 2-minute window.