Revision Overview

Monitor overall Knative revisions

To view the dashboard, go to the Revision Overview dashboard.

Info

For accessing CoreWeave Grafana Dashboards instructions, see Access CoreWeave Grafana Dashboards.

The Knative Revision Overview dashboard provides a comprehensive look into the performance and health of a specific service revision. It visualizes key operational metrics like request volume, response times, and error rates. It also offers insights into the autoscaler's behavior.

Panel	Description
Request Volume	Show the overall volume of requests.
Success Rate	Shows the success rate of requests.
Inference Time	Shows the average inference time.
Response Time	Shows the average response time.
Pods	Shows the total number of Pods.
Pods Actively Serving (2m)	Shows the number of Pods actively serving in the last 2 minutes.
Request Volume by Revision	Shows the volume of requests, broken down by service revision.
Request Volume by Pod	Shows the volume of requests, broken down by individual Pod.
5xx Errors by Pod	Tracks the count of 5xx server errors, grouped by Pod.
Request Volume inter Pod StdDev	Shows the standard deviation of request volume between Pods.
Response Time by Revision (Successful with Queue)	Shows the response time (including time spent in queue) for successful requests, grouped by revision.
Response Time by Pod	Displays the response time, broken down by individual Pod.
Response Time by Response Code Class	Shows response times, categorized by the response code class (e.g., 2xx, 4xx, 5xx).
Response Time inter Pod StdDev	Shows the standard deviation of response times between Pods.
Observed Concurrency	Shows the observed level of concurrent requests per Pod.
Revision Pod Counts	Shows the number of Pods for each service revision over time.
Activator Request Concurrency	Shows the concurrent requests being handled by the activator.
Pod Queue Depth	Shows the number of requests waiting in the queue for each Pod.
Activator Request Count	Shows the total number of requests handled by the activator.
Total Pod Queue Depth	Shows the total number of requests waiting in queues across all Pods.
Error Code Breakdown	Shows requests by their resulting error codes.
Unique Pod count returning errors in same minute	Shows the count of unique Pods that have returned an error within the same minute.
Pods Actively Serving (2m)	Shows the number of Pods actively serving requests over a 2-minute window.