Skip to main content

Internet Transit

Monitor your Node and service-level traffic and connection statistics.

The Internet Transit dashboard provides a near-real-time view into how much traffic flows through your backbone egress/ingress points, how that traffic is distributed across internal services, and how well the network is performing from an end-user perspective.

By default, the panels refresh every minute with a 5-minute time window. Both of these can be adjusted at the top-right of the dashboard.

Customers can use this single pane of glass to:

  • Spot spikes or drops in throughput that may indicate an incident or a saturation event.
  • Correlate traffic behavior with latency (RTT) and connection counts to understand user impact.
  • Drill into per-Node or per-service views for focused troubleshooting.

Prerequisites

You must be a member of the admin, metrics, or write groups to access Grafana dashboards.

Open the dashboard

  1. Log in to the CoreWeave Cloud Console.
  2. In the left navigation, select Grafana to launch your Managed Grafana instance.
  3. Click Dashboards.
  4. Expand Network - Backbone, then choose Internet Transit.
  5. Select Data Source, Zone, Cluster Org, and Cluster at the top of the page.

If you are already logged in to CoreWeave Cloud Console, you can open the Internet Transit dashboard directly from this link.

Filters and parameters

Use these filters at the top-left of the page to choose the data you want to view:

  • Data Source: The Prometheus data source selector.
  • Zone: The Availability Zone (AZ) to view.
  • Cluster Org: The organization that owns the cluster.
  • Cluster: The specific cluster in the organization to view.

Set the time range and refresh rate parameters at the top-right of the page. The default time range is 5 minutes, and the default refresh rate is 1 minute.

Panel descriptions

The dashboard has several panels with views into the network traffic and connection statistics.

PanelDescription
Traffic - Inbound Per NodeA sortable table where each row represents a Node. The Throughput column is rendered as a horizontal gauge, making it easy to see relative utilization at a glance. Click a Node to open the Node Details dashboard for deep-dive analysis.
Traffic - Outbound Per NodeThe complement to Inbound Per Node, but for egress traffic. This panel is useful for determining which Nodes may require additional bandwidth or rate-limiting.
Traffic - Inbound TotalA single card showing the aggregate inbound rate (Gb/s).
Traffic - Outbound TotalThe complement to Inbound Total, showing the aggregate outbound rate (Gb/s).
Average RTT per ServiceA time-series graph showing the rolling 5-minute average round-trip time (RTT) for both internal and external paths, grouped by service signature. This panel helps identify latency trends and potential congestion points.
Traffic - Inbound Per ServiceA stacked-area time-series chart that breaks down inbound traffic by service and categorized by third-party cloud vendors. The legend is interactive, allowing you to toggle visibility and sort by current throughput, minimum, or maximum.
Note: New traffic is labeled (Being analyzed) until it's processed and categorized.
Traffic - Outbound Per ServiceThe complement to Inbound Per Service, but for egress traffic. This panel is useful for determining which services are generating the most outbound traffic.
Traffic - TotalA time-series graph showing the total inbound and outbound traffic. Inbound is shown in green above the X-axis; outbound is shown in yellow below. Helps visualize the net traffic balance and identify asymmetric patterns.
Connections - TotalA time-series graph showing the total number of concurrent connections across all hosts. Helps identify connection flood or drop-off events.
Connections - Per NodeA stacked-area time-series chart that breaks down the number of concurrent connections by Node. This helps to identify connection hot spots and potential load-balancing issues.
Connections - Per ServiceA stacked-area time-series chart that breaks down the number of concurrent connections by service object. This helps to identify which services are holding the most active connections.

Benefits of this dashboard

The Internet Transit dashboard fuses traffic volume, latency, and connection telemetry into one coherent layout that provides a comprehensive view of your network's health. It's particularly useful for:

  1. Spotting Anomalies in traffic patterns, such as sudden spikes or drops that may indicate an incident or saturation event.
  2. Correlating Traffic Behavior with latency (RTT) and connection counts to understand user impact.
  3. Drilling into Per-Node or Per-Service Views for focused troubleshooting, allowing you to quickly identify the source of issues.
  4. Capacity Planning by providing insights into which Nodes or services are under the most load, helping you make informed decisions about scaling and resource allocation.
  5. Monitoring Third-Party Services to understand how throughput to and from foreign clouds affect your workloads.

Together, these capabilities shorten the time required to detect network issues, guide capacity planning, and ultimately improve end-user experience.