Workload Scheduling - CoreWeave Docs

How do CPU and memory requests work with GPU Pods?

GPU Pods request GPUs through the nvidia.com/gpu resource, but their CPU and memory requests behave like any other Kuber …
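
As a rough illustration of the request shape described above, the following Python sketch builds a Pod manifest that asks for one GPU through `nvidia.com/gpu` alongside ordinary CPU and memory requests. The Pod name, image, and the specific CPU and memory values are placeholders chosen for the example, not CoreWeave recommendations.

```python
import json


def gpu_pod_manifest(name, image, gpus, cpu, memory):
    """Build a minimal Pod manifest that requests GPUs, CPU, and memory."""
    resources = {
        # Extended resources such as nvidia.com/gpu are set under limits;
        # Kubernetes uses the limit as the request when requests omits it.
        "limits": {"nvidia.com/gpu": gpus},
        # CPU and memory are requested like in any other Pod.
        "requests": {"cpu": cpu, "memory": memory},
    }
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {"name": "main", "image": image, "resources": resources}
            ],
        },
    }


# Placeholder values, for illustration only.
manifest = gpu_pod_manifest(
    name="gpu-example",
    image="example.com/trainer:latest",
    gpus=1,
    cpu="8",
    memory="64Gi",
)
print(json.dumps(manifest, indent=2))
```

The printed JSON can be saved to a file and passed to `kubectl apply -f`, which accepts JSON as well as YAML manifests.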

Use a nodeSelector (or nodeAffinity) on your Pod to target a specific GPU type. CoreWeave labels GPU Nodes with hardware …
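
A minimal sketch of adding a `nodeSelector` to a Pod manifest follows. The label key `example.com/gpu-type` and value `example-gpu` are hypothetical stand-ins; substitute the label that your GPU Nodes actually carry (for example, as shown by `kubectl get nodes --show-labels`).

```python
import copy


def with_node_selector(pod, label_key, label_value):
    """Return a copy of a Pod manifest pinned to Nodes that carry a label."""
    pinned = copy.deepcopy(pod)  # leave the caller's manifest untouched
    selector = pinned["spec"].setdefault("nodeSelector", {})
    selector[label_key] = label_value
    return pinned


base_pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "gpu-example"},
    "spec": {
        "containers": [
            {
                "name": "main",
                "image": "example.com/trainer:latest",  # placeholder image
                "resources": {"limits": {"nvidia.com/gpu": 1}},
            }
        ]
    },
}

# HYPOTHETICAL label key and value, used only to show the manifest shape.
pinned_pod = with_node_selector(base_pod, "example.com/gpu-type", "example-gpu")
print(pinned_pod["spec"]["nodeSelector"])
```

A `nodeSelector` is a hard constraint: the scheduler only considers Nodes whose labels match every key and value listed.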

Install Kueue on your CKS cluster from the CoreWeave-published Helm chart (coreweave/cks-kueue), then define ResourceFla …

Kubernetes assigns each Pod one of three Quality-of-Service classes based on its resource requests and limits, and the k …
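
The classification can be approximated with a short Python function. This is a simplified sketch of the upstream Kubernetes rules, not the kubelet's implementation: it compares quantities as plain strings (so `"1"` and `"1000m"` would be treated as different) and only inspects the containers passed to it.

```python
def qos_class(containers):
    """Approximate the QoS class of a Pod from its containers' resources.

    Each container is a dict shaped like a Pod spec container, with an
    optional "resources" key holding "requests" and "limits" dicts.
    """
    any_set = False      # does any container set a CPU or memory value?
    guaranteed = True    # do all containers have request == limit for both?

    for container in containers:
        resources = container.get("resources", {})
        requests = resources.get("requests", {})
        limits = resources.get("limits", {})
        for name in ("cpu", "memory"):
            limit = limits.get(name)
            # A missing request defaults to the limit when a limit is set.
            request = requests.get(name, limit)
            if limit is not None or request is not None:
                any_set = True
            if limit is None or request != limit:
                guaranteed = False

    if not any_set:
        return "BestEffort"
    return "Guaranteed" if guaranteed else "Burstable"


equal = {"requests": {"cpu": "4", "memory": "16Gi"},
         "limits": {"cpu": "4", "memory": "16Gi"}}
requests_only = {"requests": {"cpu": "4", "memory": "16Gi"}}

print(qos_class([{"resources": equal}]))
print(qos_class([{"resources": requests_only}]))
print(qos_class([{}]))
```

Under these rules, a GPU Pod that sets only CPU and memory requests, with no matching limits, is classified as Burstable; setting limits equal to requests for both CPU and memory in every container makes it Guaranteed.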

For most GPU training and inference workloads, set CPU and memory requests to roughly the per-GPU share of the instance, …
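
The per-GPU share is simple arithmetic: divide the instance's CPU and memory by its GPU count, then multiply by the number of GPUs the Pod requests. The sketch below assumes a hypothetical 8-GPU instance with 128 vCPUs and 2048 GiB of memory; these figures are illustrative and do not describe a specific CoreWeave instance type. The `headroom` parameter is also an assumption of this example, included as one way to leave room for system overhead.

```python
def per_gpu_share(node_cpu, node_memory_gib, node_gpus, gpus_requested,
                  headroom=1.0):
    """Return CPU and memory requests proportional to the GPUs requested.

    headroom is a fraction in (0, 1] applied to the proportional share;
    it is an assumption of this sketch, not a documented CoreWeave setting.
    """
    if not 0 < gpus_requested <= node_gpus:
        raise ValueError("gpus_requested must be between 1 and node_gpus")
    if not 0 < headroom <= 1:
        raise ValueError("headroom must be in (0, 1]")

    fraction = gpus_requested / node_gpus
    cpu = int(node_cpu * fraction * headroom)            # round down
    memory = int(node_memory_gib * fraction * headroom)  # round down
    return {"cpu": str(cpu), "memory": f"{memory}Gi"}


# Hypothetical instance: 128 vCPUs, 2048 GiB memory, 8 GPUs.
print(per_gpu_share(128, 2048, 8, gpus_requested=2))
print(per_gpu_share(128, 2048, 8, gpus_requested=2, headroom=0.9))
```

In practice, base the calculation on the Node's allocatable CPU and memory (as reported by `kubectl describe node`) rather than the raw instance totals, since part of each Node is reserved for system components.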

The CKS autoscaler scales a Node Pool up only when adding another Node of that pool’s instance type would make a pending …

GPU Pods stay Pending when they request a resource the available GPU Nodes do not provide, when no Node Pool of the requ …

The Kubernetes Cluster Autoscaler decides that a Node is unneeded using its own utilization-based logic. It does not con …

A Pod stays Pending when the Kubernetes scheduler cannot place it on any Node. The Pod’s FailedScheduling event names th …

Last modified on July 28, 2026