Compute nodes
SUNK compute nodes are Kubernetes Pods running slurmd. They are mapped to physical Nodes in the cluster. Jobs run within Slurm cgroups on these Nodes, which are typically large machines with high memory, fast networking, and either GPUs or significant CPU resources. Compute nodes are the “workhorses” of the Slurm cluster: they define and manage the specific hardware, and the resources on it, used to run jobs submitted to Slurm.
In SUNK, Slurm nodes run in Kubernetes Pods. Slurm nodes are not the same as Kubernetes Nodes, the worker machines that run Kubernetes Pods. To maintain a distinction between the two in this documentation, Kubernetes Nodes are capitalized while Slurm nodes are not.
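To make the Pod-to-Node relationship concrete, the following is a minimal sketch, not SUNK's own tooling. It takes a Kubernetes Pod list (the JSON shape returned by the Kubernetes API, with the standard `metadata.name`, `spec.containers`, and `spec.nodeName` fields) and reports which Kubernetes Node each slurmd Pod is scheduled on. The function name, the assumption that the slurmd container is named `slurmd`, and every Pod and Node name in the sample data are hypothetical and chosen only for illustration.

```python
def slurmd_pods_to_k8s_nodes(pod_list, container_name="slurmd"):
    """Map each Pod that runs a slurmd container to its Kubernetes Node.

    pod_list is a dict shaped like a Kubernetes PodList.
    container_name is an assumption: the real container name may differ.
    """
    mapping = {}
    for pod in pod_list.get("items", []):
        spec = pod.get("spec", {})
        containers = spec.get("containers", [])
        # Keep only Pods that have a container with the assumed slurmd name.
        if any(c.get("name") == container_name for c in containers):
            # nodeName is None until the Pod has been scheduled.
            mapping[pod["metadata"]["name"]] = spec.get("nodeName")
    return mapping


# Hypothetical sample data: two slurmd Pods and one unrelated Pod.
sample = {
    "items": [
        {
            "metadata": {"name": "example-compute-0"},
            "spec": {"nodeName": "k8s-node-a", "containers": [{"name": "slurmd"}]},
        },
        {
            "metadata": {"name": "example-compute-1"},
            "spec": {"nodeName": "k8s-node-b", "containers": [{"name": "slurmd"}]},
        },
        {
            "metadata": {"name": "example-other"},
            "spec": {"nodeName": "k8s-node-a", "containers": [{"name": "web"}]},
        },
    ]
}

for pod_name, node_name in slurmd_pods_to_k8s_nodes(sample).items():
    print(f"slurmd Pod {pod_name} runs on Kubernetes Node {node_name}")
```

In this sketch the keys are slurmd Pods (Slurm nodes, lowercase) and the values are Kubernetes Nodes (capitalized), which mirrors the naming convention described above.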