GPU driver management features are now available in CKS Node Pools, allowing you to specify and target specific GPU driver versions for your workloads. This feature provides better control over driver compatibility and enables homogeneous driver environments across your clusters.Documentation Index
Fetch the complete documentation index at: https://docs.coreweave.com/llms.txt
Use this file to discover all available pages before exploring further.
Overview
GPU driver management features are now available across CoreWeave’s compute platforms, providing enhanced control over driver versions and improved workload compatibility. These features allow you to:- Target specific driver versions in both CKS Node Pools and SUNK SLURM workloads
- Ensure homogeneous driver environments across your compute resources
- Maintain compatibility with specific CUDA versions and application requirements
- Plan driver upgrades with better visibility and control
Supported driver versions
CoreWeave supports all tested release driver (TRD) versions from NVIDIA. See Supported driver versions for more information.Key benefits
| Benefit | Description |
|---|---|
| Driver compatibility | Ensure workloads run on compatible driver versions |
| Homogeneous environments | Maintain consistent driver versions across compute resources |
| Upgrade planning | Better visibility and control over driver upgrade schedules |
| Application stability | Reduce driver-related compatibility issues in production workloads |
| Resource optimization | Target workloads to Nodes with optimal driver versions for their needs |
Usage
CKS
- Node Pool level configuration: Configure Node Pools to use a specific driver version using the
gpu.versionfield in the Node Pool manifest - Pod scheduling: Schedule Pods onto Nodes specified driver versions using Node selectors and affinity rules
- Kubernetes-native integration: Use Node selectors and affinity rules to schedule Pods onto Nodes with specific driver versions
- Helm chart support: Use the CoreWeave Helm chart to deploy and manage GPU driver versions in your CKS clusters