Pending when they request a resource the available GPU Nodes do not provide, when no Node Pool of the requested instance type has capacity, when the Pod’s nodeSelector does not match the labels on any Node, or when its tolerations do not cover the taints on the Nodes. Run kubectl describe pod [POD-NAME] and read the Events section at the bottom of the output. The scheduler’s FailedScheduling event states the exact mismatch (for example, 0/3 nodes are available: 3 Insufficient nvidia.com/gpu).
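When many Pods are Pending, it can help to break the scheduler message into per-reason counts instead of reading each one by eye. The following is a minimal sketch, not an official tool: the function name parse_scheduler_message is hypothetical, and it assumes the message follows the "N/M nodes are available: COUNT REASON, ..." layout shown in the example above, optionally followed by a "preemption:" clause.

```python
import re


def parse_scheduler_message(message):
    """Split a scheduler message into (available, total, {reason: count}).

    Assumes the "N/M nodes are available: COUNT REASON, ..." layout;
    other layouts raise ValueError.
    """
    head = re.match(r"\s*(\d+)/(\d+) nodes are available: (.*)", message, re.DOTALL)
    if head is None:
        raise ValueError(f"unrecognized scheduler message: {message!r}")
    available, total = int(head.group(1)), int(head.group(2))

    # Some Kubernetes versions append a "preemption: ..." clause; drop it.
    body = head.group(3).split(" preemption:")[0].strip().rstrip(".")

    reasons = {}
    for part in body.split(", "):
        item = re.match(r"(\d+) (.+)", part.strip())
        if item:
            reasons[item.group(2).rstrip(".")] = int(item.group(1))
    return available, total, reasons


if __name__ == "__main__":
    msg = "0/3 nodes are available: 3 Insufficient nvidia.com/gpu"
    print(parse_scheduler_message(msg))
```

A reason that starts with "Insufficient" points to the resource request or Node Pool capacity, while a reason that mentions the node affinity/selector or an untolerated taint points to the Pod's nodeSelector or tolerations.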
Common fixes: lower the GPU resource request, add or scale a Node Pool with the right instance type, or correct the nodeSelector to match the labels actually present on your Nodes. For full details, see Manage CKS Nodes.
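To see why a nodeSelector matches no Node, compare it against the labels your Nodes actually carry (kubectl get nodes --show-labels lists them). The sketch below mirrors the matching rule: a Node qualifies only if every key/value pair in the nodeSelector appears in its labels. The function name, Node names, and label values are hypothetical placeholders, not values from any real cluster.

```python
def nodes_matching_selector(node_selector, nodes):
    """Return the names of Nodes whose labels contain every nodeSelector pair.

    node_selector: dict of label key -> required value (the Pod's nodeSelector).
    nodes: dict of Node name -> dict of that Node's labels.
    """
    return [
        name
        for name, labels in nodes.items()
        if all(labels.get(key) == value for key, value in node_selector.items())
    ]


if __name__ == "__main__":
    # Hypothetical labels, e.g. copied from `kubectl get nodes --show-labels`.
    nodes = {
        "node-a": {"node.kubernetes.io/instance-type": "example-gpu-type"},
        "node-b": {"node.kubernetes.io/instance-type": "example-cpu-type"},
    }
    selector = {"node.kubernetes.io/instance-type": "example-gpu-type"}
    print(nodes_matching_selector(selector, nodes))
```

An empty result means the selector has to change or a Node Pool carrying those labels has to be added, which corresponds to the fixes listed above.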