Skip to main content
Inference on CKS gives you full control over your inference deployment stack using CoreWeave Kubernetes Service. Deploy inference runtimes, configure networking, and manage scaling directly through Kubernetes resources on CoreWeave GPU infrastructure. The following tutorials walk through deploying common inference runtimes on CKS:
Last modified on May 29, 2026