About Serverless Inference

Serverless Inference lets you deploy and serve AI models without provisioning or managing the underlying infrastructure. CoreWeave automatically handles scaling, routing, and resource allocation so you can focus on your models. CoreWeave delivers Serverless Inference through W&B Inference, so the canonical setup, usage, and reference documentation lives in the W&B docs. Read about getting started with Serverless Inference.