Create deployment
Create deployment
Create a new CoreWeave inference model deployment.
POST
Create deployment
- The API server is
https://api.coreweave.com. - Replace
{API_ACCESS_TOKEN}with your CoreWeave API access token. - For required permissions, see IAM Access Policies.
name, gatewayIds, runtime, resources,
model, autoscaling, and traffic. The available
runtime.engine and runtime.version values are returned
by GET /v1alpha1/inference/deployments/parameters.
Example request
Authorizations
CoreWeave API access token sent as a bearer token.
Body
application/json
Request for CreateDeployment
The name of the deployment
The gateways to associate the deployment with
Runtime selection and configuration
Resource configuration for the deployment
The model configuration
The autoscaling configuration
The traffic configuration for the deployment
The unique identifier of the deployment, UUID format
Disable the deployment
Response
OK
Response for CreateDeployment
The deployment that was created
Last modified on May 22, 2026