The CoreWeave Inference API provides programmatic control over inference gateways, model deployments, and capacity claims. The API is available at api.coreweave.com.
This page covers cross-cutting topics: authentication, protocols, status values, error formats, and the OpenAI-compatible inference endpoint. For per-endpoint request and response schemas, see the per-operation pages under each service in the left sidebar.
The Inference API is versioned as v1alpha1. APIs may change before general availability.
Authentication
All API requests must include a CoreWeave API access token in the Authorization header as a Bearer token. The token must belong to a user with the Inference Viewer or Inference Admin role, depending on the operation.
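As a minimal sketch of the header described above, the following Python snippet builds (but does not send) a request that carries the token as a Bearer credential. The `https://` scheme, the path `/v1alpha1/example-resources`, and the `CW_API_TOKEN` environment variable are assumptions made for illustration, not documented names; take the real path from the relevant per-operation page.

```python
import os
import urllib.request

# Host taken from this page; the https:// scheme is an assumption.
API_BASE = "https://api.coreweave.com"


def build_request(path: str, token: str) -> urllib.request.Request:
    """Return a GET request with the access token in the Authorization header."""
    return urllib.request.Request(
        API_BASE + path,
        headers={
            "Authorization": f"Bearer {token}",
            "Accept": "application/json",
        },
        method="GET",
    )


if __name__ == "__main__":
    # CW_API_TOKEN and the path below are hypothetical placeholders.
    token = os.environ.get("CW_API_TOKEN", "example-token")
    req = build_request("/v1alpha1/example-resources", token)
    print(req.get_method(), req.full_url)
    print(req.get_header("Authorization"))
```

Passing the resulting object to `urllib.request.urlopen` would send it; a token whose user lacks the required role should be expected to fail authorization.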
Protocol support
The Inference API supports multiple protocols:

| Protocol | Description |
|---|---|
| REST/JSON | Standard HTTP/1.1 with JSON request and response bodies. All examples in this documentation use REST. |
| gRPC | Protocol buffers over HTTP/2 for high-performance programmatic access. |
| Connect | gRPC-compatible protocol with improved browser and HTTP/1.1 support. |

Query parameters
List endpoints support the following query parameter:

| Parameter | Type | Description |
|---|---|---|
| updatedAfter | date-time | Filter resources to those updated after the specified timestamp (ISO 8601 format). |

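The filter can be illustrated with a short Python sketch that formats a timezone-aware timestamp as ISO 8601 in UTC and appends it as the `updatedAfter` query parameter. The helper name `list_url` and the example endpoint URL are hypothetical; only the parameter name and its date-time format come from the table above.

```python
from datetime import datetime, timezone
from urllib.parse import urlencode


def list_url(base_url: str, updated_after: datetime) -> str:
    """Append an updatedAfter filter to a list endpoint URL."""
    if updated_after.tzinfo is None:
        # A naive datetime is ambiguous, so refuse it rather than guess a zone.
        raise ValueError("updated_after must be timezone-aware")
    # Normalize to UTC and format as ISO 8601 with a trailing Z.
    stamp = updated_after.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    # urlencode percent-encodes the colons in the timestamp.
    return f"{base_url}?{urlencode({'updatedAfter': stamp})}"


if __name__ == "__main__":
    since = datetime(2025, 1, 15, 12, 0, 0, tzinfo=timezone.utc)
    # The path is a hypothetical placeholder for a list endpoint.
    print(list_url("https://api.coreweave.com/v1alpha1/example-resources", since))
```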
Status values
All resources share a common set of status values:

| Status | Description |
|---|---|
| STATUS_CREATING | Resource is being provisioned. |
| STATUS_READY | Resource is active and operational. |
| STATUS_UPDATING | Resource configuration is being updated. |
| STATUS_DELETING | Resource is being removed. |
| STATUS_ERROR | Resource encountered a recoverable error. |
| STATUS_FAILED | Resource encountered a terminal error. |

Each resource also includes a conditions array in its status with detailed information about the current state, including timestamps, reasons, and human-readable messages.
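One way a client might act on the status values listed above is sketched below in Python. The `Status` enum mirrors the documented values; the `next_action` helper and its return strings ("use", "wait", "retry", "abandon") are illustrative assumptions about client behavior, not part of the API.

```python
from enum import Enum


class Status(str, Enum):
    """Status values as listed in the table above."""
    CREATING = "STATUS_CREATING"
    READY = "STATUS_READY"
    UPDATING = "STATUS_UPDATING"
    DELETING = "STATUS_DELETING"
    ERROR = "STATUS_ERROR"
    FAILED = "STATUS_FAILED"


# Statuses that describe an operation still in flight.
IN_PROGRESS = {Status.CREATING, Status.UPDATING, Status.DELETING}


def next_action(status: str) -> str:
    """Map a status string to a coarse client-side decision (hypothetical policy)."""
    s = Status(status)  # raises ValueError for values not in the table
    if s is Status.READY:
        return "use"
    if s in IN_PROGRESS:
        return "wait"
    if s is Status.ERROR:
        return "retry"  # documented as a recoverable error
    return "abandon"  # STATUS_FAILED is documented as a terminal error


if __name__ == "__main__":
    for value in ("STATUS_CREATING", "STATUS_READY", "STATUS_FAILED"):
        print(value, "->", next_action(value))
```

Treating unknown values as an error keeps the client from silently misclassifying a status added in a later API version.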