Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.coreweave.com/llms.txt

Use this file to discover all available pages before exploring further.

Powered by a single NVIDIA GH200 Grace Hopper Superchip, this instance offers a unique architecture that combines a 72-core Grace CPU with a Hopper GPU. Its defining feature is the 576 GB unified memory pool (96 GB HBM3e + 480 GB LPDDR5X) connected via the high-speed NVLink-C2C interconnect. This design eliminates traditional PCIe bottlenecks and allows the GPU to access a massive memory space, making it unparalleled for running extremely large models on a single machine.

Specifications

FeatureDetail
CategoryProfessional AI & Graphics
Instance IDgd-1xgh200
GPU1x NVIDIA GH200 Grace Hopper
GPU RAM96 GB
GPU ConnectivityPCIe
CPU ModelNVIDIA Grace Arm v9 (3.10 GHz)
vCPUs72
RAM480 GB
Local Storage7.68 TB
Network SpeedDual-port 100GbE
AvailabilityRNO2A
US-EAST-04A

Primary use cases

Low-latency inference for models that are too large to fit in the memory of a standard GPU, large-scale graph analytics, and memory-intensive data science. Inference on heavily quantized versions of modern state-of-the-art models (500B - 1T parameters) like Kimi K2.6, GLM 5.1, as well as moderately quantized versions of other large models such as DeepSeek V4 Flash and MiniMax-M2.7 in the 100B-500B parameter range.
Last modified on May 12, 2026