Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.coreweave.com/llms.txt

Use this file to discover all available pages before exploring further.

Powered by eight NVIDIA H200 Hopper GPUs, each equipped with 141 GB of ultra-fast HBM3e memory, these instances are purpose-built for memory-intensive AI and HPC workloads. The significant memory per GPU allows for training larger models with bigger batch sizes and longer context windows compared to its predecessor. The GPUs are interconnected with NVIDIA NVLink for a high-speed, unified memory pool within the server. For clustering, they feature NVIDIA ConnectX-7 adapters providing a 400G NDR InfiniBand fabric, making them ideal for large, distributed training jobs that are memory-bound.

Specifications

FeatureDetail
CategoryMid-large Size Model Training & Inference
Instance IDgd-8xh200ib-i128
GPU8x NVIDIA H200
GPU RAM141 GB
GPU ConnectivityInfiniBand & NVLink
CPU ModelIntel Emerald Rapids 8562Y+ (2.80 GHz)
vCPUs128
RAM2048 GB
Local Storage61.44 TB
Network SpeedDual-port 100GbE
AvailabilityEU-SOUTH-03B
US-EAST-01A
US-EAST-02A
US-EAST-04A
US-EAST-08A
US-WEST-04A

Primary use cases

Training and fine-tuning large models (~70B parameters) with high precision, high-throughput inference with long context lengths, and memory-intensive scientific computing. Full precision versions of the latest open-source models such as DeepSeek V4 Flash, Qwen 3.5 397B-A17B, Kimi K2.6, MiniMax M2.7 and other large models requiring substantial GPU memory for a single Node.
Last modified on May 12, 2026