These instances feature eight NVIDIA L40S GPUs, each with 48 GB of GDDR6 memory. The L40S is a compute-optimized version of the L40, specifically tuned to deliver higher performance for AI and data science workloads. Connected via PCIe, these GPUs are a cost-effective choice for scaling out inference and for fine-tuning mainstream models. They provide a strong balance of performance and value for enterprise AI deployment.
Specifications
| Feature | Detail |
|---|
| Category | Professional AI & Graphics |
| Instance ID | gd-8xl40s-i128 |
| GPU | 8x NVIDIA L40S |
| GPU RAM | 48 GB |
| GPU Connectivity | PCIe |
| CPU Model | Intel Sapphire Rapids 8462Y+ (2.80 GHz) |
| vCPUs | 128 |
| RAM | 1024 GB |
| Local Storage | 7.68 TB |
| Network Speed | Dual-port 100GbE |
| Availability | US-EAST-04A |
Primary use cases
High-throughput inference, efficient fine-tuning, video analytics, and other compute-focused AI tasks.
Recommended models
Inference for models up to ~40B parameters like Gemma 4 26B or Qwen 3.6 27B; efficient fine-tuning of smaller sub-15B models. Last modified on May 12, 2026