← All hardware
NVIDIA GB200 NVL72 logo
hardware Data Center AI Accelerators

NVIDIA GB200 NVL72

by NVIDIA

A rack-scale exaflop AI supercomputer that acts as one giant GPU.

Pros

  • Unmatched single-domain scale for the largest LLMs
  • Mature CUDA software ecosystem with broad framework support
  • Treats 72 GPUs as one coherent accelerator, simplifying very large models

Cons

  • Roughly $2.8M-$3.4M per rack and severe supply constraints
  • Requires liquid cooling and ~120 kW+ power per rack
  • Vendor lock-in to NVIDIA networking and software

✓ Where it shines / best for

  • Trillion-parameter LLM training at hyperscale
  • Real-time inference on massive mixture-of-experts models
  • Cloud providers and large AI labs building exascale clusters

✕ Not the best fit for

  • Single-server or small-cluster deployments
  • Datacenters without liquid-cooling and high-density power
  • Cost-sensitive or experimental workloads

Features

  • ✓ 72 Blackwell GPUs + 36 Grace CPUs in a single liquid-cooled rack (36 GB200 Superchips)
  • ✓ Single 72-GPU NVLink domain via 5th-gen NVLink (130TB/s aggregate)
  • ✓ 1.44 EFLOPS sparse FP4 (720 PFLOPS dense) inference per rack
  • ✓ 13.4TB of unified HBM3e memory at 576TB/s
  • ✓ Grace-Blackwell superchip pairs 2 GPUs to 1 Grace CPU via NVLink-C2C
  • ✓ NVLink Switch System unifies all 72 GPUs as one accelerator
  • ✓ 30x faster real-time trillion-parameter LLM inference vs prior gen
  • ✓ Draws up to ~120kW per rack; requires liquid cooling

Pricing

PlanPriceBillingNotes
Full NVL72 rack$2,000,000-$3,000,000one-timeRack-scale system price via OEM/cloud quote; not publicly listed by NVIDIA
Cloud rental (per GB200 GPU)~$10.50-$27.00per hourOn-demand neocloud per-GPU rate; ~$756-$1,944/hr for a full 72-GPU rack; lower with reservations

Pricing verified from the official source. Prices change often — confirm on the vendor's site before buying.

Specifications

power~120 kW+ per rack, liquid-cooled
memory13.5 TB HBM3e unified (192 GB per GPU)
architectureBlackwell (TSMC 4NP) + Grace Arm CPU, rack-scale
interconnectQuantum-X800 InfiniBand / Spectrum-X Ethernet
gpus_per_rack72 Blackwell + 36 Grace CPUs
fp4_performance1,440 PFLOPS (1.44 EXAFLOPS) per rack
nvlink_bandwidth130 TB/s aggregate (1.8 TB/s per GPU, 5th-gen NVLink)
Sponsored

A full review is being generated for this product and will appear here shortly.

Compare with

Compare
Compare