Data Insight
Mar. 24, 2026

Total AI chip memory bandwidth has grown 4.1x per year, now reaching 70 million terabytes per second

By Luke Emberson

High-bandwidth memory (HBM) is a critical component of AI chips that provides the storage and throughput needed to efficiently train and serve large models. Total HBM bandwidth is thus a rough proxy for the world’s total capacity to serve AI models. In 2025, AI chips accounted for over 90% of the HBM industry’s output by revenue.

As of Q4 2025, the cumulative memory bandwidth of AI chips shipped since 2022 has reached roughly 70 million terabytes per second, enough to pass all data stored on the internet into memory in under an hour. We draw these figures from the financial disclosures of five major chip manufacturers.
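As a rough sanity check of the “under an hour” claim, here is the arithmetic, assuming roughly 180 zettabytes of data stored worldwide (a commonly cited industry estimate, used here as an assumption rather than a measured value):

```python
# Back-of-the-envelope check of the "under an hour" claim.
# Assumption: ~180 zettabytes (1.8e23 bytes) of data stored worldwide,
# in line with widely cited industry estimates.

total_bandwidth_tb_per_s = 70e6                # 70 million TB/s of cumulative HBM bandwidth
bytes_per_s = total_bandwidth_tb_per_s * 1e12  # convert TB/s to bytes/s
internet_bytes = 1.8e23                        # assumed global stored-data volume

seconds = internet_bytes / bytes_per_s
print(f"{seconds / 60:.0f} minutes")           # ~43 minutes, i.e. under an hour
```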

Epoch's work is free to use, distribute, and reproduce provided the source and authors are credited under the Creative Commons BY license.

Learn more about this graph

We estimate total HBM bandwidth shipped with AI chips over time by multiplying quarterly shipments by each chip’s known memory bandwidth. When chips come in variants with different bandwidth specifications, we use the most popular variant, or take the geometric mean of all variants if we cannot determine which variant dominated sales.
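A minimal sketch of this calculation is below. The chip names, shipment counts, and bandwidth figures are hypothetical stand-ins for the actual dataset; only the structure of the computation reflects the method described above.

```python
from statistics import geometric_mean

# Hypothetical inputs for illustration only; not the actual dataset.
# Bandwidth per chip in TB/s; chips with multiple variants list all of them.
bandwidth_tb_s = {
    "chip_a": [3.35],       # single known variant
    "chip_b": [4.8, 3.9],   # variants with an unknown sales mix
}

# Quarterly unit shipments per chip, again illustrative.
shipments = {
    ("2025Q4", "chip_a"): 500_000,
    ("2025Q4", "chip_b"): 300_000,
}

def chip_bandwidth(chip: str) -> float:
    """Use the single known variant, else the geometric mean of all variants."""
    variants = bandwidth_tb_s[chip]
    return variants[0] if len(variants) == 1 else geometric_mean(variants)

# Bandwidth shipped in a quarter = sum over chips of (units x TB/s per unit).
total = sum(units * chip_bandwidth(chip)
            for (quarter, chip), units in shipments.items())
print(f"{total:,.0f} TB/s shipped")
```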

The main challenge is NVIDIA’s Hopper generation, where financial reporting bundles H100 and H200 shipments even though the two have different memory specifications. We model the transition between the two using a logistic S-curve anchored to six public data points, with uncertainty propagated via Monte Carlo simulation.
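Below is a minimal sketch of how such a model could be set up. The parameter distributions and shipment counts are placeholders rather than the fitted values (in practice, the midpoint and steepness would be estimated from the six public data points); only the per-chip bandwidth specs (3.35 TB/s for the H100 SXM, 4.8 TB/s for the H200) are public figures.

```python
import numpy as np

rng = np.random.default_rng(0)

# Logistic S-curve for the H200 share of Hopper shipments at time t (quarters).
def h200_share(t, midpoint, rate):
    return 1.0 / (1.0 + np.exp(-rate * (t - midpoint)))

# Placeholder uncertainty on the fitted parameters (midpoint, steepness);
# in practice these would come from fitting the six public data points.
N = 10_000
midpoints = rng.normal(loc=6.0, scale=0.5, size=N)  # quarters since H100 launch
rates = rng.normal(loc=1.2, scale=0.2, size=N)

hopper_units = 1_000_000        # illustrative bundled Hopper shipments in one quarter
H100_BW, H200_BW = 3.35, 4.8    # TB/s per chip (public specs)

t = 7.0  # quarter of interest
share = h200_share(t, midpoints, rates)  # one sampled share per Monte Carlo draw
bandwidth = hopper_units * (share * H200_BW + (1 - share) * H100_BW)

# Summarize the propagated uncertainty as a median and 90% interval.
lo, mid, hi = np.percentile(bandwidth, [5, 50, 95])
print(f"median {mid/1e6:.2f}M TB/s (90% CI {lo/1e6:.2f}-{hi/1e6:.2f}M TB/s)")
```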
