Epoch's work is free to use, distribute, and reproduce provided the source and authors are credited under the Creative Commons BY license.
Learn more about this graph
We estimate total HBM bandwidth shipped with AI chips over time by multiplying quarterly shipments by each chip’s known memory bandwidth. When chips come in variants with different bandwidth specifications, we use the most popular variant, or take the geometric mean of all variants if we cannot determine which chip dominated sales.
The main challenge is NVIDIA’s Hopper generation, where financial reporting bundles H100 and H200 shipments even though the two have different memory specifications. We model the transition between the two using a logistic S-curve anchored to six public data points, with uncertainty propagated via Monte Carlo simulation.