All research sources
Sources & Citations

AI Inference Infrastructure

10 primary sources 6 categories We publish the receipts
A

Market & spend forecasts

McKinsey · Goldman · EPRI

Global data center capacity nearly triples to 219 gigawatts by 2030, with about 70 percent of new demand from AI workloads. Inference identified as the dominant AI workload by 2030. AI-equipped data centers projected to require $5.2 trillion in capital expenditures through 2030.

McKinsey & Company — “The cost of compute: A $7 trillion race to scale data centers” (2024) View source

Data center power demand grows 165 percent by 2030 versus 2023. AI workload share of total data center power consumption rises from 14 percent (today) to 27 percent (2027) to 39 percent (2030). Inference becomes the main AI requirement by 2027.

Goldman Sachs Research — “AI to drive 165% increase in data center power demand by 2030” View source

U.S. data centers projected to consume 4.6 to 9.1 percent of total U.S. electricity generation annually by 2030, up from roughly 4 percent in 2023. Flat-profile load methodology (load distributed evenly across hours of the year).

EPRI — “Powering Intelligence: Analyzing Artificial Intelligence and Data Center Energy Consumption” (2024) View source
B

Inference cost curve

Stanford HAI

Inference cost for a GPT-3.5-equivalent model (MMLU score 64.8) fell from $20 per million tokens in November 2022 to $0.07 per million tokens by October 2024 (Gemini-1.5-Flash-8B), a 280-fold reduction in approximately 18 months.

Stanford HAI — 2025 AI Index Report View source
C

Grid, interconnection, and electricity demand

LBNL · IEA

As of year-end 2023: over 1,570 GW of generation and approximately 1,030 GW of storage active in U.S. interconnection queues (approximately 2,600 GW total). Median time from interconnection request to commercial operation reached five years for projects built in 2023, up from less than two years for the 2000-2007 cohort.

Lawrence Berkeley National Laboratory — “Queued Up: 2024 Edition, Characteristics of Power Plants Seeking Transmission Interconnection As of the End of 2023” View source

AI workload load curves differ structurally from traditional industrial demand; data centers and AI are a rising share of electricity demand globally.

International Energy Agency — Electricity 2024 View source
D

Density & infrastructure

Uptime · ASHRAE

Average typical rack density across 2024 survey respondents was approximately 8 kW, with only about 1 percent of operators reporting racks above 100 kW. Dense racks concentrated among hyperscalers and AI-specialized facilities.

Uptime Institute — Global Data Center Survey 2024 View source

Direct-to-chip liquid cooling and immersion cooling are needed to sustain operation as rack densities climb past the 50-to-60 kW band that defines the air-cooling cliff.

ASHRAE — Technical Committee 9.9 (Mission Critical Facilities, Data Centers, Technology Spaces) thermal guidance View source
E

Power cost & utility dataU.S. EIA

Retail commercial-industrial power rates vary widely by state and utility; tracked monthly. Used as the baseline retail tariff anchor in the worked-example unit-economic comparison.

U.S. Energy Information Administration — Electric Power Monthly View source
F

State regulatory landscapeMultiState

Twelve U.S. states have introduced data center moratoria or restrictive AI-load bills as of early 2026. Carries forward from SAVRN piece 6 doctrine (“Data Center Moratorium: 12 States, 2026 Map, The Fix”).

MultiState Associates — state legislative tracker (AI / data center matrix) View source
These are the primary sources behind our research. Have a better one, or spot an error? Tell us — we’ll correct it.