AI Inference Infrastructure
Market & spend forecasts
McKinsey · Goldman · EPRIGlobal data center capacity nearly triples to 219 gigawatts by 2030, with about 70 percent of new demand from AI workloads. Inference identified as the dominant AI workload by 2030. AI-equipped data centers projected to require $5.2 trillion in capital expenditures through 2030.
Data center power demand grows 165 percent by 2030 versus 2023. AI workload share of total data center power consumption rises from 14 percent (today) to 27 percent (2027) to 39 percent (2030). Inference becomes the main AI requirement by 2027.
U.S. data centers projected to consume 4.6 to 9.1 percent of total U.S. electricity generation annually by 2030, up from roughly 4 percent in 2023. Flat-profile load methodology (load distributed evenly across hours of the year).
Inference cost curve
Stanford HAIInference cost for a GPT-3.5-equivalent model (MMLU score 64.8) fell from $20 per million tokens in November 2022 to $0.07 per million tokens by October 2024 (Gemini-1.5-Flash-8B), a 280-fold reduction in approximately 18 months.
Grid, interconnection, and electricity demand
LBNL · IEAAs of year-end 2023: over 1,570 GW of generation and approximately 1,030 GW of storage active in U.S. interconnection queues (approximately 2,600 GW total). Median time from interconnection request to commercial operation reached five years for projects built in 2023, up from less than two years for the 2000-2007 cohort.
AI workload load curves differ structurally from traditional industrial demand; data centers and AI are a rising share of electricity demand globally.
Density & infrastructure
Uptime · ASHRAEAverage typical rack density across 2024 survey respondents was approximately 8 kW, with only about 1 percent of operators reporting racks above 100 kW. Dense racks concentrated among hyperscalers and AI-specialized facilities.
Direct-to-chip liquid cooling and immersion cooling are needed to sustain operation as rack densities climb past the 50-to-60 kW band that defines the air-cooling cliff.
Power cost & utility dataU.S. EIA
Retail commercial-industrial power rates vary widely by state and utility; tracked monthly. Used as the baseline retail tariff anchor in the worked-example unit-economic comparison.
State regulatory landscapeMultiState
Twelve U.S. states have introduced data center moratoria or restrictive AI-load bills as of early 2026. Carries forward from SAVRN piece 6 doctrine (“Data Center Moratorium: 12 States, 2026 Map, The Fix”).