Building & Construction Data Center Readiness

Methodology

How the Data Center Readiness Index ranks all 50 US states across nine dimensions of the operating environment for hyperscale data center deployment: power, grid reliability, interconnection, natural-disaster risk, water, workforce, economy, permitting, and community sentiment.

Summary

A composite ranking of all 50 US states on nine dimensions of the operating environment for multi-hundred-megawatt campus builds, the scale typical of hyperscalers, AI training clusters, and large colocation operators. The Index measures friction in the raw environment, not existing market footprint.

Overview

How it works

The Data Center Readiness Index ranks all 50 US states on their suitability for large-scale data center deployment. It aggregates publicly available federal and third-party data across nine dimensions to give hyperscaler and large-colocation site-selection teams, infrastructure investors and lenders underwriting multi-billion-dollar campus builds, and policy analysts and journalists a defensible, reproducible basis for comparing states.

The dimensions, indicators, and weights are tuned to multi-hundred-megawatt campus-scale projects, but the Index is directionally useful for any party evaluating where to place a large industrial load.

The Data Center Readiness Index ranks states on dimensional readiness, not on existing market presence or engineered-solution offsets. A state’s rank reflects raw operating-environment quality, not its current campus footprint. Three of the most active US data-center markets (California, New Jersey, and Nevada) rank in the lower half of the composite. They are not low-ranked because the Index is wrong about them. They are low-ranked because the underlying operating environment in each is materially harder than in the high-ranked states, and operators who build there do so by buying their way around those constraints: long-term water contracts, custom interconnection deals, dry-cooled or closed-loop facility designs, premium real estate, premium labor.

The most useful reading of the Data Center Readiness Index is as a complement to existing-market data: a high-ranked state is one where a new build faces less friction; an existing major market that ranks low is a place where capital has accumulated despite friction, typically through engineered solutions and incumbency effects that a new entrant would not start with.

Indicators

Energy Supply

Heaviest weight

Installed capacity (MW), operable and proposed generators, the long-range capacity pipeline, and industrial retail electricity prices. A hyperscale facility can draw 100–500 MW continuously (the power consumption of a mid-sized city), so capacity headroom and competitive industrial pricing are the foundational dimension. Sources: EPA eGRID, EIA-860, NREL ReEDS (Mid-Case scenario), EIA Retail Electricity Sales API.

Grid Reliability

Standard weight

Day-to-day outage frequency (SAIFI) and duration (SAIDI, CAIDI), reported both with and without Major Event Days so day-to-day reliability is separable from extreme-weather performance, plus property-damage-weighted historical storm exposure. Even with on-site backup, outages drive diesel consumption, generator wear, and SLA risk. Sources: EIA-861, NOAA Storm Events Database.

Interconnection Speed

Standard weight

Queue depth as a share of installed capacity, the withdrawal rate (what fraction of queued projects give up before completing interconnection), and active project count. Interconnection is routinely a 3–5 year lead-time item; states with deep queues and high withdrawal rates are effectively closed to fast deployment. Source: LBNL Queued Up.

Natural Disaster Risk

Standard weight

Multi-peril exposure (storms, floods, wildfire, seismic, drought, hail, heat, and other hazards) from FEMA’s National Risk Index: county-level scores rolled up to the state level via population weighting and reported as expected annual loss per capita. Source: FEMA National Risk Index.

Water Supply

Lighter weighting

Overall availability of water for data-center cooling operations at hyperscale: whether a state’s water environment can reliably support cooling at scale over a multi-year horizon. The supply-side measurement integrates chronic drought exposure (the share of state area in D2+ drought on a two-year rolling average, capturing both how often a state is in drought and how severe those droughts are) alongside the broader water-supply picture relevant to large industrial withdrawals. Actual water consumption is highly facility-dependent; some operators rely on closed-loop immersion cooling or fully recycled water, while others have heavy potable-water draws. This indicator evaluates water supply at the state level only, not facility-level water demand. Source: U.S. Drought Monitor.

Workforce & Talent

Standard weight

Employment counts and wage rates in occupations data centers directly depend on (electrical and electronics engineering, computer hardware and network engineering, electrical installation and maintenance), identified by SOC codes. State-level workforce depth (employment) is paired with median wages as a supply/demand competitiveness proxy. Source: BLS Occupational Employment and Wage Statistics.

Economic Climate

Standard weight

Combined state and local sales tax rate (affecting large equipment and construction purchases), electricity market structure (deregulated / partially deregulated / regulated, a procurement-flexibility signal), and announced data-center investment momentum from major hyperscalers and operators. Sources: Tax Foundation, EIA/NARUC, curated investment data.

Permitting Friction

Standard weight

Regulatory burden, processing speed, and inspection complexity for the permits required to build and interconnect a large facility: the time and uncertainty between site control and groundbreaking. Source: Red Tape Index permitting data.

Sentiment

Standard weight

State-level community license to operate a hyperscale data center, captured in two stages of opposition. Pre-permit grassroots: opposition actions per existing-or-announced facility. Legislative: an ordinal moratorium-status score capturing local moratoriums, pending state bills, and statewide moratoriums in effect. Sources: public data-center opposition tracking, triangulated across multiple public legislative trackers.

Detail

How we score states

Every state receives two composite scores per release. The published 1–50 ranking is produced by a non-compensatory geometric-mean composite; a supplementary arithmetic-mean composite is computed from the same inputs as a robustness check on the headline. Both follow standard practice in the OECD Handbook on Constructing Composite Indicators.

Each indicator is winsorized at the 5th and 95th percentiles across the 51 jurisdictions before any normalization. This caps extreme values (a single hurricane year, for example, would otherwise dominate the outage-duration scale) without re-ranking the underlying data. Indicators where lower is better (electricity price, outage minutes, hazard score, opposition actions) are inverted so that 100 always means best in class.

The headline ranking uses a geometric-mean composite. Each winsorized indicator is normalized to a 0–100 min-max scale; indicators are combined within each dimension by weighted geometric mean, and dimensions are then combined by weighted geometric mean across the composite. The result is sorted to produce the 1–50 ranking.

The geometric mean is non-compensatory by design: a state cannot trade catastrophic weakness on any single dimension off against strength on another. Weakness in any one dimension is penalised multiplicatively rather than averaged out. Hyperscalers require minimum acceptable thresholds across all criteria simultaneously, not merely a high average; the headline ranking is built to reflect that.

A supplementary composite is computed from the same winsorized inputs using percentile-rank normalization and weighted arithmetic mean. The Spearman rank correlation between the geometric-mean headline and the arithmetic-mean supplementary composite is reported with every release as a methodological witness to the structural agreement between the two aggregations. Disagreement between them is concentrated in states with mixed performance across dimensions; uniformly strong or uniformly weak states rank similarly under both methods. The divergence is informational, reflecting different commitments about whether averaging out weakness is acceptable.

Energy Supply is the most heavily weighted dimension; the other weights reflect operator priorities surfaced through expert interviews and review. The exact dimension weights are part of our proprietary methodology and are not published.

Detail

Tier classification

States are assigned to one of four tiers, recalculated on each release: Top Quartile covers ranks 1–13 (strongest overall environment for data center deployment); Above Average covers ranks 14–26 (solid fundamentals with one or two notable gaps); Below Average covers ranks 27–38 (meaningful weaknesses that warrant attention); Bottom Quartile covers ranks 39–50 (significant structural barriers).

A separate Insufficient Data designation applies to jurisdictions that fall over the missing-data threshold and are not assigned a 1–50 rank.

Detail

Reproducibility

Every set of published scores is linked to a release identifier that records the weights used, the source data vintage, and the timestamp of the run. Rerunning with the same inputs produces identical scores. Score history is preserved; no published row is ever overwritten.

We test weight sensitivity by perturbing each dimension weight by ±25 percent and observing tier movement. Top-tier states are expected to remain stable under these perturbations.

Detail

References

OECD/JRC (2008). Handbook on Constructing Composite Indicators: Methodology and User Guide. OECD Publishing, Paris. DOI: 10.1787/9789264043466-en.

Nardo, M., Saisana, M., Saltelli, A., & Tarantola, S. (2005). Tools for Composite Indicators Building. European Commission, Joint Research Centre, EUR 21682 EN. Ispra.

Saisana, M., Saltelli, A., & Tarantola, S. (2005). Uncertainty and sensitivity analysis techniques as tools for the quality assessment of composite indicators. Journal of the Royal Statistical Society: Series A, 168(2), 307–323.

Saltelli, A., Ratto, M., Andres, T., Campolongo, F., Cariboni, J., Gatelli, D., Saisana, M., & Tarantola, S. (2008). Global Sensitivity Analysis: The Primer. John Wiley & Sons, Chichester.

Caveats

Known limitations

01The Data Center Readiness Index ranks states on raw operating-environment quality, not on existing capital deployed. A state where data-center capital has accumulated may rank lower than its market share would suggest; that is the Index doing its job, not a misread.
02The Index measures state-level conditions. Sub-state variation (Northern Virginia vs. the rest of Virginia, the Phoenix metro vs. rural Arizona) is not separable in the published ranking.
03The District of Columbia is reported as “insufficient data” rather than ranked. Three federal datasets used in the composite (including the U.S. Drought Monitor) do not produce comparable inputs for DC, putting it over the missing-data threshold for a 1–50 rank.
04The Index does not capture: existing campus footprint or announced-MW capacity, operator-specific cooling architecture (dry-cooled vs. evaporative), or forward-looking climate projections.

Data sources

EPA eGRID: generation capacity and emissions
EIA-860: generator-level capacity and pipeline
EIA-861: utility outage and retail data
NREL ReEDS (Mid-Case): long-horizon capacity forecast
EIA Retail Electricity Sales API: industrial electricity prices
LBNL Queued Up: federal interconnection queue
NOAA Storm Events Database: historical storm exposure
FEMA National Risk Index: multi-peril hazard composite
U.S. Drought Monitor (NDMC / USDA / NOAA): drought classification
BLS Occupational Employment and Wage Statistics: occupational employment and wages
Tax Foundation: state and local tax burdens
EIA / NARUC: electricity market structure
Red Tape Index: permitting data
Public data-center opposition tracking: community-opposition signals
Public legislative trackers: state and local moratorium status

Refresh cadence

We target a quarterly refresh, though not every source publishes on that cadence. Each indicator is updated as new vintages become available, and we do our best to keep the composite current. Each release records the source vintage and a unique identifier so historical scores remain reproducible.