January 25, 2026

5 min read

Validating unemployment insurance estimates

PolicyEngine corrects CPS underreporting of UI benefits using quantile regression forests and reweighting, matching administrative totals within 2-4%.

Contents

Validation results

PolicyEngine's approach

1. Chained conditional imputation

2. Reweighting to multiple targets

Other approaches

Federal Reserve / Joint Committee on Taxation

CBO

TRIM3 (Urban Institute)

Summary

Why methodology matters

Limitations

Conclusion

Survey data systematically underreport unemployment insurance (UI) benefits. The Current Population Survey captures only 58-65% of actual UI outlays. PolicyEngine corrects this underreporting using a methodology that differs from other approaches—and understanding these differences matters for interpreting results.

Validation results#

We compared PolicyEngine's 2024 UI estimates against administrative data from the Congressional Budget Office and Department of Labor:

Raw CPS captures 59-64% of administrative totals. PolicyEngine matches within 2-4%. Hover over bars for absolute values.

PolicyEngine's approach#

PolicyEngine corrects UI underreporting in two steps:

1. Chained conditional imputation#

We use quantile regression forests (QRF) to impute UI from IRS Public Use File data onto CPS records. QRF samples from the empirical conditional distribution of UI given demographics and other income sources—no parametric assumptions about the shape.

This is sequential: each variable conditions on all previously imputed ones. UI (imputed 38th in sequence) conditions on employment income, Social Security, pension income, and 34 other variables imputed earlier.

Why doesn't variable order matter? Correlations are symmetric in the training data. If UI and employment income are correlated in the PUF, then P(UI | income) and P(income | UI) both capture the same relationship. This is the foundation of chained equations imputation (MICE): with correctly specified conditionals, sequential imputation converges to the joint distribution.

2. Reweighting to multiple targets#

After imputation, we optimize household weights to simultaneously match:

IRS SOI Table 1.4: UI amounts and counts by AGI bracket (36 cells)
CBO program spending projections
2,800+ other calibration targets

This enables targeting both caseloads and benefit amounts simultaneously. Cell-based selection methods have one degree of freedom per cell (selection probability), which constrains benefit amounts to follow from who is selected.

Other approaches#

Federal Reserve / Joint Committee on Taxation#

A 2022 FEDS Notes paper by Fed and JCT researchers documented an imputation methodology (Stata code). This was a one-off research paper, not part of regular Fed publications:

Stratify CPS respondents into 100 income percentiles
Calculate mean and standard deviation of UI within each percentile from IRS 1099-G data
For non-reporters, draw UI amounts from Normal(μ, σ) for their percentile
Select non-reporters randomly until benefit totals match administrative targets

The method targets aggregate benefits, not recipient counts—the number of recipients is an outcome, not a constraint.

CBO#

CBO Working Paper 2018-07 (GitHub code):

Estimate probability of UI receipt via probit regression on demographics and income
Assign each non-reporter a random number; if probability > random, they receive UI
Assign average benefit amount for their demographic/income group
Iterate until totals match administrative data

CBO notes their method "was designed with a degree of precision that is suited for estimating the distribution of income by quintiles—not by households."

TRIM3 (Urban Institute)#

TRIM3 is a comprehensive microsimulation model. Documentation describes detailed rules-based eligibility simulation for SNAP, TANF, and Medicaid, but provides less detail on UI methodology. What is documented:

Takes survey-reported UI from CPS
Allocates amounts across months within state-level constraints (max weeks, min/max weekly amounts)
Corrects for underreporting to match DOL administrative totals

TRIM3 does not document how it selects which non-reporters to assign UI or how it determines benefit amounts for them. TRIM3 code is not publicly available.

Summary#

Model	Method	Distribution	Open source
PolicyEngine	QRF + reweight to SOI	Nonparametric (learned)	Yes
Fed/JCT	Normal draw by percentile	Normal(μ, σ)	Yes (Stata)
CBO	Probit → assign group averages	Point estimates	Partial
TRIM3	Adjust reported amounts	Deterministic	No

Why methodology matters#

Consider two workers at the 40th income percentile ($48,000 AGI) who lose their jobs.

Parametric approach (normal distribution within percentile):

Drawing from Normal(μ=$9,200, σ=$4,800), about 2.3% of imputations exceed $18,800. A worker receiving this amount would see their AGI rise to $66,800—potentially crossing from the 12% to 22% federal tax bracket.

But actual UI rarely reaches $18,800. It requires ~30 weeks of benefits at near-maximum weekly amounts. The true probability is well below 2.3%.

The Fed stratifies by pre-UI income. Adding UI changes income rank: a worker at the 40th percentile ($48,000) who receives $18,800 UI now has $66,800—around the 52nd percentile. The imputed UI was drawn from the 40th percentile distribution.

Nonparametric approach:

QRF samples from the empirical UI distribution in tax data. High amounts are rare in the training data, so they remain rare in imputations.

This distinction extends beyond federal taxes. The Supplemental Poverty Measure (SPM) subtracts taxes from resources. If a model assigns higher UI amounts than actually occur, computed tax liabilities increase, SPM resources decrease, and SPM poverty rates rise. The Official Poverty Measure (pre-tax thresholds) does not incorporate taxes, so it is unaffected by this mechanism.

Limitations#

Training data: The IRS PUF is from 2015, aged forward. Structural changes in UI (extended benefits, gig economy workers) may not be fully captured.
State variation: We don't yet calibrate to state-level UI totals, though the methodology supports this.
Monthly timing: Like other annual models, we work with annual totals rather than monthly benefit flows.

Conclusion#

Survey underreporting of unemployment insurance is well-documented. Correcting it requires choices about imputation methodology that affect downstream tax and poverty calculations. PolicyEngine uses nonparametric imputation (QRF) and reweights to match administrative totals within 2-4%.

The Enhanced CPS with these corrections is available through our web app and Python package.

Max Ghenis

PolicyEngine's Co-founder and CEO