Debiasing Machine Learning Predictions for Causal Inference Without Additional Ground Truth Data: “One Map, Many Trials” in Satellite-Driven Poverty Analysis
Paper i proceeding, 2026

Machine learning models trained on Earth observation data, such as satellite imagery, have demonstrated significant promise in predicting household-level wealth indices, enabling the creation of high-resolution wealth maps that can be leveraged across multiple causal trials while addressing chronic data scarcity in global development research. However, because standard training objectives prioritize overall predictive accuracy, these predictions often suffer from shrinkage toward the mean, leading to attenuated estimates of causal treatment effects and limiting their utility in policy evaluations. Existing debiasing methods, such as Prediction-Powered Inference (PPI), can handle this attenuation bias but require additional fresh ground-truth data at the downstream stage of causal inference, which restricts their applicability in data-scarce environments. We introduce and evaluate two post-hoc correction methods—Linear Calibration Correction (LCC) and a Tweedie’s correction approach—that substantially reduce shrinkage-induced prediction bias without relying on newly collected labeled data. LCC applies a simple linear transformation estimated on a held-out calibration split; Tweedie’s method locally de-shrink predictions using density score estimates and a noise scale learned upstream. We provide practical diagnostics for when a correction is warranted and discuss practical limitations. Across analytical results, simulations, and experiments with Demographic and Health Surveys (DHS) data, both approaches reduce attenuation; Tweedie’s correction yields nearly unbiased treatment-effect estimates, enabling a “one map, many trials” paradigm. Although we demonstrate on EO-ML wealth mapping, the methods are not geospatial-specific: they apply to any setting where imputed outcomes are reused downstream (e.g., pollution indices, population density, or LLM-derived indicators).

Författare

Markus Pettersson

Chalmers, Data- och informationsteknik, Data Science och AI

Connor T. Jerzak

University of Texas

Adel Daoud

Linköpings universitet

Proceedings of the AAAI Conference on Artificial Intelligence

21595399 (ISSN) 23743468 (eISSN)

Vol. 40 46 39106-39115

40th AAAI Conference on Artificial Intelligence, AAAI 2026
Singapore, Singapore,

Ämneskategorier (SSIF 2025)

Sannolikhetsteori och statistik

Oceanografi, hydrologi och vattenresurser

DOI

10.1609/aaai.v40i46.41258

Mer information

Senast uppdaterat

2026-04-17