Debiasing Machine Learning Predictions for Causal Inference Without Additional Ground Truth Data: “One Map, Many Trials” in Satellite-Driven Poverty Analysis

Markus Pettersson; Connor T. Jerzak; Adel Daoud

doi:10.1609/aaai.v40i46.41258

Debiasing Machine Learning Predictions for Causal Inference Without Additional Ground Truth Data: “One Map, Many Trials” in Satellite-Driven Poverty Analysis
Paper in proceeding, 2026

Machine learning models trained on Earth observation data, such as satellite imagery, have demonstrated significant promise in predicting household-level wealth indices, enabling the creation of high-resolution wealth maps that can be leveraged across multiple causal trials while addressing chronic data scarcity in global development research. However, because standard training objectives prioritize overall predictive accuracy, these predictions often suffer from shrinkage toward the mean, leading to attenuated estimates of causal treatment effects and limiting their utility in policy evaluations. Existing debiasing methods, such as Prediction-Powered Inference (PPI), can handle this attenuation bias but require additional fresh ground-truth data at the downstream stage of causal inference, which restricts their applicability in data-scarce environments. We introduce and evaluate two post-hoc correction methods—Linear Calibration Correction (LCC) and a Tweedie’s correction approach—that substantially reduce shrinkage-induced prediction bias without relying on newly collected labeled data. LCC applies a simple linear transformation estimated on a held-out calibration split; Tweedie’s method locally de-shrink predictions using density score estimates and a noise scale learned upstream. We provide practical diagnostics for when a correction is warranted and discuss practical limitations. Across analytical results, simulations, and experiments with Demographic and Health Surveys (DHS) data, both approaches reduce attenuation; Tweedie’s correction yields nearly unbiased treatment-effect estimates, enabling a “one map, many trials” paradigm. Although we demonstrate on EO-ML wealth mapping, the methods are not geospatial-specific: they apply to any setting where imputed outcomes are reused downstream (e.g., pollution indices, population density, or LLM-derived indicators).

Author

Markus Pettersson

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Other publications Research

Connor T. Jerzak

University of Texas

Adel Daoud

Linköping University

Other publications Research

Proceedings of the AAAI Conference on Artificial Intelligence

21595399 (ISSN) 23743468 (eISSN)

Vol. 40 46 39106-39115

40th AAAI Conference on Artificial Intelligence, AAAI 2026
Singapore, Singapore,

Subject Categories (SSIF 2025)

Probability Theory and Statistics

Oceanography, Hydrology and Water Resources

DOI

10.1609/aaai.v40i46.41258

Publication data connected to DOI

More information

Latest update

4/17/2026

Debiasing Machine Learning Predictions for Causal Inference Without Additional Ground Truth Data: “One Map, Many Trials” in Satellite-Driven Poverty Analysis Paper in proceeding, 2026