AI RESEARCH

Spatially Robust Inference with Predicted and Missing at Random Labels

arXiv CS.LG

ArXi:2603.11368v1 Announce Type: cross When outcome data are expensive or onerous to collect, scientists increasingly substitute predictions from machine learning and AI models for unlabeled cases, a process which has consequences for downstream statistical inference. While recent methods provide valid uncertainty quantification under independent sampling, real-world applications involve missing at random (MAR) labeling and spatial dependence. For inference in this setting, we propose a doubly robust estimator with cross-fit nuisances.