AI RESEARCH

Lost in the Folds: When Cross-Validation Is Not a Deep Ensemble for Uncertainty Estimation

arXiv CS.LG

ArXi:2605.18329v1 Announce Type: cross Ensemble disagreement is widely used as a proxy for epistemic uncertainty in medical image segmentation. In practice, many studies form ensembles via K-fold cross-validation (CV), yet refer to them as ``deep ensembles'' (DE). Because CV members are trained on different data subsets, their disagreement mixes seed-driven variability with data-exposure effects, which can change how uncertainty should be interpreted. We audit recent segmentation uncertainty studies and find that terminology--implementation mismatches are common.