AI RESEARCH

Unsupervised Machine Learning for Detecting Structural Anomalies in European Regional Statistics

arXiv CS.LG

ArXi:2605.02884v1 Announce Type: new Ensuring the coherence of regional socio-economic statistics is a central task for national statistical institutes. Traditional validation tools, such as range edits, ratio checks, or univariate outlier detection, are effective for identifying extreme values in individual series but are less suited for detecting unusual combinations of indicators in high-dimensional settings. This paper proposes an unsupervised machine learning framework for identifying structurally atypical regional profiles within Europe using publicly available Eurostat data.