AI RESEARCH

Information-Geometric Decomposition of Generalization Error in Unsupervised Learning

arXiv CS.LG

ArXi:2604.12340v1 Announce Type: cross We decompose the Kullback--Leibler generalization error (GE) -- the expected KL divergence from the data distribution to the trained model -- of unsupervised learning into three non-negative components: model error, data bias, and variance. The decomposition is exact for any e-flat model class and follows from two identities of information geometry: the generalized Pythagorean theorem and a dual e-mixture variance identity.