AI RESEARCH

Score-Based Density Estimation from Pairwise Comparisons

arXiv CS.LG

ArXi:2510.09146v2 Announce Type: replace We study density estimation from pairwise comparisons, motivated by expert knowledge elicitation and learning from human feedback. We relate the unobserved target density to a tempered winner density (marginal density of preferred choices), learning the winner's score via score-matching. This allows estimating the target by `de-tempering' the estimated winner density's score. We prove that the score vectors of the belief and the winner density are collinear, linked by a position-dependent tempering field.