AI RESEARCH

DiffScore: Text Evaluation Beyond Autoregressive Likelihood

arXiv CS.AI • May 13, 2026

ArXi:2605.11601v1 Announce Type: cross Autoregressive language models are widely used for text evaluation, however, their left-to-right factorization