AI RESEARCH
Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models
arXiv CS.AI
•
ArXi:2603.24844v1 Announce Type: cross Given a question, a language model (LM) implicitly encodes a distribution over possible answers. In practice, post-