AI RESEARCH

Beyond Mode-Seeking RL: Trajectory-Balance Post-Training for Diffusion Language Models

arXiv cs.LG

arXiv:2605.13935v1 Announce Type: new Diffusion language models are a promising alternative to autoregressive models, yet post-