AI RESEARCH
dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models
arXiv CS.AI
•
ArXi:2603.18806v1 Announce Type: new Diffusion Large Language Models (dLLMs)