AI RESEARCH

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

arXiv cs.AI

arXiv:2603.18806v1 Announce Type: new Diffusion Large Language Models (dLLMs)