Why do dLLMs tend to collapse in RL (3 minute read)

TLDR AI
Generative AI Reinforcement Learning

Diffusion Language Models (dLLMs) experience