Why do dLLMs tend to collapse in RL (3 minute read)
TLDR AI
•
Generative AI
Reinforcement Learning
Diffusion Language Models (dLLMs) experience