AI RESEARCH
Support Before Frequency in Discrete Diffusion
arXiv cs.LG
arXiv:2605.13999v1 Announce Type: new
Discrete diffusion models are increasingly competitive for language modeling, yet it remains unclear how their denoising objectives organize learning. Although these objectives target the full data distribution, we show that the exact reverse process induces a hierarchy between coarse information and finer frequency information. For uniform and absorbing (a.k.a.