DPRM: A Plug-in Doob h-transform-induced Token-Ordering Module for Diffusion Language Models

arXiv:2604.24357v1 Announce Type: new Diffusion language models generate without a fixed left-to-right order, making token ordering a central algorithmic choice: which tokens should be revealed, retained, revised, or verified at each step? Existing systems mainly use random masking or confidence-driven ordering. Random masking creates a train-test mismatch, while confidence-only rules are efficient but can be myopic and suppress useful exploration.
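
To make the two baseline orderings concrete, here is a minimal sketch of how a decoder might choose which masked positions to reveal next. It does not illustrate DPRM itself. The function names, the toy probabilities, and the "highest top-token probability" confidence rule are all illustrative assumptions rather than details taken from the paper.

```python
import random

def confidence_order(probs, masked, k):
    """Pick the k masked positions whose top token probability is highest.

    Assumption: confidence is the maximum predicted probability at a position.
    """
    ranked = sorted(masked, key=lambda i: max(probs[i]), reverse=True)
    return ranked[:k]

def random_order(masked, k, seed=0):
    """Pick k masked positions uniformly at random (seeded for repeatability)."""
    rng = random.Random(seed)
    return rng.sample(sorted(masked), k)

# Toy per-position distributions over a 3-token vocabulary (made-up numbers).
probs = {
    0: [0.90, 0.05, 0.05],
    1: [0.40, 0.35, 0.25],
    2: [0.70, 0.20, 0.10],
    3: [0.34, 0.33, 0.33],
}
masked = {0, 1, 2, 3}

# Confidence-driven ordering always reveals the most certain positions first.
chosen = confidence_order(probs, masked, k=2)
print(chosen)  # [0, 2]

# Random ordering ignores the model's predictions entirely.
print(random_order(masked, k=2))
```

In this toy setting the confidence rule never selects positions 1 and 3 while more certain positions remain, which is the kind of myopia the abstract attributes to confidence-only ordering.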