Prefix-Adaptive Block Diffusion for Efficient Document Recognition
arXiv CS.CV
arXiv:2605.16861v1 Announce Type: new
Block Diffusion Models (BDMs) support parallel generation, flexible-length output, and KV caching, making them promising for efficient document parsing. However, existing BDMs bind denoising and cache commitment to fixed block boundaries: parallelism shrinks during intra-block denoising, while generated tokens cannot be cached until the whole block is completed. Moreover, intra-block bidirectional denoising conflicts with inter-block autoregression, creating inconsistent information flow that can hinder structure-sensitive recognition.
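The fixed-boundary limitation described above can be illustrated with a toy trace. The sketch below is not the paper's method or any real BDM implementation: it assumes a hypothetical sampler that unmasks at most `tokens_per_step` positions per denoising step, and the function name and parameters are invented for illustration. It only shows how the number of tokens decoded in parallel can drop at the end of a block, and how the cache stays unchanged until the whole block is finished.

```python
def simulate_fixed_block_decoding(total_tokens, block_size, tokens_per_step):
    """Toy trace of decoding with fixed block boundaries.

    Hypothetical simplification: each denoising step unmasks at most
    `tokens_per_step` positions inside the current block. Real samplers
    typically choose positions by confidence; that is not modeled here.

    Returns a list of tuples:
        (step, unmasked_this_step, still_masked_in_block, tokens_in_cache)
    """
    trace = []
    committed = 0  # tokens whose KV entries have been committed to the cache
    step = 0
    while committed < total_tokens:
        # The last block may be shorter than block_size.
        block_len = min(block_size, total_tokens - committed)
        masked = block_len
        while masked > 0:
            unmasked = min(tokens_per_step, masked)
            masked -= unmasked
            step += 1
            # Assumption from the abstract: nothing in the block is cached
            # until every position in the block has been denoised.
            in_cache = committed + (block_len if masked == 0 else 0)
            trace.append((step, unmasked, masked, in_cache))
        committed += block_len
    return trace


if __name__ == "__main__":
    for row in simulate_fixed_block_decoding(10, 8, 3):
        print("step=%d unmasked=%d still_masked=%d cached=%d" % row)
```

In this toy setting, a block of 8 tokens decoded 3 at a time takes three steps; the third step unmasks only 2 tokens, and the cache count stays at 0 until that step completes.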