AI RESEARCH
Dynamic Chunking Diffusion Transformer
arXiv cs.AI
arXiv:2603.06351v1 Announce Type: cross Diffusion Transformers process images as fixed-length sequences of tokens produced by a static $\textit{patchify}$ operation. While effective, this design spends uniform compute on low- and high-information regions alike, ignoring that images contain regions of varying detail and that the denoising process progresses from coarse structure at early timesteps to fine detail at late timesteps. We