AI RESEARCH
Guiding a Diffusion Model by Swapping Its Tokens
arXiv CS.CV
arXiv:2604.08048v1. Classifier-Free Guidance (CFG) is a widely used inference-time technique for improving the image quality of diffusion models. However, its reliance on text conditions prevents its use in unconditional generation. We propose a simple method that enables CFG-like guidance for both conditional and unconditional generation. The key idea is to generate a perturbed prediction via simple token swap operations and to use the direction between it and the clean prediction to steer sampling toward higher-fidelity distributions.
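The guidance step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say where in the model the swap is applied, how many tokens are swapped, or the exact update rule. The sketch assumes the swap is applied to the input token sequence, that disjoint random pairs of tokens are exchanged, and that the update mirrors CFG by extrapolating from the perturbed prediction through the clean one. The names `swap_tokens`, `guided_prediction`, `scale`, `num_swaps`, and `toy_denoiser` are all hypothetical.

```python
import numpy as np


def swap_tokens(tokens, num_swaps, rng):
    """Return a copy of `tokens` (shape [N, D]) with `num_swaps` disjoint
    random pairs of rows exchanged. Assumed perturbation, not the paper's."""
    n = tokens.shape[0]
    if 2 * num_swaps > n:
        raise ValueError("need at least 2 * num_swaps tokens")
    # Pick 2 * num_swaps distinct positions and split them into two halves.
    picked = rng.permutation(n)[: 2 * num_swaps]
    a, b = picked[:num_swaps], picked[num_swaps:]
    swapped = tokens.copy()
    swapped[a] = tokens[b]
    swapped[b] = tokens[a]
    return swapped


def guided_prediction(denoiser, tokens, scale, num_swaps, rng):
    """CFG-like update (assumed form): push the clean prediction away
    from the prediction made on token-swapped input."""
    clean = denoiser(tokens)
    perturbed = denoiser(swap_tokens(tokens, num_swaps, rng))
    return clean + scale * (clean - perturbed)


def toy_denoiser(tokens):
    """Hypothetical stand-in for a diffusion model: a position-dependent
    map, so that swapping tokens actually changes the output."""
    return np.cumsum(tokens, axis=0)


rng = np.random.default_rng(0)
x_t = rng.normal(size=(8, 4))  # 8 tokens with 4 channels each
guided = guided_prediction(toy_denoiser, x_t, scale=1.5, num_swaps=2, rng=rng)
```

With `scale=0` the sketch reduces to the clean prediction, just as standard CFG reduces to the unguided prediction when guidance is switched off. No text condition appears anywhere in the update, which is what lets the same step apply to unconditional generation.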