AI RESEARCH
It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusion Models
arXiv CS.LG
•
ArXi:2601.00090v2 Announce Type: replace-cross Contemporary text-to-image models exhibit a surprising degree of mode collapse, as can be seen when sampling several images given the same text prompt. Previous work has attempted to address this issue by steering the model using guidance mechanisms, or by generating a large pool of candidates and refining them. In this work, we take a different direction and aim for diversity in generations via noise optimization.