AI RESEARCH

Image Generators are Generalist Vision Learners

arXiv CS.CV

arXiv:2604.20329v1 Announce Type: new Recent works show that image and video generators exhibit zero-shot visual understanding behaviors, in a way reminiscent of how LLMs develop emergent capabilities of language understanding and reasoning from generative pre