AI RESEARCH
Image Generators are Generalist Vision Learners
arXiv CS.CV
arXiv:2604.20329v1 Announce Type: new Recent works show that image and video generators exhibit zero-shot visual understanding behaviors, in a way reminiscent of how LLMs develop emergent capabilities of language understanding and reasoning from generative pre