AI RESEARCH

Can You Learn to See Without Images? Procedural Warm-Up for Vision Transformers

arXiv CS.CV

ArXi:2511.13945v2 Announce Type: replace Transformers are remarkably versatile, suggesting the existence of generic inductive biases beneficial across modalities. In this work, we explore a new way to instil such biases in vision transformers (ViTs) through pre