AI RESEARCH

From Edges to Depth: Probing the Spatial Hierarchy in Vision Transformers

arXiv CS.LG

ArXi:2604.23452v1 Announce Type: cross Vision Transformers trained only on image classification routinely transfer to tasks that demand spatial understanding, yet they receive no spatial supervision during pre