TraversalBench: Challenging Paths to Follow for Vision Language Models

ArXi:2604.10999v1 Announce Type: new Vision-language models (VLMs) perform strongly on many multimodal benchmarks. However, the ability to follow complex visual paths -- a task that human observers typically find straightforward -- remains under-tested. We