AI RESEARCH
ScenePilot-Bench: A Large-Scale Dataset and Benchmark for Evaluation of Vision-Language Models in Autonomous Driving
arXiv CS.CV
•
ArXi:2601.19582v2 Announce Type: replace In this paper, we