V2X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views

ArXi:2604.02710v1 Announce Type: cross Multimodal large language models (MLLMs) have shown strong potential for autonomous driving, yet existing benchmarks remain largely ego-centric and. therefore. cannot systematically assess model performance in infrastructure-centric and cooperative driving conditions. In this work, we