GeoArena: Evaluating Open-World Geographic Reasoning in Large Vision-Language Models

ArXi:2509.04334v4 Announce Type: replace Geographic reasoning is a fundamental cognitive capability that requires models to infer plausible locations by synthesizing visual evidence with spatial world knowledge. Despite recent advances in large vision-language models (LVLMs), existing evaluation paradigms remain largely outcome-centric, relying on static datasets and predefined labels that are conceptually misaligned with open-world geographic inference.