AI RESEARCH

UHR-Micro: Diagnosing and Mitigating the Resolution Illusion in Earth Observation VLMs

arXiv CS.CV

ArXi:2605.12237v1 Announce Type: new Vision-Language Models (VLMs) increasingly operate on ultra-high-resolution (UHR) Earth observation imagery, yet they remain vulnerable to a severe scale mismatch between large-scale scene context and micro-scale targets. We refer to this empirical gap as a "resolution illusion": higher input resolution provides the appearance of richer visual detail, but does not necessarily yield reliable perception of spatially small, task-relevant evidence. To benchmark this challenge, we.