AI RESEARCH

What Happens Before Decoding? Prefill Determines GUI Grounding in VLMs

arXiv CS.CV

ArXi:2605.12549v1 Announce Type: new