AI RESEARCH

Structured Layout Priors for Robust Out-of-Distribution Visual Document Understanding

arXiv CS.CV

ArXi:2605.19866v1 Announce Type: new Vision-Language Models (VLMs) parse documents end-to-end but frequently break down on layouts unlike those seen in