AI RESEARCH
Structured Layout Priors for Robust Out-of-Distribution Visual Document Understanding
arXiv CS.CV
•
ArXi:2605.19866v1 Announce Type: new Vision-Language Models (VLMs) parse documents end-to-end but frequently break down on layouts unlike those seen in