AI RESEARCH
Parser-Oriented Structural Refinement for a Stable Layout Interface in Document Parsing
arXiv CS.CV
arXiv:2604.02692v1 Announce Type: new Accurate document parsing requires both robust content recognition and a stable parser interface. In explicit Document Layout Analysis (DLA) pipelines, downstream parsers do not consume the full detector output. Instead, they operate on a retained and serialized set of layout instances. However, on dense pages with overlapping regions and ambiguous boundaries, unstable layout hypotheses can make the retained instance set inconsistent with its parser input order, leading to severe downstream parsing errors. To address this issue, we