Quid est VERITAS? A Modular Framework for Archival Document Analysis

arXiv cs.AI

arXiv:2603.28108v1 Announce Type: cross

The digitisation of historical documents has traditionally been conceived as a process limited to character-level transcription, producing flat text that lacks the structural and semantic information necessary for substantive computational analysis. We present VERITAS (Vision-Enhanced Reading, Interpretation, and Transcription of Archival Sources), a modular, model-agnostic framework that reconceptualises digitisation as an integrated workflow encompassing transcription, layout analysis, and semantic enrichment.