AI RESEARCH
DocRevive: A Unified Pipeline for Document Text Restoration
arXiv CS.CV
arXiv:2604.10077v1 Announce Type: new In Document Understanding, the challenge of reconstructing damaged, occluded, or incomplete text remains a critical yet unexplored problem. Subsequent document understanding tasks can benefit from a document reconstruction process. In response, this paper presents a novel unified pipeline combining state-of-the-art Optical Character Recognition (OCR), advanced image analysis, masked language modeling, and diffusion-based models to restore and reconstruct text while preserving visual integrity.