AI RESEARCH

CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing

arXiv CS.CL

ArXi:2605.03903v1 Announce Type: new Large Multimodal Models (LMMs) have recently shown strong performance on Optical Character Recognition (OCR) tasks, nstrating their promising capability in document literacy. However, their effectiveness in real-world applications remains underexplored, as existing benchmarks adopt task scopes misaligned with practical applications and assume homogeneous acquisition conditions. To address this gap, we