AI RESEARCH
CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing
arXiv CS.CL
•
ArXi:2605.03903v1 Announce Type: new Large Multimodal Models (LMMs) have recently shown strong performance on Optical Character Recognition (OCR) tasks, nstrating their promising capability in document literacy. However, their effectiveness in real-world applications remains underexplored, as existing benchmarks adopt task scopes misaligned with practical applications and assume homogeneous acquisition conditions. To address this gap, we