AI RESEARCH

Masked Next-Scale Prediction for Self-supervised Scene Text Recognition

arXiv CS.CV

ArXi:2605.14885v1 Announce Type: new Scene Text Recognition requires modeling visual structures that evolve from coarse layouts to fine-grained character strokes