Looking for OCR for AI papers (math-heavy PDFs) — FireRed-OCR vs DeepSeek-OCR vs MonkeyOCR?
r/LocalLLaMA
•
Computer Vision
Open Source AI
AI Research
Right now I’m trying to build a workflow for extracting content from recent AI research papers (mostly arXi PDFs) so I can speed up reading, indexing, and note-taking. The catch is: these papers are not “clean text” documents. They usually include: Dense mathematical formulas (often LaTeX-heavy) Multi-column layouts Complex tables Figures/diagrams embedded with captions Mixed reading order issues So for me, plain OCR accuracy is not enough - I care a lot about structure + formulas + layout consistency.