Hitting a wall parsing 1,000+ complex scanned PDFs & Excel tables to JSON (CPU-only). AI newbie looking for local parser recommendations (GLM-OCR, FireRed OCR, etc.)

r/LocalLLaMA
Computer Vision

Hey everyone, I’m pretty new to the AI engineering side of things, but I've recently been tasked with a massive digitization project at work across 6 food manufacturing plants. I’ve hit a serious wall and would love some advice from the veterans here. We’re trying to move away from paper logs and digitize over 1,000 different types of field logs (production, quality, equipment maintenance) into our new MES. My goal is to extract the document metadata and the hierarchical schema (like Group > Item) from these scanned PDFs.