I fine-tuned Qwen3.5-2B for OCR

r/LocalLLaMA

Hey everyone, I’ve been working on fine-tuning vision-language models for OCR tasks and wanted to share my latest release. It's a fine-tuned Qwen3.5-2B specifically optimized for English/LTR Document OCR.

Model link: loay/English-Document-OCR-Qwen3.5-2B

I’d love to hear your feedback, especially if you test it out on messy documents or specific edge cases. Let me know how it performs for you!

submitted by /u/Other-Confusion2974