AI RESEARCH

From PDF to RAG-Ready: Evaluating Document Conversion Frameworks for Domain-Specific Question Answering

arXiv CS.AI

ArXi:2604.04948v1 Announce Type: cross Retrieval-Augmented Generation (RAG) systems depend critically on the quality of document preprocessing, yet no prior study has evaluated PDF processing frameworks by their impact on downstream question-answering accuracy.