Is Conformal Factuality for RAG-based LLMs Robust? Novel Metrics and Systematic Insights

ArXi:2603.16817v1 Announce Type: new Large language models (LLMs) frequently hallucinate, limiting their reliability in knowledge-intensive applications. Retrieval-augmented generation (RAG) and conformal factuality have emerged as potential ways to address this limitation. While RAG aims to ground responses in retrieved evidence, it provides no statistical guarantee that the final output is correct.