AI RESEARCH
MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination
arXiv CS.CL
•
ArXi:2603.24579v1 Announce Type: new Hallucination remains a critical bottleneck for large language models (LLMs), undermining their reliability in real-world applications, especially in Retrieval-Augmented Generation (RAG) systems. While existing hallucination detection methods employ LLM-as-a-judge to verify LLM outputs against retrieved evidence, they suffer from inherent confirmation bias, where the verifier inadvertently reproduces the errors of the original generation. To address this, we.