AI RESEARCH

MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination

arXiv CS.CL

ArXi:2603.24579v1 Announce Type: new Hallucination remains a critical bottleneck for large language models (LLMs), undermining their reliability in real-world applications, especially in Retrieval-Augmented Generation (RAG) systems. While existing hallucination detection methods employ LLM-as-a-judge to verify LLM outputs against retrieved evidence, they suffer from inherent confirmation bias, where the verifier inadvertently reproduces the errors of the original generation. To address this, we.