AI RESEARCH
Metacognitive Behavioral Tuning of Large Language Models for Multi-Hop Question Answering
arXiv CS.AI
•
ArXi:2602.22508v2 Announce Type: replace Large Language Models (LLMs) often produce incorrect answers on multi-hop question answering even when the reasoning trace already contains a correct intermediate