AI RESEARCH

Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models

arXiv CS.AI

ArXi:2604.03179v1 Announce Type: cross The recent success of reinforcement learning (RL) in large reasoning models has inspired the growing adoption of RL for post-