AI RESEARCH
Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models
arXiv CS.AI
•
ArXi:2604.03179v1 Announce Type: cross The recent success of reinforcement learning (RL) in large reasoning models has inspired the growing adoption of RL for post-