AI RESEARCH
Teaching AI models to say “I’m not sure”
MIT AI News
•
MIT CSAIL's “Reinforcement Learning with Calibration Rewards” technique improves AI confidence estimates without sacrificing performance, addressing a root cause of hallucination in reasoning models.