Teaching AI models to say “I’m not sure”

MIT CSAIL's “Reinforcement Learning with Calibration Rewards” technique improves AI confidence estimates without sacrificing performance, addressing a root cause of hallucination in reasoning models.