Self-Calibrating Language Models via Test-Time Discriminative Distillation

ArXi:2604.09624v1 Announce Type: cross Large language models (LLMs) are systematically overconfident: they routinely express high certainty on questions they often answer incorrectly. Existing calibration methods either require labeled validation data, degrade under distribution shifts, or incur substantial inference costs.