Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection

ArXi:2603.06604v1 Announce Type: new As large language models (LLMs) are increasingly deployed in critical decision-making systems, the lack of reliable methods to measure their uncertainty presents a fundamental trustworthiness risk. We