AI RESEARCH
MIRROR: A Hierarchical Benchmark for Metacognitive Calibration in Large Language Models
arXiv CS.LG
•
ArXi:2604.19809v1 Announce Type: cross