AI RESEARCH
Models Know Their Shortcuts: Deployment-Time Shortcut Mitigation
arXiv CS.LG
•
ArXi:2604.12277v1 Announce Type: new Pretrained language models often rely on superficial features that appear predictive during