AI RESEARCH
Subliminal Signals in Preference Labels
arXiv CS.LG
•
ArXi:2603.01204v2 Announce Type: replace As AI systems approach superhuman capabilities, scalable oversight increasingly relies on LLM-as-a-judge frameworks where models evaluate and guide each other's