AI RESEARCH

Exploration Hacking: Can LLMs Learn to Resist RL Training?

arXiv CS.LG

ArXi:2604.28182v1 Announce Type: new Reinforcement learning (RL) has become essential to the post-