AI RESEARCH
Exploration Hacking: Can LLMs Learn to Resist RL Training?
arXiv CS.LG
•
ArXi:2604.28182v1 Announce Type: new Reinforcement learning (RL) has become essential to the post-