AI RESEARCH

Exploration Hacking: Can LLMs Learn to Resist RL Training?

arXiv CS.LG • May 01, 2026

ArXi:2604.28182v1 Announce Type: new Reinforcement learning (RL) has become essential to the post-