AI RESEARCH
Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models
arXiv CS.CL
arXiv:2604.25011v1 Announce Type: new Reinforcement learning (RL)-based post-