AI RESEARCH
Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning
arXiv CS.AI
•
ArXi:2509.25300v4 Announce Type: replace-cross While scaling laws for large language models (LLMs) during pre-