AI RESEARCH

SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents

arXiv CS.AI

ArXi:2602.11210v3 Announce Type: replace-cross Reinforcement learning (RL) has become a key paradigm for