AI RESEARCH
SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents
arXiv CS.AI
arXiv:2602.11210v3 Announce Type: replace-cross Reinforcement learning (RL) has become a key paradigm for