AI RESEARCH
CuSearch: Curriculum Rollout Sampling via Search Depth for Agentic RAG
arXiv CS.AI
•
ArXi:2605.11611v1 Announce Type: new Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a promising paradigm for