AI RESEARCH

CuSearch: Curriculum Rollout Sampling via Search Depth for Agentic RAG

arXiv CS.AI

ArXi:2605.11611v1 Announce Type: new Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a promising paradigm for