AI RESEARCH
Can Compact Language Models Search Like Agents? Distillation-Guided Policy Optimization for Preserving Agentic RAG Capabilities
arXiv CS.CL
•
ArXi:2508.20324v4 Announce Type: replace Reinforcement Learning has emerged as a dominant post-