AI RESEARCH
Building Efficient RL Training for the Agentic Era
Salesforce AI Research
•
Introduction Reinforcement Learning from Human or AI Feedback (RLHF, RLAIF) has become the standard recipe for aligning large language models (LLMs). But as we push into the agentic era — where models call…