ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning

ArXi:2603.09692v1 Announce Type: cross Reinforcement Learning from Human Feedback (RLHF) has become the standard for aligning Large Language Models (LLMs), yet its efficacy is bottlenecked by the high cost of acquiring preference data, especially in low-resource and expert domains. To address this, we