AI RESEARCH
AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation
arXiv CS.AI
•
ArXi:2604.18240v1 Announce Type: new As reinforcement learning continues to scale the