Anthropic Used Claude To Beat Its Own Human Alignment Researchers (At $22 An Hour)
The Neuron • Generative AI
Anthropic's new paper shows that nine parallel Claude agents recovered 97% of the performance gap on an alignment-research problem. Its human researchers recovered 23% of that gap in seven days. Total cost: $18K. The number to internalize is $22 per agent-hour.
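As a rough sanity check on how the two dollar figures relate, the sketch below divides the quoted total by the quoted hourly rate. It assumes both numbers are rounded and that the work was split evenly across the nine agents, which the summary does not state; the function name `implied_agent_hours` is purely illustrative.

```python
# Figures quoted in the summary; both are presumably rounded.
TOTAL_COST_USD = 18_000       # "$18K"
COST_PER_AGENT_HOUR = 22      # "$22 per agent-hour"
NUM_AGENTS = 9                # "nine parallel Claude agents"


def implied_agent_hours(total_cost: float, hourly_rate: float) -> float:
    """Return the number of agent-hours implied by a total cost and an hourly rate."""
    return total_cost / hourly_rate


agent_hours = implied_agent_hours(TOTAL_COST_USD, COST_PER_AGENT_HOUR)

# Assumption: an even split across agents, which the source does not confirm.
hours_per_agent = agent_hours / NUM_AGENTS

print(f"{agent_hours:.0f} agent-hours, about {hours_per_agent:.0f} hours per agent")
```

Under those assumptions, the figures imply roughly 818 agent-hours in total, or about 91 hours per agent.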