Agent skills look great in benchmarks but fall apart under realistic conditions, researchers find
The Decoder
•
Generative AI
AI agents are supposed to tap into specialized knowledge through so-called skills, modular instructions they can pull up on the fly. But a study testing 34,000 real-world skills finds these enhancements barely help under realistic conditions. Weaker models actually perform worse with them than without.