AI RESEARCH
Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents
Hugging Face Blog
Task Description, Evaluation Framework, Error Analysis