AI RESEARCH

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

Hugging Face Blog

Task Description Evaluation Framework Error Analysis