AI RESEARCH
ClawBench: Can AI Agents Complete Everyday Online Tasks?
arXiv CS.CL
•
ArXi:2604.08523v1 Announce Type: new AI agents may be able to automate your inbox, but can they automate other routine aspects of your life? Everyday online tasks offer a realistic yet unsolved testbed for evaluating the next generation of AI agents. To this end, we