AI RESEARCH

ClawBench: Can AI Agents Complete Everyday Online Tasks?

arXiv CS.CL

ArXi:2604.08523v1 Announce Type: new AI agents may be able to automate your inbox, but can they automate other routine aspects of your life? Everyday online tasks offer a realistic yet unsolved testbed for evaluating the next generation of AI agents. To this end, we