AI RESEARCH
A Functionality-Grounded Benchmark for Evaluating Web Agents in E-commerce Domains
arXiv CS.AI
•
ArXi:2508.15832v2 Announce Type: replace-cross Web agents have shown great promise in performing many tasks on ecommerce website. To assess their capabilities, several benchmarks have been