Multimodal Coding Agents Benchmark (GitHub Repo)

TLDR AI
AI Research

Vision2Web is a benchmark for evaluating multimodal agents on end-to-end website development tasks across the full software lifecycle.