InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

ArXi:2604.27419v1 Announce Type: new With the advancement of multimodal large language models (MLLMs) and coding agents, the website development has shifted from manual programming to agent-based project-level code synthesis. Existing benchmarks rely on idealized assumptions, especially for well-structured, information-rich inputs and static execution settings.