AI RESEARCH
AEC-Bench: A Multimodal Benchmark for Agentic Systems in Architecture, Engineering, and Construction
arXiv CS.AI
•
ArXi:2603.29199v1 Announce Type: new The AEC-Bench is a multimodal benchmark for evaluating agentic systems on real-world tasks in the Architecture, Engineering, and Construction (AEC) domain. The benchmark covers tasks requiring drawing understanding, cross-sheet reasoning, and construction project-level coordination. This report describes the benchmark motivation, dataset taxonomy, evaluation protocol, and baseline results across several domain-specific foundation model harnesses.