AI RESEARCH
AMIGO: Agentic Multi-Image Grounding Oracle Benchmark
arXiv CS.AI
•
ArXi:2603.28662v1 Announce Type: cross Agentic vision-language models increasingly act through extended interactions, but most evaluations still focus on single-image, single-turn correctness. We