AI RESEARCH

GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

arXiv CS.AI

ArXi:2604.04172v1 Announce Type: cross In many science papers, "Figure 1" serves as the primary visual summary of the core research idea. These figures are visually simple yet conceptually rich, often requiring significant effort and iteration by human authors to get right, highlighting the difficulty of science visual communication. With this intuition, we