AICA-Bench: Holistically Examining the Capabilities of VLMs in Affective Image Content Analysis

arXiv:2604.05900v1 Announce Type: new Vision-Language Models (VLMs) have demonstrated strong capabilities in perception, yet holistic Affective Image Content Analysis (AICA), which integrates perception, reasoning, and generation into a unified framework, remains underexplored. To address this gap, we