AI RESEARCH

Taxonomy-Aware Representation Alignment for Hierarchical Visual Recognition with Large Multimodal Models

arXiv CS.AI

ArXi:2603.00431v2 Announce Type: replace-cross A high-performing, general-purpose visual understanding model should map visual inputs to a taxonomic tree of labels, identify novel categories beyond the