Semantic Alignment in Hyperbolic Space for Open-Vocabulary Semantic Segmentation

arXiv cs.CV

arXiv:2605.08874v1 Announce Type: new

Open-vocabulary semantic segmentation requires adapting image-level vision-language models such as CLIP to dense, pixel-level prediction. This is challenging because of a mismatch between hierarchical structure and semantic alignment in the embedding space. Recent works leverage hyperbolic geometry to model hierarchical relationships; they align embeddings across hierarchical levels but overlook semantic misalignment among embeddings within the same level.