AI RESEARCH
Hierarchical Textual Knowledge for Enhanced Image Clustering
arXiv CS.CL
•
ArXi:2604.11144v1 Announce Type: cross Image clustering aims to group images in an unsupervised fashion. Traditional methods focus on knowledge from visual space, making it difficult to distinguish between visually similar but semantically different classes. Recent advances in vision-language models enable the use of textual knowledge to enhance image clustering. However, most existing methods rely on coarse class labels or simple nouns, overlooking the rich conceptual and attribute-level semantics embedded in textual space.