AI RESEARCH
From Pixels to Concepts: Do Segmentation Models Understand What They Segment?
arXiv CS.CV
•
ArXi:2605.09591v1 Announce Type: new Segmentation is a fundamental vision task underlying numerous downstream applications. Recent promptable segmentation models, such as Segment Anything Model 3 (SAM3), extend segmentation from category-agnostic mask prediction to concept-guided localization conditioned on high-level textual prompts. However, existing benchmarks primarily evaluate mask accuracy or object presence, leaving unclear whether these models faithfully ground the queried concept or instead rely on visually salient but semantically misleading cues. We