AI RESEARCH
From Diffusion to Rectified Flow: Rethinking Text-Based Segmentation
arXiv CS.AI
•
ArXi:2605.04590v1 Announce Type: cross Text-based image segmentation aims to delineate object boundaries within an image from text prompts, offering higher flexibility and broader application scope compared to traditional fixed-category segmentation tasks. Recent studies have shown that diffusion models (e.g., Stable Diffusion) can provide rich multimodal semantic features, leading to studies of using diffusion models as feature extractors for segmentation tasks. Such methods, however, inherit the generative natures of diffusion models that are harmful to discriminative segmentation tasks.