AI RESEARCH

Moondream Segmentation: From Words to Masks

arXiv CS.AI

ArXi:2604.02593v1 Announce Type: cross We present Moondream Segmentation, a referring image segmentation extension of Moondream 3, a vision-language model. Given an image and a referring expression, the model autoregressively decodes a vector path and iteratively refines the rasterized mask into a final detailed mask. We