Inpainting in 3 commands: remove objects or add accessories with any base model, no dedicated inpaint model needed

r/StableDiffusion
Generative AI

Removed people from a street photo and added sunglasses to a portrait; all from the terminal, 3 commands each. No Photoshop. No UI. No dedicated inpaint model; works with flux klein or z-image. Two different masking strategies depending on the task: Object removal: vision ground (Qwen3-VL-8B) → process segment (SAM) → inpaint. SAM shines here, clean person silhouette. Add accessories: vision ground "eyes" → bbox + --expand 70 → inpaint. Skipped SAM intentionally - it returns two eye-shaped masks, useless for placing sunglasses. Expanded bbox gives you the right region.