Relationship-Aware Safety Unlearning for Multimodal LLMs
arXiv CS.AI
arXiv:2603.14185v1 Announce Type: new Generative multimodal models can exhibit safety failures that are inherently relational: two benign concepts can become unsafe when linked by a specific action or relation (e.g., child-drinking-wine). Existing unlearning and concept-erasure approaches often target isolated concepts or image-text pairs, which can cause collateral damage to benign uses of the same objects and relations.
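
To make the collateral-damage point concrete, the following minimal sketch contrasts a concept-level filter with a relation-aware one on the abstract's own example. It is only an illustration of the problem statement, not the paper's unlearning method: the `Triple` type, the `UNSAFE_TRIPLES` set, and both filter functions are hypothetical names introduced here, and a simple lookup stands in for whatever the model actually learns or forgets.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """Hypothetical (subject, relation, object) representation of a request."""
    subject: str
    relation: str
    obj: str


# Assumption: a single unsafe combination, taken from the abstract's example.
UNSAFE_TRIPLES = {Triple("child", "drinking", "wine")}


def blocked_by_concept(t: Triple, banned_concepts: set[str]) -> bool:
    """Concept-level filter: blocks anything that mentions a banned concept."""
    return t.subject in banned_concepts or t.obj in banned_concepts


def blocked_by_relation(t: Triple) -> bool:
    """Relation-aware filter: blocks only the exact unsafe combination."""
    return t in UNSAFE_TRIPLES


if __name__ == "__main__":
    queries = [
        Triple("child", "drinking", "wine"),   # unsafe combination
        Triple("adult", "drinking", "wine"),   # benign use of the same object
        Triple("child", "drinking", "water"),  # benign use of the same subject and relation
    ]
    for q in queries:
        print(q, blocked_by_concept(q, {"wine"}), blocked_by_relation(q))
```

Under these assumptions, banning the isolated concept "wine" also blocks the benign adult-drinking-wine request, whereas the relation-aware check blocks only the child-drinking-wine combination and leaves both benign requests untouched.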