URMF: Uncertainty-aware Robust Multimodal Fusion for Multimodal Sarcasm Detection

arXiv:2604.06728v1 Announce Type: new

Multimodal sarcasm detection (MSD) aims to identify sarcastic intent from semantic incongruity between text and image. Although recent methods have improved MSD through cross-modal interaction and incongruity reasoning, they often assume that all modalities are equally reliable. In real-world social media, however, textual content may be ambiguous and visual content may be weakly relevant or even irrelevant, causing deterministic fusion to