AI RESEARCH
Text-Guided Multimodal Unified Industrial Anomaly Detection
arXiv CS.CV
•
ArXi:2604.22899v1 Announce Type: new Industrial anomaly detection based on RGB-3D multimodal data has emerged as a mainstream paradigm for intelligent quality inspection. However, existing unsupervised methods suffer from two critical limitations: ambiguous cross-modal alignment caused by the lack of high-level semantic guidance and insufficient geometric modeling for RGB-to-3D feature mapping. To address these issues, we propose a unified multimodal industrial anomaly detection framework guided by text semantics.