Precise Shield: Explaining and Aligning VLLM Safety via Neuron-Level Guidance

ArXi:2604.08881v1 Announce Type: new In real-world deployments, Vision-Language Large Models (VLLMs) face critical challenges from multilingual and multimodal composite attacks: harmful images paired with low-resource language texts can easily bypass defenses designed for high-resource language scenarios, exposing structural blind spots in current cross-lingual and cross-modal safety methods.