AI RESEARCH
Jailbreaking Frontier Foundation Models Through Intention Deception
arXiv CS.AI
•
ArXi:2604.24082v1 Announce Type: cross Large (vision-)language models exhibit remarkable capability but remain highly susceptible to jailbreaking. Existing safety