Two Frames Matter: A Temporal Attack for Text-to-Video Model Jailbreaking

arXiv:2603.07028v1 Announce Type: cross Recent text-to-video (T2V) models can synthesize complex videos from lightweight natural language prompts, raising urgent concerns about safety alignment under real-world misuse. Prior jailbreak attacks typically rewrite unsafe prompts into paraphrases that evade content filters while preserving meaning. Yet these approaches often retain explicit sensitive cues in the input text and therefore overlook a profound, video-specific weakness.