AI RESEARCH

Physion-Eval: Evaluating Physical Realism in Generated Video via Human Reasoning

arXiv CS.CV

ArXi:2603.19607v1 Announce Type: new Video generation models are increasingly used as world simulators for storytelling, simulation, and embodied AI. As these models advance, a key question arises: do generated videos obey the physical laws of the real world? Existing evaluations largely rely on automated metrics or coarse human judgments such as preferences or rubric-based checks. While useful for assessing perceptual quality, these methods provide limited insight into when and why generated dynamics violate real-world physical constraints. We.