AI RESEARCH

Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics

arXiv CS.CV

ArXi:2604.08503v1 Announce Type: new Recent advances in generative video modeling, driven by large-scale datasets and powerful architectures, have yielded remarkable visual realism. However, emerging evidence suggests that simply scaling data and model size does not endow these systems with an understanding of the underlying physical laws that govern real-world dynamics. Existing approaches often fail to capture or enforce such physical consistency, resulting in unrealistic motion and dynamics.