AI RESEARCH

Generative Blocks World: Moving Things Around in Pictures

arXiv CS.CV

ArXi:2506.20703v2 Announce Type: replace-cross We describe Generative Blocks World to interact with the scene of a generated image by manipulating simple geometric abstractions. Our method represents scenes as assemblies of convex 3D primitives, and the same scene can be represented by different numbers of primitives, allowing an editor to move either whole structures or small details. Once the scene geometry has been edited, the image is generated by a flow-based method, which is conditioned on depth and a texture hint.