AI RESEARCH
Controlla: Learning Controllability via Graph-Constrained Latent Geometry
arXiv CS.CV
•
ArXi:2605.16603v1 Announce Type: new Controllable multimodal generation is commonly formulated as an inference-time conditioning problem using prompts, guidance, or auxiliary modules. While effective, such approaches do not explicitly structure how semantic attributes evolve, which can lead to identity drift and inconsistent cross-modal behavior. We propose Controlla, a modular factorized-control framework that treats controllability as a property of structured latent geometry.