AI RESEARCH

DepthPilot: From Controllability to Interpretability in Colonoscopy Video Generation

arXiv CS.AI

ArXi:2604.26232v1 Announce Type: cross Controllable medical video generation has achieved remarkable progress, but it still lacks interpretability, which requires the alignment of generated contents with physical priors and faithful clinical manifestations. To push the boundaries from mere controllability to interpretability, we propose DepthPilot, the first interpretable framework for colonoscopy video generation. This work takes a step toward trustworthy generation through two synergistic paradigms.