AI RESEARCH

OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model

arXiv CS.AI

arXiv:2602.12304v3 Announce Type: replace-cross

Existing mainstream video customization methods focus on generating identity-consistent videos from given reference images and textual prompts. Building on the rapid advancement of joint audio-video generation, this paper proposes a compelling new task: sync audio-video customization, which aims to customize video identity and audio timbre synchronously.