Qwen Voice Clone + LTX 2.3 Image and Speech to Video. Made Locally on RTX3090

r/StableDiffusion
Generative AI Open Source AI

Another quick test using rtx 3090 24 VRAM and 96 system RAM TTS (qwen TTS) TTS is a cloned voice, generated locally via QwenTTS custom voice from this video Workflow used: Image and Speech-to-video for lipsync Used this ltx 2.3 workflow submitted by /u/Inevitable_Emu2722 [link] [comments]