LTX 2.3 audio as standalone speech model.
r/StableDiffusion
A user on X posted about this new model. Has anyone here tried it yet?

LTX 2.3 audio as standalone speech model. Emotional TTS with Scenema Audio.
- Zero-shot expressive voice cloning, speech gen
- 8-step distilled with Gemma 3 12B text encoding
- stage directions via