Can the text encoder in LTX2.3 be replaced by another model?
r/StableDiffusion
•
Generative AI
AI Research
LTX2.3 uses gemma3 12b it as it's text encoder, I was wondering if it could be swapped with some qwen3.5 variant or something else to potentially get better results, or is the model built around that specific LLM? submitted by /u/blastbottles [link] [comments]