LongCat-AudioDiT: High-Fidelity Diffusion Text-to-Speech in the Waveform Latent Space

r/LocalLLaMA
Generative AI AI Research AI Tools

HuggingFace: GitHub: Announcement: submitted by /u/DreamGenX [link] [comments]