Mistral AI to release Voxtral TTS, a 3-billion-parameter text-to-speech model with open weights that the company says outperformed ElevenLabs Flash v2.5 in human preference tests. The model runs on about 3 GB of RAM, achieves 90-millisecond time-to-first-audio, supports nine languages.
r/LocalLLaMA
•
Generative AI
Open Source AI
VentureBeat: Mistral AI just released a text-to-speech model it says beats ElevenLabs - and it's giving away the weights for free: Mistral AI unlisted video on YouTube: Voxtral TTS. Find your voice.: Mistral new 404: submitted by /u/Nunki08 [link] [comments]