Mistral's first open-weight TTS model Voxtral clones voices from three seconds of audio across nine languages

The Decoder
Generative AI Open Source AI AI Business

French AI startup Mistral has released Voxtral TTS, its first text-to-speech model that s nine languages and can clone voices from just three seconds of audio.