Miso Labs Releases MisoTTS: An 8B Emotive Text-to-Speech Model with Open Weights
Miso Labs has launched MisoTTS, an open-weights, 8-billion-parameter text-to-speech model. It generates expressive speech from both text and audio context. The model uses residual vector quantization (RVQ) to widen its acoustic range. Rather than scaling up a single flat vocabulary, this approach extends what the model can represent while keeping the parameter count fixed.

What is MisoTTS

MisoTTS is an 8B-parameter text-to-dialogue…
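
To make the RVQ idea mentioned above concrete, here is a minimal sketch of residual vector quantization in general, not MisoTTS's actual tokenizer. The stage count, codebook size, and vector dimension are hypothetical toy values, and the codebooks are random rather than learned. Each stage quantizes whatever the previous stages left unexplained, so a small number of stored codewords can address a very large number of combinations.

```python
import numpy as np


def rvq_encode(x, codebooks):
    """Greedily quantize x with a stack of codebooks, one index per stage."""
    residual = x.astype(np.float64)
    codes = []
    for cb in codebooks:
        # Pick the codeword closest to what is still unexplained.
        dists = np.sum((cb - residual) ** 2, axis=1)
        idx = int(np.argmin(dists))
        codes.append(idx)
        # Pass the remaining error on to the next stage.
        residual = residual - cb[idx]
    return codes


def rvq_decode(codes, codebooks):
    """Reconstruct by summing the selected codeword from each stage."""
    return sum(cb[i] for cb, i in zip(codebooks, codes))


# Hypothetical toy sizes, chosen only for illustration.
num_stages, codebook_size, dim = 4, 16, 8

rng = np.random.default_rng(0)
# Random stand-ins for learned codebooks; later stages use smaller scales.
codebooks = [
    rng.normal(scale=0.5 ** s, size=(codebook_size, dim))
    for s in range(num_stages)
]

x = rng.normal(size=dim)
codes = rvq_encode(x, codebooks)
x_hat = rvq_decode(codes, codebooks)

stored_codewords = num_stages * codebook_size    # 4 * 16 = 64
addressable_codes = codebook_size ** num_stages  # 16 ** 4 = 65536
```

In this toy setup, 64 stored codewords address 65,536 distinct code combinations, which is the general reason stacked codebooks can stand in for one much larger flat vocabulary.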
