Models

SpeechToolkit supports several different models for various tasks.

Note that the license "Same" indicates that this is

Text-to-Speech

Below is a list of models supported for text-to-speech:

Name License Link
StyleTTS 2 MIT Repository

Note: StyleTTS 2 by default uses a GPL-licensed phonemizer but we've replaced it with the BSD-licensed OpenPhonemizer.

Automatic Speech Recognition

Below is a list of supported models for automatic speech recognition.

Name License Link
Whisper MIT Repository

Speech Classification

NOTE: Classification models are not very accurate yet.

SpeechToolkit supports several different types of speech classification. These models are trained by ML for Speech.

Version Task Link
V1 Language Classification Model

Voice Conversion

Below is a list of supported models for voice conversion.

Name License Link
LVC-VC MIT Repository
NS3VC MIT Repository

A Short Guide to Licenses

Note that this is not legal advice.

Please note that models may have a different license than SpeechToolkit. If this is the case, you must comply with both SpeechToolkit and the license of the model.

If you're wondering whether or not you can use a model commercially, you should check both the model's license and the pretrained weights' license. The MIT, Apache 2.0, and BSD licenses typically allow commercial use, unless otherwise specified by the authors. However, the BSD-4-Clause license requires you to provide attribution to the author in certain marketing materials (read the full license for details). If the license name includes "NC," it is likely a non-commercial license, which means you cannot use it commercially. Also note that some models may be trained on copyrighted content, which, depending on your jurisdiction, may influence the ability for you to use the models.

Before using models, you should carefully read their licenses.

Disclaimer

Disclaimer for models trained by SpeechToolkit:

THE MODEL IS PROVIDED "AS IS" AND THE AUTHOR DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS MODEL INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS MODEL.