CHIM XTTS included in the Distro behaves a bit differently from other TTS services…
You can still override voices manually, the automatic in-game generation won’t overwrite voices.
If you set everything up correctly, you will be easily able to talk to ANY NPC (with an ingame voice) without any prior setup required!
There is currently no way to easily delete voices in the interface. We are looking to fix that.
You can also manually upload .wav files for generation under Configuration - CHIM XTTS Management.
Here is a guide if you want to run CHIM XTTS on the cloud to save VRAM.
We also provide support for Mantella XTTS. It requires a few configuration changes but is rather simple to set up.
You may also need to enable your firewall to allow apps through to WSL2:
https://superuser.com/questions/1714002/wsl2-connect-to-host-without-disabling-the-windows-firewall
That is all you need to do! We would recommend that you do a TTS Troubleshooting test before trying ingame. If you have issues it may be your firewall or your internal windows network.
Any voices you see from videos of people who do not wish to be generated are done by users without our permission.
WE CAN NOT CONTROL THIS!
WE DO NOT CONDONE THE USE OF THE TOOLS PROVIDED TO GENERATE AI VOICES OF THOSE WHO DO NOT WISH TO BE AI GENERATED.
We will not now or in the future provide training data or voice files for anyone who does not wish to be generated within our mod files.
Zonos TTS is one of the most powerful TTS services supported by CHIM.
It has rather life like voices and emotion, but at the cost of having a 6GB VRAM requirement.
This makes it very hard to run both Skyrim and Zonos on the same machine unless you have a super computer!
There are 3 ways you can run Zonos:
Here is a guide if you want to run Zonos on the cloud to save VRAM.
Zonos works quite simply.
Whatever voices are in your voice cache, will be used to generate an AI voice every-time it makes an TTS request. You do not need to sync any voices on startup. Using the XTTS Management upload feature will place any new voices into your cache.
If playing normally most voices for NPCs should be in your cache already.
You can manually upload new voices using the CHIM XTTS Management page to place new voices in your voice cache.
There is not too much for us to say about the xVASynth implementation. It's a decent TTS service that's been around for a few years now and is simple to install. However it does lack some voices compared to MeloTTS or CHIM XTTS.
You may also need to enable your firewall to allow apps through to WSL2:
https://superuser.com/questions/1714002/wsl2-connect-to-host-without-disabling-the-windows-firewall
It’s pretty easy to set up and install:
Click SAVE
All new profiles created will now force the AI NPC’s to use xVASynth for their voices, if you have their voice model installed.
MeloTTS is one of our recommended TTS services for you to use. It is free, runs locally, and with a low hardware requirement. Currently we have all the default approved Skyrim Voices trained using it. The quality is not as good compared to CHIM XTTS, but it will allow all players to have easy access to a comprehensive TTS service. There is no current way to easily train more voices using the Distro.
It is rather easy to set up. It can be installed using the main installation script or as an optional component folder in the Distro. You can run it on CPU (required usage for AMD users), or GPU (which is faster). After that just select it as the TTS service in the default profile, and speak to any vanilla NPC. They will automatically be allocated with an appropriate voice whenever they are activated.
We do recommend that you use CHIM XTTS if you have the hardware to do so as it is much better quality.
Here is a guide to run it on the cloud (down below in this document): CHIM Manual
The reason we have MeloTTS support is for users who can not run other more powerful TTS services while supporting all Skyrim like voices.
All our current MeloTTS voices are listed at the bottom of this document.
More info on MeloTTS can be found here: https://github.com/myshell-ai/MeloTTS
Piper is another TTS service that can run efficiently on low powered machines. It has a better quality than MeloTTS, but worse than XTTS.
It also uses the same voiceid logic as MeloTTS. List can be found below.
However you will need to manually download voices for it to work with CHIM.
How to install Piper Voices:
Piper Voice Links:
Mantella (Nexus, main mod), file size 4.1GB,
https://www.nexusmods.com/skyrimspecialedition/mods/98631?tab=files&file_id=632328
Mantella - Expanded Piper Models List (Nexus, optional), file size 1.3GB
https://www.nexusmods.com/skyrimspecialedition/mods/98631?tab=files&file_id=591368
Mantella Missing Voice Files - Male and Female Child (.onnx files), (Nexus):
https://www.nexusmods.com/skyrimspecialedition/mods/139736?tab=files&file_id=586124
Mantella Piper ZoraFairChild (Nexus)
https://www.nexusmods.com/skyrimspecialedition/mods/143687?tab=files&file_id=602216
Other Voices
https://huggingface.co/rhasspy/piper-voices
https://brycebeattie.com/files/tts/