Wendigos Voice Cloning

The masked can now mimic the voices of your friends

By Tim_Shaw

https://github.com/TimShaw1/Wendigos-Mod

Date uploaded	a week ago
Version	2.0.0
Download link	Tim_Shaw-Wendigos_Voice_Cloning-2.0.0.zip
Downloads	2628
Dependency string	Tim_Shaw-Wendigos_Voice_Cloning-2.0.0

This mod requires the following mods to function

BepInEx-BepInExPack

BepInEx pack for Mono Unity games. Preconfigured and ready to use.

Preferred version: 5.4.2100

qwbarch-NAudioLame

Wrapper for libmp3lame.dll to add MP3 encoding support to NAudio.

Preferred version: 2.1.1

Tim_Shaw-VoiceBox

A Unified AI Services Framework

Preferred version: 0.3.2

README

Wendigos Voice Cloning (Mod for Lethal Company)

LC Tnail 2

The Masked have learned how to copy the voices of your friends. Can you tell who's real and who's fake?

This mod requires every player to have the mod to function corrcectly.

Here's a video I made showing the mod off!

Here's a SECOND video I made showing the real time stuff off!

Features

Real-time AI generated responses to players, allowing the masked to hold full conversations with players.
Automatic clip collection and transcription for intelligent responses to players (even without realtime responses enabled).
Optional automatic voice cloning (no more cloning in advance!).
Host config sync to streamline setup for non-technical friends.
Masked improvements (hidden nametags, disable mask and zombie arms)

First Time Setup (not real time) (free services)

1. Run the game

First, run the game once to generate the config file Wendigos.cfg.

2. Get your Chat API key (Gemini Example) and set up the Chat config

Chat example: to get a Gemini API key go to https://aistudio.google.com/ and click "Get API key". Then create a project and create an API key.
Then, set your api key, chat service provider, and model in Wendigos.cfg.

3. Get your STT API key (Azure Example) and set up STT

Go to https://portal.azure.com/ and create an account and a new resource group. You will most likely be prompted to create a subscription, when you're doing that it's fine to pick the FREE subscription, no need for pay-as-you-go. Make sure to pick a region that is CLOSEST to your real-life location for best results.

After that's done, go into your new resource group and press create, look for "Speech" under AI services, click create, punch in all the required info (name, resource group, etc), and again put the region that's closest to you. Be sure to select the Free F0 tier.

After that's done go into your new Speech Service and select Keys and Endpoint. Put your KEY 1 into wendigos.cfg in STT API KEY and put your Location/Region in Region (if using Azure). Also set your language with the correct language code (see Wendigos.cfg for more info).

Real time responses (EXPERIMENTAL)

Allows the masked to reply to things players say in real time. This feature does NOT respect push-to-talk.

What you need

An Elevenlabs subscription ($5 tier or better)
A free Azure Speech to Text service OR you can use Elevenlabs for Speech to Text if you want to set up less
A chat api key set (and credits purchased, if needed)

Elevenlabs

Create an account and subscribe to the $5 tier or better. Click on your profile and click "API Keys". Create a key and save it somewhere.

Optionally, you can clone your voice (and anyone else's who you are sharing the account with) in advance, then under Voices -> Personal click your voice, then click "ID" to copy the voice ID. Save that ID too.
- Alternatively, you can choose not to do this and the mod will automatically clone everyone's voices. Note that autoclone tends to produce less convincing voice clones.

Final Realtime Setup Checklist

Ensure you set all api keys (Chat, STT, TTS), service providers for Chat and STT, and model name for Chat if you haven't already (and region under STT if using Azure).
Have everyone set their Elevenlabs voice ID if you cloned in advance
Set Experimental -> Realtime Responses to true in Wendigos.cfg.
Have everyone set their name in the Experimental -> Your name setting if you'd like the AI to know who is who.

Bonus features

Optional configs

General -> Talk Probability: how likely the Masked are to play recorded voice lines.
General -> Max voice clips: the maximum number of voice recordings to store at a time.
Experimental -> Config sync prompt: (host only) whether to show a config sync prompt when a new player joins the lobby. This will sync all relevant config options (Service selections, models, realtime enabled, API KEYS!!!, etc.
TTS -> Masked Voice Volume: adjusts how loud the masked are.
TTS -> Voice Description: provides a description of your voice to Elevenlabs to improve autoclone quality.
Chat -> Prompt: the prompt provided to the chat service. Useful for improving quality of realtime responses.

Masked improvements

This mod removes the masked masks and zombie arms to better fool players. Player clothing is also mimicked.

Possible issues

STT crashes due to server issue
Masked audio sounds choppy due to high network volatility

FAQ

What languages does the realtime mode support?

Elevenlabs Flash 2.5 (The TTS model this mod uses) supports 32 languages. Find out more here: https://elevenlabs.io/languages

Can I use this with Mirage?

I wouldn't recommend it as this mod uses audio clip collection and playback.

When is the mod listening to me?

The mod listens only during rounds. This enables realtime responses and transcribed clip collection.

Can I share my Elevenlabs account?

Yes! You can have everyone on the same account. Everyone just has to set their own voice ID in their config.

How did you add so many AI services to the mod?

I used my AI services framework VoiceBox.

TODO

[ ] Allow any enemy to clone voices

Credits

https://github.com/coqui-ai/tts
RugbugRedfern's Skinwalkers mod
@Kalthun and @notgarrett for helping me test this mod
The Lethal Company Modding Discord

CHANGELOG

v2.0.0 - The VoiceBox Rewrite

Rewrite AI services to use VoiceBox
- Add support for Gemini, Claude, and Ollama chat services
- Add support for Elevenlabs STT service
- Significantly improve real-time response latency using TTS audio streaming
Rewrite audio networking approach to improve reliability and eliminate lag on game join
Rewrite local mod functionality
- Remove local voice cloning model and reliance on main.exe
- Implement clip collection with speech transcription and smart chatbot clip selection
Implement optional config sync feature to streamline setup for clients and sharing service API keys
Add autoclone functionality to allow automatic cloning of player voices during rounds
Rework dependencies to enable cross-mod compatibility

v1.0.10

Update Netcode patch to fix v73 error

v1.0.9

Fixed a bug where masked could become invincible

v1.0.8

Add support for realtime responses in multiple languages courtesy of joecabezas on github

v1.0.5 - v1.0.7

Clamp audio so it isnt too loud
gpt-4o-mini support
Update tutorial for realtime responses

v1.0.4

Compatibility patch for LethalLevelLoader

v1.0.3

Increase volume of masked at the cost of latency

v1.0.1 and v1.0.2

Quick patches to fix crash

v1.0.0

Add realtime responses with ChatGPT and Azure
Allow players with mod disabled to still hear masked talk
Allow players to modify chatgpt prompt
Enable speed optimization for Elevenlabs
Add damaged line type
and a bunch more stuff

v0.1.9

Update README to show off the video I made about the mod!

v0.1.8

Prevent menu popup when using elevenlabs

v0.1.7

Add support for all languages XTTSv2 supports
Add support for special characters like arabic

v0.1.6 and prior

First releases

Get the Thunderstore App