-
Notifications
You must be signed in to change notification settings - Fork 257
Description
Hey there,
First off, thanks a lot for all the great work on this fork — the large text handling, audiobook generation, voice cloning, and the clean Web UI have made Chatterbox much more usable in real projects. Really appreciate the effort that went into this.
I just noticed that Resemble AI released a new version called Chatterbox Multilingual, which now supports 23 languages, emotion exaggeration, and comes with a new 0.5B Llama backbone.
It would be awesome if this fork could be updated to use the new model, ideally keeping all the current enhancements (FastAPI server, Docker setup, chunking, predefined voices, etc.).
Some highlights of the new version:
- Native multilingual TTS with 23 supported languages
- Zero-shot voice cloning across languages
- Emotion/exaggeration control
- Improved stability and pacing control
- Still MIT licensed
Possible next steps:
Swap in ChatterboxMultilingualTTS.from_pretrained() for the new model
- Add a language selection dropdown to the UI
- Expose new parameters like exaggeration and cfg_weight
- Check that chunking and audiobook generation still work properly
If you’re planning to upgrade, I’d be happy to help test it.
Thanks again for maintaining such a useful and polished project — it’s made a big difference for those of us running self-hosted TTS setups.