Upgrade to the new Chatterbox Multilingual (v2) with 23-language support

Hey there,

First off, thanks a lot for all the great work on this fork — the large text handling, audiobook generation, voice cloning, and the clean Web UI have made Chatterbox much more usable in real projects. Really appreciate the effort that went into this.

I just noticed that Resemble AI released a new version called Chatterbox Multilingual, which now supports 23 languages, emotion exaggeration, and comes with a new 0.5B Llama backbone.
It would be awesome if this fork could be updated to use the new model, ideally keeping all the current enhancements (FastAPI server, Docker setup, chunking, predefined voices, etc.).

**Some highlights of the new version:**
- Native multilingual TTS with 23 supported languages
- Zero-shot voice cloning across languages
- Emotion/exaggeration control
- Improved stability and pacing control
- Still MIT licensed

**Possible next steps:**

Swap in ChatterboxMultilingualTTS.from_pretrained() for the new model
- Add a language selection dropdown to the UI
- Expose new parameters like exaggeration and cfg_weight
- Check that chunking and audiobook generation still work properly

If you’re planning to upgrade, I’d be happy to help test it.

Thanks again for maintaining such a useful and polished project — it’s made a big difference for those of us running self-hosted TTS setups.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upgrade to the new Chatterbox Multilingual (v2) with 23-language support #77

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Upgrade to the new Chatterbox Multilingual (v2) with 23-language support #77

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions