Upgrade to the new Chatterbox Multilingual (v2) with 23-language support #77

@middi1408

Hey there,

First off, thanks a lot for all the great work on this fork — the large text handling, audiobook generation, voice cloning, and the clean Web UI have made Chatterbox much more usable in real projects. Really appreciate the effort that went into this.

I just noticed that Resemble AI released a new version called Chatterbox Multilingual, which now supports 23 languages, emotion exaggeration, and comes with a new 0.5B Llama backbone.
It would be awesome if this fork could be updated to use the new model, ideally keeping all the current enhancements (FastAPI server, Docker setup, chunking, predefined voices, etc.).

Some highlights of the new version:

  • Native multilingual TTS with 23 supported languages
  • Zero-shot voice cloning across languages
  • Emotion/exaggeration control
  • Improved stability and pacing control
  • Still MIT licensed

Possible next steps:

  • Swap in ChatterboxMultilingualTTS.from_pretrained() for the new model
  • Add a language selection dropdown to the UI
  • Expose new parameters like exaggeration and cfg_weight
  • Check that chunking and audiobook generation still work properly
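
To make the steps above a bit more concrete, here is a rough sketch of how the swap and the new parameters might fit together. It is not taken from this fork's code: `TTSRequest`, `EXAMPLE_LANGUAGES`, and `synthesize` are hypothetical names, the three language codes are only a placeholder for the full list of 23, and the import path and `generate()` keyword arguments (`language_id`, `exaggeration`, `cfg_weight`) are what I understand from the upstream README, so please double-check them against the released package.

```python
from dataclasses import dataclass

# Assumption: a tiny illustrative subset. The real list of 23 languages
# should come from the upstream package, not from this sketch.
EXAMPLE_LANGUAGES = {"en": "English", "fr": "French", "de": "German"}


@dataclass
class TTSRequest:
    """Hypothetical request shape; not this fork's actual API schema."""

    text: str
    language_id: str = "en"
    exaggeration: float = 0.5
    cfg_weight: float = 0.5

    def generate_kwargs(self) -> dict:
        """Validate the language and collect the keyword arguments for generate()."""
        if self.language_id not in EXAMPLE_LANGUAGES:
            raise ValueError(f"unsupported language: {self.language_id!r}")
        return {
            "language_id": self.language_id,
            "exaggeration": self.exaggeration,
            "cfg_weight": self.cfg_weight,
        }


def synthesize(request: TTSRequest, device: str = "cpu"):
    """Generate audio for one request and return (waveform, sample_rate)."""
    # Imported lazily so this module loads even without the model installed.
    # Assumption: import path and signature as described in the upstream README.
    from chatterbox.mtl_tts import ChatterboxMultilingualTTS

    # A real server would load the model once at startup, not per request.
    model = ChatterboxMultilingualTTS.from_pretrained(device=device)
    wav = model.generate(request.text, **request.generate_kwargs())
    return wav, model.sr
```

The language dropdown in the UI could then be filled from the same mapping that the validation uses, so the two stay in sync. Running each text chunk through `synthesize` with the same `language_id` would also be one way to check that chunking and audiobook generation still behave as before.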

If you’re planning to upgrade, I’d be happy to help test it.

Thanks again for maintaining such a useful and polished project — it’s made a big difference for those of us running self-hosted TTS setups.

Labels: help wanted
