Skip to content

feat: MMS-300M Integration & Acoustemes Documentation#2

Open
joaocarvoli wants to merge 4 commits intomainfrom
obtlab/obt-23-switch-model-xlsr-53-to-mms
Open

feat: MMS-300M Integration & Acoustemes Documentation#2
joaocarvoli wants to merge 4 commits intomainfrom
obtlab/obt-23-switch-model-xlsr-53-to-mms

Conversation

@joaocarvoli
Copy link
Member

Summary

This PR integrates the Meta MMS-300M model into the training pipeline and provides comprehensive documentation on acousteme generation and semantic mapping.

Key Changes

  • MMS-300M Integration: Updated phase1_acoustic and phase2_bpe to support model switching.
  • Documentation: Added docs/ACOUSTEMES_GENERATION.md in RFC format, detailing RLE logic and future semantic mapping workflows.
  • Bug Fix: Improved phase3_vocoder_v2.py resume logic.
  • Utility: Added src/utils/list_checkpoints.py.

Context

Addresses the need for better acoustic tokenization and lays the groundwork for the 'Acoustic Meaning Map'.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant