Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
Updated Oct 13, 2023 - JavaScript
Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".
Make the sound you hear pure and clean with deep learning.
Knowledge Boosting
Master's project at the University of Cambridge
Batch extract specific speakers from mixed audio using reference samples. Powered by TitanNet & Silero VAD with lossless export.