GitHub - BagelHero/DiffSinger_colab_notebook_MLo7: DiffSinger training colab notebook to make training easier hopefully

Custom Local Training GUI is moved to DiffTrainer

DiffSinger training notebook:

current supported data format:

lab + wav (NNSVS format)
csv + wav (DiffSinger format)
ds (DiffSinger .ds files)

NOTE:

your_speaker_folder's folder name will be used as spk_name so please be careful about your file naming
colab notebook primarily uses python; thus space and special character in file name or folder path may be invalid
for an in-depth guide for SVS training and/or labeling, please see SVS Singing Voice Database - Tutorial
it is advised to edit your data using SlurCutter for a more refined data for your pitch model
please visit DiffSinger Discord for any help and questions regarding model production

Zip file format examples:

[NOTE] .ds training has the same zip organization as lab + wav, but with only .ds files- no wav needed
#single speaker (lab + wav)
your_zip.zip:
    |
    |
    your_speaker_folder:
        |
        |
        data_1.wav
        data_1.lab
        .
        data_2.wav
        data_2.lab
        .
        data_3.wav
        data_3.lab
        .
        ...

#single speaker (csv + wav)
your_zip.zip:
    |
    |
    your_speaker_folder:
        |
        |
        wavs (folder named "wavs" containing all the wavs)
        .
        transcriptions.csv

#multi speaker (lab + wav)
your_zip.zip:
    |
    |
    your_speaker_folder_1:
        |
        |
        data_1.wav
        data_1.lab
        .
        data_2.wav
        data_2.lab
        .
        data_3.wav
        data_3.lab
        .
        ...
    your_speaker_folder_2:
        |
        |
        data_1.wav
        data_1.lab
        .
        data_2.wav
        data_2.lab
        .
        data_3.wav
        data_3.lab
        .
        ...

#multi speaker (csv + wav)
your_zip.zip:
    |
    |
    your_speaker_folder_1:
        |
        |
        wavs (folder named "wavs" containing all the wavs)
        .
        transcriptions.csv
    your_speaker_folder_2:
        |
        |
        wavs (folder named "wavs" containing all the wavs)
        .
        transcriptions.csv

Vocoder finetuning notebook:

current supported data format:

wav

NOTE:

it is suggested to use manual segmented audio for cleaner segments (though there's minimal difference when using the auto segmentation)
zip file format can consist of any type of files, even subfolders. data extraction will only account .wav that are within the zip into the training set

SOFA training notebook (wip):

current supported data format:

lab + wav (NNSVS format)

NOTE:

this notebook is still a rough draft, please either don't use it at all or use it with caution....

Plans (update might not be in order):

[notebook] improve SOFA notebook, add inference
[notebook] update dictionary conversion code for phoneme types in build OU
[notebook] clean up multi-dict notebook and support logic for dictionary generating for out-of-spefied-lang labels (/)
[resource] add example file(s) for multi-dicitonary training

Credits:

openvpi for DiffSinger fork and more
UtaUtaUtau for nnsvs-db-converter
Kei for the original notebook
MLo7 for the repo's content
PixPrucer for an in-depth SVS guide
haru0l for the base pretrain with embeds
AgentAsteriski for the local GUI

Name		Name	Last commit message	Last commit date
Latest commit History 266 Commits
DiffSinger_colab_notebook.ipynb		DiffSinger_colab_notebook.ipynb
DiffSinger_multidict_colab_notebook.ipynb		DiffSinger_multidict_colab_notebook.ipynb
Diffsinger_colab_notebook.ipynb		Diffsinger_colab_notebook.ipynb
NSF_hifigan_finetuning_notebook.ipynb		NSF_hifigan_finetuning_notebook.ipynb
README.md		README.md
SOFA_Notebook.ipynb		SOFA_Notebook.ipynb
g2p_for_OpenUtau_training_notebook.ipynb		g2p_for_OpenUtau_training_notebook.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Custom Local Training GUI is moved to DiffTrainer

DiffSinger training notebook:

current supported data format:

NOTE:

Vocoder finetuning notebook:

current supported data format:

NOTE:

SOFA training notebook (wip):

current supported data format:

NOTE:

Plans (update might not be in order):

About

Uh oh!

Releases

Packages

Languages

BagelHero/DiffSinger_colab_notebook_MLo7

Folders and files

Latest commit

History

Repository files navigation

Custom Local Training GUI is moved to DiffTrainer

DiffSinger training notebook:

current supported data format:

NOTE:

Vocoder finetuning notebook:

current supported data format:

NOTE:

SOFA training notebook (wip):

current supported data format:

NOTE:

Plans (update might not be in order):

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages