The train and test sets of the dataset are saved in UrbanSound8K_train.pkl and UrbanSound8K_test.pkl. Each file is structured as a list of dictionaries, where each dictionary corresponds to a different audio segment from an audio file. The dictionaries contain the following keys:
• filename: the unique name of the audio file. This is useful for matching audio segments to the audio file they come from, and for computing global scores by averaging the scores of the segments that share the same filename (see the aggregation sketch at the end of this section).
• class: class name
• classID: class number [0…9]
• features: all the features to be used for training. This is a dictionary which contains:
  • logmelspec
  • mfcc
  • chroma
  • spectral_contrast
  • tonnetz
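To make this structure concrete, the snippet below is a minimal sketch that loads the training pickle and prints the contents of one entry. It assumes only the structure described above (the feature arrays are assumed to be NumPy arrays, so their shapes depend on how the segments were preprocessed):

import pickle

# Load the list of segment dictionaries from the training set.
with open('UrbanSound8K_train.pkl', 'rb') as f:
    dataset = pickle.load(f)

print(len(dataset), 'segments in total')

# Inspect one segment's metadata and feature shapes
# (falling back to the type if a feature has no .shape attribute).
segment = dataset[0]
print('filename:', segment['filename'])
print('class:', segment['class'], '| classID:', segment['classID'])
for name, feature in segment['features'].items():
    print(name, getattr(feature, 'shape', type(feature)))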
In dataset.py, the skeleton of a PyTorch Dataset class (UrbanSound8KDataset) is provided for loading the UrbanSound8K dataset. You first have to edit this file to construct the different inputs (LMC, MC, and MLMC features) for training your convolutional networks; a sketch of this selection logic is given at the end of this section. The code already loads the labels and the unique identifiers of the files that the audio segments belong to; you have to modify the commented lines. Then, to use it, include the following lines in your code:
import torch
from dataset import UrbanSound8KDataset
train_loader = torch.utils.data.DataLoader(
    UrbanSound8KDataset('UrbanSound8K_train.pkl', mode),
    batch_size=32, shuffle=True,
    num_workers=8, pin_memory=True)

val_loader = torch.utils.data.DataLoader(
    UrbanSound8KDataset('UrbanSound8K_test.pkl', mode),
    batch_size=32, shuffle=False,
    num_workers=8, pin_memory=True)
for i, (input, target, filename) in enumerate(train_loader):
    # training code

for i, (input, target, filename) in enumerate(val_loader):
    # validation code
In the code above, input is a batch of 32 feature inputs (of the type selected by mode), target contains their corresponding labels, and filename holds the names of the audio files each segment belongs to (useful for testing). The variable mode should take one of the values 'LMC', 'MC', or 'MLMC'.
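Because you have to fill in the commented lines of UrbanSound8KDataset yourself, a possible filled-in version is sketched below. The exact composition of the LMC, MC, and MLMC inputs is an assumption made here purely for illustration (LMC = log-mel spectrogram stacked with chroma, spectral contrast, and tonnetz; MC = the same with MFCCs in place of the log-mel spectrogram; MLMC = all five feature types); verify the required combinations against the coursework brief.

import pickle
import numpy as np
import torch
from torch.utils import data

class UrbanSound8KDataset(data.Dataset):
    def __init__(self, dataset_path, mode):
        with open(dataset_path, 'rb') as f:
            self.dataset = pickle.load(f)
        self.mode = mode

    def __getitem__(self, index):
        feats = self.dataset[index]['features']
        # Assumed feature compositions; stacking along axis 0 assumes the
        # arrays share the same time axis. Check the brief for the real ones.
        if self.mode == 'LMC':
            feature = np.concatenate(
                [feats['logmelspec'], feats['chroma'],
                 feats['spectral_contrast'], feats['tonnetz']], axis=0)
        elif self.mode == 'MC':
            feature = np.concatenate(
                [feats['mfcc'], feats['chroma'],
                 feats['spectral_contrast'], feats['tonnetz']], axis=0)
        elif self.mode == 'MLMC':
            feature = np.concatenate(
                [feats['mfcc'], feats['logmelspec'], feats['chroma'],
                 feats['spectral_contrast'], feats['tonnetz']], axis=0)
        # Add a channel dimension so the CNN receives (1, H, W) inputs.
        feature = torch.from_numpy(feature.astype(np.float32)).unsqueeze(0)
        label = self.dataset[index]['classID']
        filename = self.dataset[index]['filename']
        return feature, label, filename

    def __len__(self):
        return len(self.dataset)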
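Since several segments share a filename, the per-file ("global") scores mentioned earlier can be obtained by averaging the per-segment class scores at test time. Below is a minimal sketch of that aggregation; model here is a placeholder for your trained network, assumed to return a (batch_size, 10) tensor of class logits.

from collections import defaultdict
import torch
import torch.nn.functional as F

# Accumulate per-segment class probabilities under each filename.
file_scores = defaultdict(list)
file_labels = {}

model.eval()
with torch.no_grad():
    for i, (input, target, filename) in enumerate(val_loader):
        probs = F.softmax(model(input), dim=1)
        for fname, p, t in zip(filename, probs, target):
            file_scores[fname].append(p)
            file_labels[fname] = t.item()

# Average the segment scores per file; the argmax of the averaged
# scores is the file-level prediction.
correct = 0
for fname, scores in file_scores.items():
    avg = torch.stack(scores).mean(dim=0)
    correct += int(avg.argmax().item() == file_labels[fname])

print('file-level accuracy:', correct / len(file_scores))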