
How to process input data for audio classification using CNN with PyTorch?

As an engineering student working in the DSP and ML fields, I am working on an audio classification project whose inputs are short clips (4 sec.) of instruments like bass, keyboard, guitar, etc. (the NSynth Dataset by the Magenta team at Google).

The idea is to convert all the short clips (.wav files) to spectrograms or melspectrograms then apply a CNN to train the model.
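As a rough illustration of that conversion step, here is a minimal sketch using plain `torch.stft` (in practice `torchaudio.transforms.MelSpectrogram` handles loading `.wav` files and the mel filterbank directly); the parameter values are illustrative, not taken from the question:

```python
import torch

def wav_to_spectrogram(waveform: torch.Tensor, n_fft: int = 1024) -> torch.Tensor:
    """Turn a mono waveform (1-D tensor of samples) into a log-power spectrogram."""
    window = torch.hann_window(n_fft)
    # Complex STFT: shape (n_fft // 2 + 1 frequency bins, frames)
    spec = torch.stft(waveform, n_fft=n_fft, window=window, return_complex=True)
    power = spec.abs() ** 2
    # Log-scale the power values, as is common before feeding a CNN
    return torch.log(power + 1e-6)
```

The resulting 2-D tensor can be treated as a one-channel image by the CNN.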

However, my question is: since the entire dataset is large (approximately 23 GB), should I first convert all the audio files to images such as PNG and then apply the CNN? I feel like this could take a lot of time, and it would nearly double the storage space for my input data, since I would then have audio + images (maybe up to 70 GB).

Thus, I wonder if there is any workaround that can speed up the process.

Thanks in advance.

Preprocessing is totally worth it. You will very likely end up running multiple experiments before your network works the way you want, and you don't want to waste time re-computing the features every time you change a few hyper-parameters.

Rather than using PNG, I would save PyTorch tensors directly (torch.save, which uses Python's standard pickling protocol) or NumPy arrays (numpy.savez, which saves several arrays into one uncompressed zip file). If you are concerned about disk space, consider numpy.savez_compressed.
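A minimal sketch of both options; the file names are hypothetical, and a random tensor stands in for a precomputed spectrogram:

```python
import os
import tempfile

import numpy as np
import torch

# Stand-in for one precomputed mel-spectrogram (64 mel bins x 173 frames)
spec = torch.randn(64, 173)

with tempfile.TemporaryDirectory() as d:
    # Option 1: native PyTorch serialization (pickle-based)
    pt_path = os.path.join(d, "clip_0001.pt")
    torch.save(spec, pt_path)
    reloaded = torch.load(pt_path)

    # Option 2: compressed NumPy archive; one file can hold several named arrays
    npz_path = os.path.join(d, "clip_0001.npz")
    np.savez_compressed(npz_path, mel=spec.numpy())
    with np.load(npz_path) as archive:
        reloaded_np = torch.from_numpy(archive["mel"])
```

Either format round-trips the exact float values, unlike an 8-bit PNG, so no information is lost between preprocessing and training.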
