Imbalanced Audio Dataset for Deep Learning Classification

Hi, I am trying to use audio data from interviews for binary classification by converting the recordings into spectrograms and feeding those into a CNN. The recordings have different durations, ranging from 7 to 30 minutes, and the dataset is imbalanced. I am aware of techniques such as SMOTE and oversampling of the minority class, but I am not sure how to oversample my minority class with audio data. Should I convert the audio into spectrograms before oversampling, and what ways are there to do it? Thanks!

Answers (1)

Vineet Joshi on 30 Jul 2021

Hi
The following documentation covers data augmentation for audio data. It includes examples of how to create custom augmentation pipelines and how to use functions such as pitch shifting, time shifting, and time stretching.
Hope this helps you.
Thanks
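
To make the idea concrete, here is a rough sketch of one way to combine the two concerns in the question: cut each long recording into fixed-length segments, then balance the classes by adding randomly time-shifted copies of minority-class segments, all on the raw waveform before any spectrogram is computed. It is written in plain Python (standard library only) purely for illustration and is not the MATLAB API from the documentation mentioned above. The function names `split_into_segments`, `time_shift`, and `oversample_minority`, the circular shift, and the toy segment length and hop are all assumptions made for this example.

```python
import random


def split_into_segments(signal, segment_len, hop):
    """Cut one recording into fixed-length segments, `hop` samples apart.

    A trailing piece shorter than `segment_len` is dropped.
    """
    last_start = len(signal) - segment_len
    return [signal[s:s + segment_len] for s in range(0, last_start + 1, hop)]


def time_shift(segment, max_shift, rng):
    """Return a copy of `segment` circularly shifted by a random offset."""
    k = rng.randint(-max_shift, max_shift) % len(segment)
    if k == 0:
        return list(segment)
    return list(segment[-k:]) + list(segment[:-k])


def oversample_minority(segments, labels, max_shift, seed=0):
    """Add time-shifted copies until every class matches the largest class."""
    rng = random.Random(seed)
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    target = max(counts.values())

    out_x, out_y = list(segments), list(labels)
    for label, count in counts.items():
        pool = [s for s, y in zip(segments, labels) if y == label]
        for _ in range(target - count):
            out_x.append(time_shift(rng.choice(pool), max_shift, rng))
            out_y.append(label)
    return out_x, out_y


# Toy data: one long majority-class recording, one short minority-class one.
rec_a = [float(i) for i in range(40)]
rec_b = [float(i) for i in range(12)]

seg_a = split_into_segments(rec_a, segment_len=8, hop=4)
seg_b = split_into_segments(rec_b, segment_len=8, hop=4)

segments = seg_a + seg_b
labels = [0] * len(seg_a) + [1] * len(seg_b)

balanced_x, balanced_y = oversample_minority(segments, labels, max_shift=3)
```

Each entry of `balanced_x` would then be converted to a spectrogram for the CNN. Because segments from the same interview are highly correlated, it is safer to split the recordings into training and validation sets first and to segment and augment only the training recordings.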

Release: R2021a
Asked: 12 Jul 2021
Answered: 30 Jul 2021