What kind of input values from MFCC should be used to train Neural Network for speech recognition?
조회 수: 4 (최근 30일)
이전 댓글 표시
I have set of .wav files and using MFCC i have extracted 13 MFCC coefficients for each sound file(.wav). For a particular .wav file i got an m-by-n matrix where n is the 13 MFCC coefficients and m varies for different .wav files. EG: file1.wav=>54-by-13 matrix file2.wav=>65-by-13 matrix
For training Neural network i should have a fixed m-by-n matrix size.Further i calculated correlation coefficient for each m-by-n MFCC matrix[eg: corrcoef(54-by-13 matrix)] and obtained 13-by-13 matrix for all the .wav files uniqely.Is this the right way to apply correlation cofficient for varying MFCC coefficient matrix inorder to obtain a unique m-by-n matrix for all the .wav files?
댓글 수: 0
답변 (0개)
참고 항목
카테고리
Help Center 및 File Exchange에서 Speech Recognition에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!