Gaussian Mixture Model for speech recognition

Hi all! I'm implementing a tool for speech recognition (command based).
My training data are 21 commands (7 different commands with 3 utterances for each). I did:
  • the pre-processing phase (silence removal and end-point detection)
  • the features extraction phase (with MFCC calculation).
So, for every utterance in my training set, i have a MFCC matrix with 12 columns (12=number of MFCC) and as much rows as the number of frames i divided the signal.
For the recognition phase, i was wondering to use the gmdistribution tool.
I read this article:
% model = gmdistribution.fit(MFCCtraindata,M);
What is the MFCCtraindata parameter?
Is it the MFCC matrix associated with every utterance?
For each command i have 3 utterances, so i have 3 different MFCC matrixes.
How can i do to create a unique gmm if, for every command, i will got 3 different gmm?
Any kind of help will be appreciated.
Thank you!!

답변 (5개)

Castalia
Castalia 2013년 3월 8일

0 개 추천

Nobody could give me any advice, please?
Rania Ziedan
Rania Ziedan 2015년 10월 22일

0 개 추천

i really need help in the same issue if you handled it could you help me thanks in advance
MUZITIANXINJIE
MUZITIANXINJIE 2016년 6월 26일

0 개 추천

Yes,I want,but no one help me! I really need to use the deep learning tu classfy the voice recognition . thanks for your help.
hanieh rafiee
hanieh rafiee 2017년 2월 19일

0 개 추천

Hi Is the answer to your question receipts? Will you help me please?

카테고리

질문:

2013년 3월 8일

답변:

2017년 2월 19일

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by