Download as pdf or txt
Download as pdf or txt
You are on page 1of 15

MFCC TRONG NHN DNG TiNG NI

Lp Nhm 13 : D07DTMT : L Dng Ngc L Vn Trng

H thng nhn dng ting ni

MFCC (Mel frequency cepstral coefficient)


MFCC l phng php trch chn c trng da trn cc h s cepstral - Tn hiu ting ni s c trch chn cc c trng sau khi thu m - Kt qu sau qu trnh ny l tp cc vecto c trng m hc - L tin cho qu trnh hun luyn h thng sau ny

S khi qu trnh phn tch MFCC

Qu trnh phn tch MFCC


 Pre-emphasis
- Tn hiu ting ni s (n) c cho qua mt b lc thng cao s2(n) = s(n) - a*s(n-1) vi h s c nh a thng chn l 0.95 - Hm truyn t: H(z)=1-a*z-1 - iu ny lm cho phng ph tn hiu, t b nh hng bi cc php bin i.

Output Pre-emphasis

Frame blocking
 Tn hiu ting ni u vo c chia nh thnh cc khung hnh t 20 ~ 30 ms  Gm cc khung c N mu  Cc khung cnh nhau cch bit M mu

Hamming windowing
 Tn hiu s c tr v 0 phn bt u v kt thc ca mi khung -> Tc l gim nh s khng lin tc ca tn hiu Ca s hamming w(n, a) = (1 - a) - a cos(2pn/(N-1))0nN-1

Ca s Hamming

Fast Fourier Transform or FFT

 Ph tn hiu sau khi nhn vi ca s Hamming s s dng php bin i Fourier nhanh -> Thu c bin ph cha cc thng tin c ch ca tn hiu ting ni

Triangular Bandpass Filters (b lc di tam gic)


 H lc ny gm 23 bng con(subbands)  Thnh phn FFT ph c nhn vi mt tam gic v c tch ly vo mt vng tn s xc nh  -> l thnh phn ph Mel  Cng thc tnh tn s Mel: mel (f) = 1.125 * ln (1 + f/700)

Discrete cosine transform (DCT)


 trch chn thnh phn c trng  Ta p dng php bin i Cosine ri rc(DCT) cho logarit ph Mel  -> Cc c trng c lp ny s to thun li cho vic m hnh ting ni v so snh i chiu mu  Cng thc thng dng tnh h s DCTi Cm=Sk=1Ncos[m*(k-0.5)*p/N]*Ek, m=1,2, ..., L

Kt lun

 Tc tnh ton cao  tin cy ln  c s dng rt hiu qu trong cc chng trnh nhn dng hin nay

Ti liu tham kho


[1] https://ccrma.stanford.edu/~unjung/mylec/mfcc.html
[2] Bi ging x l ting ni L xun Thnh [3] http://vi.wikipedia.org/wiki/Nh%E1%BA%ADn_d%E1%BA%A1ng _ti%E1%BA%BFng_n%C3%B3i

Xin cm n !

You might also like