ScholarGate
Asistents
Process / pipelineAudio Signal Processing

MFCC (Mel-Frequency Cepstral Coefficients)

Mel-Frequency Cepstral Coefficients (MFCC) ir kompakts audio iezīmju attēlojums, kas atdarina cilvēka dzirdes uztveri. Deivisa un Mermelšteina ieviesti 1980. gadā, MFCC ir de facto iezīmju ekstrakcijas metode runas atpazīšanai un vides skaņu analīzei. Tie saspiež audio signālu frekvences informāciju nelielā koeficientu kopā, kas uztver fonētisko saturu, vienlaikus atmetot neatbilstošas detaļas.

Atvērt MethodMindDrīzumāVideoDrīzumāDownload slides

Lasīt pilno metodes aprakstu

Tikai dalībniekiem

Piesakieties ar bezmaksas kontu, lai lasītu šo sadaļu.

Pieteikties

Method map

The neighbourhood of related methods — select a node to explore.

Avoti

  1. Davis, S., & Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(4), 357-366. DOI: 10.1109/TASSP.1980.1163420
  2. Young, S. J., Evermann, G., Gales, M. J., et al. (1996). The HTK Book. Cambridge University Engineering Department. link
  3. Moustakides, G. V., & Rougui, J. A. (2004). Optimal filtering for polynomial signal models. IEEE Transactions on Signal Processing, 52(8), 2219-2230. link

Kā citēt šo lapu

ScholarGate. (2026, June 3). Mel-Frequency Cepstral Coefficients. ScholarGate. https://scholargate.app/lv/applied-physics/mfcc

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side

Uz to atsaucas

ScholarGateMFCC (Mel-Frequency Cepstral Coefficients). Izgūts 2026-06-15 no https://scholargate.app/lv/applied-physics/mfcc · Datu kopa: https://doi.org/10.5281/zenodo.20539026