Módszerek összehasonlítása
Tekintse át a kiválasztott módszereket egymás mellett; az eltérő sorok kiemelve jelennek meg.
| Bark- és Mel-skálák× | Cepstrum-analízis× | |
|---|---|---|
| Tudományterület | Akusztika | Akusztika |
| Módszercsalád | Process / pipeline | Process / pipeline |
| Keletkezés éve≠ | 1937 | 1963 |
| Megalkotó≠ | Eberhard Zwicker, Stanley Smith Stevens | Bogert, Healy, Tukey |
| Típus≠ | Perceptual frequency mapping | Spectral decomposition method |
| Alapmű≠ | Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands. Journal of the Acoustical Society of America, 33(2), 248–248. link ↗ | Bogert, B. P., Healy, M. J., & Tukey, J. W. (1963). The quefrency alanysis of time series for echoes: cepstrum, pseudo-autocovariance, cross-cepstrum, and saphe cracking. In Time Series Analysis Research Papers (pp. 209–243). Wiley. link ↗ |
| Alternatív nevek | bark scale, mel scale, critical bandwidth, perceptual frequency | cepstrum, MFCC, mel-frequency cepstral coefficients, spectral analysis |
| Kapcsolódó | 5 | 5 |
| Összefoglaló≠ | Bark and Mel scales are perceptual frequency scales that map physical frequency (Hz) to perceived pitch and auditory perception. Formalized by Zwicker (Bark, 1961) and Stevens (Mel, 1937), these non-linear scales reflect how the human ear processes sound. Bark scale divides hearing into 24 critical bands; Mel scale models pitch perception. Both are essential for audio feature extraction, speech processing, and designing audio systems that align with human hearing. | Cepstral analysis is a spectral analysis technique that decomposes signals into independent components by inverting the log-magnitude spectrum. Pioneered by Bogert, Healy, and Tukey in 1963, cepstral analysis reveals periodic structure in spectra (pitch, echo patterns) and separates source excitation from filter response. Mel-frequency cepstral coefficients (MFCCs) derived from cepstral analysis are the most widely used features in automatic speech recognition, speaker verification, and audio analysis. |
| ScholarGateAdatkészlet ↗ |
|
|