Method comparison
View your selected methods side by side; rows that differ are marked with ≠.
| | Bark and Mel scales | Linear predictive coding |
|---|---|---|
| Field | Acoustics | Acoustics |
| Method family | Process / pipeline | Process / pipeline |
| Year of origin ≠ | 1937 | 1975 |
| Proposed by ≠ | Eberhard Zwicker, Stanley Smith Stevens | John Parker Burg, John Makhoul |
| Type ≠ | Perceptual frequency mapping | Predictive speech coding and analysis |
| Seminal work ≠ | Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands. Journal of the Acoustical Society of America, 33(2), 248. | Makhoul, J. (1975). Linear prediction: A tutorial review. Proceedings of the IEEE, 63(4), 561–580. |
| Aliases | bark scale, mel scale, critical bandwidth, perceptual frequency | LPC, autoregressive model, speech prediction, vocal tract modeling |
| Related | 5 | 5 |
| Summary ≠ | The Bark and mel scales are perceptual frequency scales that map physical frequency (Hz) onto perceived pitch and auditory frequency resolution. Formalized by Zwicker (Bark, 1961) and Stevens (mel, 1937), these non-linear scales reflect how the human ear processes sound. The Bark scale divides the audible range into 24 critical bands; the mel scale models pitch perception. Both are widely used for audio feature extraction, speech processing, and designing audio systems that align with human hearing. | Linear predictive coding (LPC) is a signal processing technique for modeling and compressing speech. It assumes each speech sample can be approximated by a linear combination of previous samples. Pioneered by Burg and Makhoul in the 1970s, LPC underlies speech codecs, speech synthesis, speaker recognition, and speech enhancement. It exploits the time-correlated structure of speech to achieve high compression ratios and efficient parameter extraction. |
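
To make the two summaries concrete, here is a minimal Python sketch of both methods. Nothing in it comes from the comparison page itself: the function names are hypothetical, the mel formula is the commonly used `2595 * log10(1 + f/700)` approximation, the Bark formula is the Zwicker and Terhardt (1980) arctangent approximation, and the LPC coefficients are estimated with the autocorrelation method and the Levinson-Durbin recursion, which is one of several possible estimators (Burg's method is another).

```python
import math


def hz_to_mel(f_hz):
    """Approximate mel value for a frequency in Hz (assumed common formula)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def hz_to_bark(f_hz):
    """Approximate Bark value for a frequency in Hz (Zwicker & Terhardt style)."""
    return 13.0 * math.atan(0.00076 * f_hz) + 3.5 * math.atan((f_hz / 7500.0) ** 2)


def autocorrelation(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of a finite sequence."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(x, order):
    """Estimate predictor coefficients a[1..order] so that
    x[n] is approximated by sum(a[k] * x[n - k]).

    Uses the autocorrelation method with the Levinson-Durbin recursion.
    Returns (coefficients, residual_energy).
    """
    r = autocorrelation(x, order)
    if r[0] == 0.0:
        raise ValueError("signal has zero energy; LPC is undefined")
    a = [0.0] * (order + 1)  # a[0] is unused; a[k] is the weight on x[n - k]
    err = r[0]               # prediction error energy of the order-0 model
    for i in range(1, order + 1):
        # Reflection coefficient for stepping from order i-1 to order i.
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err
        updated = a[:]
        updated[i] = k
        for j in range(1, i):
            updated[j] = a[j] - k * a[i - j]
        a = updated
        err *= (1.0 - k * k)
    return a[1:], err


# Perceptual mapping: 1000 Hz lands close to 1000 mel and between 8 and 9 Bark.
mel_1k = hz_to_mel(1000.0)
bark_1k = hz_to_bark(1000.0)

# Linear prediction: x[n] = 0.5 ** n satisfies x[n] = 0.5 * x[n - 1],
# so an order-2 fit should give coefficients close to [0.5, 0.0].
demo_signal = [0.5 ** n for n in range(64)]
coeffs, residual = lpc(demo_signal, 2)
```

The contrast in the table shows up directly in the code: the Bark and mel functions are fixed, signal-independent frequency warpings, while `lpc` fits a small set of coefficients to each input signal. In practice, speech would be split into short windowed frames before calling something like `lpc`, a step this sketch leaves out.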