手法を比較
選択した手法を並べて確認できます。異なる行はハイライト表示されます。
| 線形予測符号化× | BarkスケールとMelスケール× | |
|---|---|---|
| 分野 | 音響学 | 音響学 |
| 系統 | Process / pipeline | Process / pipeline |
| 提唱年≠ | 1975 | 1937 |
| 提唱者≠ | Freddy Burg, John Makhoul | Eberhard Zwicker, Stanley Smith Stevens |
| 種類≠ | Predictive speech coding and analysis | Perceptual frequency mapping |
| 原典≠ | Makhoul, J. (1975). Linear prediction: A tutorial review. Proceedings of the IEEE, 63(4), 561–580. DOI ↗ | Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands. Journal of the Acoustical Society of America, 33(2), 248–248. link ↗ |
| 別名 | LPC, autoregressive model, speech prediction, vocal tract modeling | bark scale, mel scale, critical bandwidth, perceptual frequency |
| 関連 | 5 | 5 |
| 概要≠ | Linear Predictive Coding (LPC) is a powerful signal processing technique for modeling and compressing speech by assuming each speech sample can be predicted from a linear combination of previous samples. Pioneered by Burg and Makhoul in the 1970s, LPC is the foundation of speech codecs, speech synthesis, speaker recognition, and speech enhancement. LPC exploits the time-correlated structure of speech to achieve high compression ratios and enable efficient parameter extraction. | Bark and Mel scales are perceptual frequency scales that map physical frequency (Hz) to perceived pitch and auditory perception. Formalized by Zwicker (Bark, 1961) and Stevens (Mel, 1937), these non-linear scales reflect how the human ear processes sound. Bark scale divides hearing into 24 critical bands; Mel scale models pitch perception. Both are essential for audio feature extraction, speech processing, and designing audio systems that align with human hearing. |
| ScholarGateデータセット ↗ |
|
|