Compare methods

View the selected methods side by side. Rows that differ are marked with ≠.

	Linear predictive coding	Bark and Mel scales
Field	Acoustics	Acoustics
Family	Process / pipeline	Process / pipeline
Year proposed ≠	1975	1937
Proposed by ≠	Freddy Burg, John Makhoul	Eberhard Zwicker, Stanley Smith Stevens
Type ≠	Predictive speech coding and analysis	Perceptual frequency mapping
Primary source ≠	Makhoul, J. (1975). Linear prediction: A tutorial review. Proceedings of the IEEE, 63(4), 561–580.	Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands. Journal of the Acoustical Society of America, 33(2), 248.
Also known as	LPC, autoregressive model, speech prediction, vocal tract modeling	bark scale, mel scale, critical bandwidth, perceptual frequency
Related	5	5
Overview ≠	Linear predictive coding (LPC) is a signal processing technique that models and compresses speech by assuming each sample can be predicted as a linear combination of previous samples. Pioneered by Burg and Makhoul in the 1970s, LPC underlies speech codecs, speech synthesis, speaker recognition, and speech enhancement. It exploits the short-term correlation between speech samples to achieve high compression ratios and to extract a compact set of parameters efficiently.	The Bark and Mel scales are perceptual frequency scales that map physical frequency (Hz) to perceptually motivated units. Formalized by Stevens (Mel, 1937) and Zwicker (Bark, 1961), these nonlinear scales reflect how the human ear processes sound: the Bark scale divides the audible range into 24 critical bands, and the Mel scale models pitch perception. Both are widely used in audio feature extraction, speech processing, and the design of audio systems that align with human hearing.
ScholarGate dataset	v1, 3 sources, PUBLISHED	v1, 3 sources, PUBLISHED
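
The LPC overview in the table says that each speech sample is predicted from a linear combination of previous samples. A minimal sketch of that idea follows, assuming the autocorrelation method with the Levinson-Durbin recursion, which is one common way to estimate the predictor coefficients and is not stated in the table itself. The function names `autocorrelation`, `lpc`, and `ar_impulse_response` are illustrative, not taken from any library.

```python
def autocorrelation(x, max_lag):
    """Return r[0..max_lag], where r[k] = sum over n of x[n] * x[n + k]."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k))
            for k in range(max_lag + 1)]


def lpc(x, order):
    """Estimate coefficients a[1..order] of the model
    x[n] ~ a[1]*x[n-1] + ... + a[order]*x[n-order]
    using the Levinson-Durbin recursion (an assumed estimation method).

    Returns (coeffs, err), where err is the residual prediction-error energy.
    """
    r = autocorrelation(x, order)
    a = [0.0] * (order + 1)  # a[0] is unused padding
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # signal is already perfectly predicted (or is all zeros)
        # Reflection coefficient for stage i.
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


def ar_impulse_response(coeffs, length):
    """Hypothetical helper: impulse response of an all-pole filter, for testing."""
    x = []
    for n in range(length):
        sample = 1.0 if n == 0 else 0.0
        for k, c in enumerate(coeffs, start=1):
            if n - k >= 0:
                sample += c * x[n - k]
        x.append(sample)
    return x
```

For example, `lpc(ar_impulse_response([1.3, -0.4], 400), 2)` should recover coefficients close to 1.3 and -0.4, because the impulse response of a stable all-pole filter satisfies the same normal equations that the recursion solves.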

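
The Bark and Mel overview in the table describes mapping physical frequency in Hz to perceptual scales. As a rough illustration, the sketch below uses two widely used analytic approximations that are assumptions here and do not come from the table: the 2595·log10(1 + f/700) mel formula, and the Zwicker and Terhardt (1980) arctangent formula for Bark. The function names are illustrative.

```python
import math


def hz_to_mel(f_hz):
    """Approximate mel value for a frequency in Hz (common log-based formula)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def hz_to_bark(f_hz):
    """Approximate critical-band rate in Bark (Zwicker and Terhardt, 1980)."""
    return (13.0 * math.atan(0.00076 * f_hz)
            + 3.5 * math.atan((f_hz / 7500.0) ** 2))


if __name__ == "__main__":
    # Print both perceptual values for a few frequencies to show the
    # compression of high frequencies relative to a linear Hz axis.
    for f in (100.0, 500.0, 1000.0, 4000.0, 15500.0):
        print(f, round(hz_to_mel(f), 1), round(hz_to_bark(f), 2))
```

Under these approximations, 1000 Hz maps to roughly 1000 mel, and the Bark value reaches about 24 near 15.5 kHz, which is consistent with the 24 critical bands mentioned in the overview.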