ScholarGate
助手

方法对比

并排查看您选择的方法;存在差异的行会高亮显示。

音频指纹×音高检测算法×
领域音乐信息检索音乐信息检索
方法族Machine learningMachine learning
起源年份20022002
提出者Jeroen HaitsmaAlain de Cheveigné
类型Perceptual audio hashingFundamental frequency estimation
开创性文献Haitsma, J., & Kalker, T. (2002). A highly robust audio fingerprinting system. In Proceedings of the International Symposium on Music Information Retrieval. link ↗de Cheveigné, A., & Kawahara, H. (2002). YIN, a fundamental frequency estimator for speech and music. The Journal of the Acoustical Society of America, 111(4), 1917-1930. DOI ↗
别名robust hashing, perceptual hashing, music identificationf0 detection, fundamental frequency tracking, monophonic pitch extraction
相关55
摘要Audio fingerprinting is a technique for creating a compact, robust identifier (fingerprint) for audio recordings that uniquely represents the content while being tolerant to modifications such as compression, noise, or time-shifting. Introduced by Haitsma and Kalker (2002), it underlies music identification services like Shazam and is critical for copyright enforcement, music matching, and library deduplication. A fingerprint is not a waveform hash; it captures perceptual content and remains stable across reasonable audio alterations.Pitch detection (or fundamental frequency estimation) is the task of automatically determining the perceived pitch of a monophonic (single-source) audio signal at each moment in time. Formalized by de Cheveigné and Kawahara (2002) through the YIN algorithm, it is foundational to music and speech processing. Pitch detection enables vocal analysis, music transcription, instrument tuning, and speech analysis. Monophonic pitch is unambiguous; polyphonic pitch detection is fundamentally harder and a distinct problem.
ScholarGate数据集
  1. v1
  2. 3 来源
  3. PUBLISHED
  1. v1
  2. 3 来源
  3. PUBLISHED

前往搜索 下载幻灯片

ScholarGate方法对比: Audio Fingerprinting · Pitch Detection Algorithm. 于 2026-06-18 检索自 https://scholargate.app/zh/compare