ScholarGate
助手

方法对比

并排查看您选择的方法;存在差异的行会高亮显示。

音频指纹×音乐分段×
领域音乐信息检索音乐信息检索
方法族Machine learningMachine learning
起源年份20022001
提出者Jeroen HaitsmaMasataka Goto
类型Perceptual audio hashingAudio structural analysis
开创性文献Haitsma, J., & Kalker, T. (2002). A highly robust audio fingerprinting system. In Proceedings of the International Symposium on Music Information Retrieval. link ↗Goto, M., & Hasegawa, Y. (2001). Automatic transcription of popular music audio. In Proceedings of the Fourth International Conference on Music Information Retrieval. link ↗
别名robust hashing, perceptual hashing, music identificationstructural segmentation, music structure analysis, section boundary detection
相关55
摘要Audio fingerprinting is a technique for creating a compact, robust identifier (fingerprint) for audio recordings that uniquely represents the content while being tolerant to modifications such as compression, noise, or time-shifting. Introduced by Haitsma and Kalker (2002), it underlies music identification services like Shazam and is critical for copyright enforcement, music matching, and library deduplication. A fingerprint is not a waveform hash; it captures perceptual content and remains stable across reasonable audio alterations.Music segmentation is the task of dividing a musical recording into distinct structural sections (e.g., verse, chorus, bridge, pre-chorus, outro). Introduced by Goto (2001), it identifies major structural boundaries and labels sections according to musical form. Segmentation is essential for music understanding, audio editing, and composition analysis. It enables higher-level tasks like cover song identification and song structure-aware music generation.
ScholarGate数据集
  1. v1
  2. 3 来源
  3. PUBLISHED
  1. v1
  2. 3 来源
  3. PUBLISHED

前往搜索 下载幻灯片

ScholarGate方法对比: Audio Fingerprinting · Music Segmentation. 于 2026-06-19 检索自 https://scholargate.app/zh/compare