방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| 오디오 지문 인식× | 음악 분할× | |
|---|---|---|
| 분야 | 음악 정보 검색 | 음악 정보 검색 |
| 계열 | Machine learning | Machine learning |
| 기원 연도≠ | 2002 | 2001 |
| 창시자≠ | Jeroen Haitsma | Masataka Goto |
| 유형≠ | Perceptual audio hashing | Audio structural analysis |
| 원전≠ | Haitsma, J., & Kalker, T. (2002). A highly robust audio fingerprinting system. In Proceedings of the International Symposium on Music Information Retrieval. link ↗ | Goto, M., & Hasegawa, Y. (2001). Automatic transcription of popular music audio. In Proceedings of the Fourth International Conference on Music Information Retrieval. link ↗ |
| 별칭 | robust hashing, perceptual hashing, music identification | structural segmentation, music structure analysis, section boundary detection |
| 관련 | 5 | 5 |
| 요약≠ | Audio fingerprinting is a technique for creating a compact, robust identifier (fingerprint) for audio recordings that uniquely represents the content while being tolerant to modifications such as compression, noise, or time-shifting. Introduced by Haitsma and Kalker (2002), it underlies music identification services like Shazam and is critical for copyright enforcement, music matching, and library deduplication. A fingerprint is not a waveform hash; it captures perceptual content and remains stable across reasonable audio alterations. | Music segmentation is the task of dividing a musical recording into distinct structural sections (e.g., verse, chorus, bridge, pre-chorus, outro). Introduced by Goto (2001), it identifies major structural boundaries and labels sections according to musical form. Segmentation is essential for music understanding, audio editing, and composition analysis. It enables higher-level tasks like cover song identification and song structure-aware music generation. |
| ScholarGate데이터셋 ↗ |
|
|