手法を比較
選択した手法を並べて確認できます。異なる行はハイライト表示されます。
| オーディオフィンガープリンティング× | 音楽ジャンル分類× | |
|---|---|---|
| 分野 | 音楽情報検索 | 音楽情報検索 |
| 系統 | Machine learning | Machine learning |
| 提唱年 | 2002 | 2002 |
| 提唱者≠ | Jeroen Haitsma | George Tzanetakis |
| 種類≠ | Perceptual audio hashing | Audio feature-based classification |
| 原典≠ | Haitsma, J., & Kalker, T. (2002). A highly robust audio fingerprinting system. In Proceedings of the International Symposium on Music Information Retrieval. link ↗ | Tzanetakis, G., & Cook, P. (2002). Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing, 10(5), 293-302. DOI ↗ |
| 別名 | robust hashing, perceptual hashing, music identification | genre recognition, music categorization, style classification |
| 関連 | 5 | 5 |
| 概要≠ | Audio fingerprinting is a technique for creating a compact, robust identifier (fingerprint) for audio recordings that uniquely represents the content while being tolerant to modifications such as compression, noise, or time-shifting. Introduced by Haitsma and Kalker (2002), it underlies music identification services like Shazam and is critical for copyright enforcement, music matching, and library deduplication. A fingerprint is not a waveform hash; it captures perceptual content and remains stable across reasonable audio alterations. | Music genre classification is the task of automatically assigning genre labels (rock, jazz, classical, pop, etc.) to audio recordings. Introduced formally by Tzanetakis and Cook (2002), it is one of the earliest and most studied music information retrieval problems. It remains critical for music discovery, recommendation systems, digital library organization, and music streaming services. Modern systems achieve high accuracy on standard datasets using deep learning. |
| ScholarGateデータセット ↗ |
|
|