Music Segmentation
Music segmentation is the task of dividing a musical recording into distinct structural sections (e.g., verse, chorus, bridge, pre-chorus, outro). Introduced by Goto (2001), the task involves identifying the major structural boundaries in a recording and labeling the resulting sections according to musical form. Segmentation is essential for music understanding, audio editing, and composition analysis, and it enables higher-level tasks such as cover song identification and structure-aware music generation.
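To make the two parts of the task (boundaries and labels) concrete, the following is a minimal sketch of how a segmentation result might be represented in Python. It is not taken from any of the cited works: the `Segment` class, the `build_segments` and `label_at` helpers, and the example times and labels are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Segment:
    """One structural section of a recording (hypothetical representation)."""
    start: float  # section start time in seconds
    end: float    # section end time in seconds
    label: str    # section name, e.g. "verse" or "chorus"


def build_segments(boundaries: List[float], labels: List[str]) -> List[Segment]:
    """Pair each consecutive pair of boundary times with a section label."""
    if len(boundaries) != len(labels) + 1:
        raise ValueError("need exactly one more boundary than labels")
    for earlier, later in zip(boundaries, boundaries[1:]):
        if earlier >= later:
            raise ValueError("boundaries must be strictly increasing")
    return [
        Segment(start, end, label)
        for start, end, label in zip(boundaries, boundaries[1:], labels)
    ]


def label_at(segments: List[Segment], t: float) -> Optional[str]:
    """Return the label of the section containing time t, or None if outside."""
    for seg in segments:
        if seg.start <= t < seg.end:
            return seg.label
    return None


# Invented example annotation (times in seconds), not real data.
boundaries = [0.0, 15.0, 45.0, 75.0, 105.0, 120.0]
labels = ["intro", "verse", "chorus", "verse", "outro"]
segments = build_segments(boundaries, labels)
```

With this representation, `label_at(segments, 50.0)` looks up the section active 50 seconds into the recording. A segmentation system would produce the `boundaries` and `labels` lists from audio; that step is outside the scope of this sketch.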
Sources
- Goto, M., & Hasegawa, Y. (2001). Automatic transcription of popular music audio. In Proceedings of the Fourth International Conference on Music Information Retrieval.
- Levy, M., & Sandler, M. (2008). Structural segmentation of musical audio by constrained clustering. IEEE Transactions on Audio, Speech, and Language Processing, 16(2), 318-326. DOI: 10.1109/TASL.2007.911636
- McVicar, M., Santos-Rodríguez, R., Ni, Y., & De Bie, T. (2014). Automatic annotation of musical key and time signature from audio using Hidden Markov Models. In Proceedings of the International Society for Music Information Retrieval Conference.