Machine learningStructure analysis

Music Segmentation

Music segmentation is the task of dividing a musical recording into distinct structural sections (e.g., verse, chorus, bridge, pre-chorus, outro). Introduced by Goto (2001), it identifies major structural boundaries and labels sections according to musical form. Segmentation is essential for music understanding, audio editing, and composition analysis. It enables higher-level tasks like cover song identification and song structure-aware music generation.

Open in MethodMindSoonVideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Goto, M., & Hasegawa, Y. (2001). Automatic transcription of popular music audio. In Proceedings of the Fourth International Conference on Music Information Retrieval. link
  2. Levy, M., & Sandler, M. (2008). Structural segmentation of musical audio by constrained clustering. IEEE Transactions on Audio, Speech, and Language Processing, 16(2), 318-326. DOI: 10.1109/TASL.2007.911636
  3. McVicar, M., Santos-Rodríguez, R., Ni, Y., & De Bie, T. (2014). Automatic annotation of musical key and time signature from audio using Hidden Markov Models. In Proceedings of the International Society for Music Information Retrieval Conference. link

Related methods

Referenced by

ScholarGateMusic Segmentation (Music Segmentation and Structure Detection Algorithm). Retrieved 2026-06-04 from https://scholargate.app/en/music-information-retrieval/music-segmentation