Machine learningFeature extraction
Pitch Detection Algorithm
Pitch detection (or fundamental frequency estimation) is the task of automatically determining the perceived pitch of a monophonic (single-source) audio signal at each moment in time. Formalized by de Cheveigné and Kawahara (2002) through the YIN algorithm, it is foundational to music and speech processing. Pitch detection enables vocal analysis, music transcription, instrument tuning, and speech analysis. Monophonic pitch is unambiguous; polyphonic pitch detection is fundamentally harder and a distinct problem.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- de Cheveigné, A., & Kawahara, H. (2002). YIN, a fundamental frequency estimator for speech and music. The Journal of the Acoustical Society of America, 111(4), 1917-1930. DOI: 10.1121/1.1458024 ↗
- McLeod, P., & Wyvill, G. (2005). A smarter way to find pitch. In Proceedings of the International Computer Music Conference. link ↗
- Mauch, M., Cannam, C., Bittner, R., Fazekas, G., Salamon, J., Wade, J., & Benetos, E. (2015). Computer-aided Research on Monophonic Singing. In Frontiers in Psychology. DOI: 10.3389/fpsyg.2015.00599 ↗