TY - JOUR
T1 - Multiple fundamental frequency estimation based on sparse representations in a structured dictionary
AU - Genussov, Michal
AU - Cohen, Israel
N1 - Funding Information: ✩ This research is part of the M.Sc. thesis of the first author, M. Genussov, “Transcription and classification of audio data by sparse representations and geometric methods”, Technion, Israel Institute of Technology, 2010. ✩✩ This research was supported by the Israel Science Foundation (grant no. 1130/ 11). * Corresponding author. E-mail addresses: [email protected] (M. Genussov), [email protected] (I. Cohen).
PY - 2013/1
Y1 - 2013/1
N2 - Automatic transcription of polyphonic music is an important task in audio signal processing, which involves identifying the fundamental frequencies (pitches) of several notes played at a time. Its difficulty stems from the fact that harmonics of different notes tend to overlap, especially in western music. This causes a problem in assigning the harmonics to their true fundamental frequencies, and in deducing spectra of several notes from their sum. We present here a multi-pitch estimation algorithm based on sparse representations in a structured dictionary, suitable for the spectra of music signals. In the vectors of this dictionary, most of the elements are forced to be zero except the elements that represent the fundamental frequencies and their harmonics. Thanks to the structured dictionary, the algorithm does not require a diverse or a large dataset for training and is computationally more efficient than alternative methods. The performance of the proposed structured dictionary transcription system is empirically examined, and its advantage is demonstrated compared to alternative dictionary learning methods.
AB - Automatic transcription of polyphonic music is an important task in audio signal processing, which involves identifying the fundamental frequencies (pitches) of several notes played at a time. Its difficulty stems from the fact that harmonics of different notes tend to overlap, especially in western music. This causes a problem in assigning the harmonics to their true fundamental frequencies, and in deducing spectra of several notes from their sum. We present here a multi-pitch estimation algorithm based on sparse representations in a structured dictionary, suitable for the spectra of music signals. In the vectors of this dictionary, most of the elements are forced to be zero except the elements that represent the fundamental frequencies and their harmonics. Thanks to the structured dictionary, the algorithm does not require a diverse or a large dataset for training and is computationally more efficient than alternative methods. The performance of the proposed structured dictionary transcription system is empirically examined, and its advantage is demonstrated compared to alternative dictionary learning methods.
KW - Multi-pitch estimation
KW - Music information retrieval
KW - Piano transcription
KW - Sparse representations
UR - http://www.scopus.com/inward/record.url?scp=84869500090&partnerID=8YFLogxK
U2 - https://doi.org/10.1016/j.dsp.2012.08.012
DO - https://doi.org/10.1016/j.dsp.2012.08.012
M3 - مقالة
SN - 1051-2004
VL - 23
SP - 390
EP - 400
JO - Digital Signal Processing: A Review Journal
JF - Digital Signal Processing: A Review Journal
IS - 1
ER -