Multiple fundamental frequency estimation based on sparse representations in a structured dictionary

Michal Genussov, Israel Cohen

Research output: Contribution to journalArticlepeer-review

Abstract

Automatic transcription of polyphonic music is an important task in audio signal processing, which involves identifying the fundamental frequencies (pitches) of several notes played at a time. Its difficulty stems from the fact that harmonics of different notes tend to overlap, especially in western music. This causes a problem in assigning the harmonics to their true fundamental frequencies, and in deducing spectra of several notes from their sum. We present here a multi-pitch estimation algorithm based on sparse representations in a structured dictionary, suitable for the spectra of music signals. In the vectors of this dictionary, most of the elements are forced to be zero except the elements that represent the fundamental frequencies and their harmonics. Thanks to the structured dictionary, the algorithm does not require a diverse or a large dataset for training and is computationally more efficient than alternative methods. The performance of the proposed structured dictionary transcription system is empirically examined, and its advantage is demonstrated compared to alternative dictionary learning methods.

Original languageEnglish
Pages (from-to)390-400
Number of pages11
JournalDigital Signal Processing: A Review Journal
Volume23
Issue number1
DOIs
StatePublished - Jan 2013

Keywords

  • Multi-pitch estimation
  • Music information retrieval
  • Piano transcription
  • Sparse representations

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Multiple fundamental frequency estimation based on sparse representations in a structured dictionary'. Together they form a unique fingerprint.

Cite this