Real-time online singing voice separation from monaural recordings using robust low-rank modeling

Pablo Sprechmann, Alex Bronstein, Guillermo Sapiro

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Separating the leading vocals from the musical accompaniment is a challenging task that appears naturally in several music processing applications. Robust principal component analysis (RPCA) has been recently employed to this problem producing very successful results. The method decomposes the signal into a low-rank component corresponding to the accompaniment with its repetitive structure, and a sparse component corresponding to the voice with its quasi-harmonic structure. In this paper we first introduce a non-negative variant of RPCA, termed as robust low-rank non-negative matrix factorization (RNMF). This new framework better suits audio applications. We then propose two efficient feed-forward architectures that approximate the RPCA and RNMF with low latency and a fraction of the complexity of the original optimization method. These approximants allow incorporating elements of unsupervised, semi- and fully-supervised learning into the RPCA and RNMF frameworks. Our basic implementation shows several orders of magnitude speedup compared to the exact solvers with no performance degradation, and allows online and faster-than-real-time processing. Evaluation on the MIR-1K dataset demonstrates state-of-the-art performance.

Original languageEnglish
Title of host publicationProceedings of the 13th International Society for Music Information Retrieval Conference, ISMIR 2012
Pages67-72
Number of pages6
StatePublished - 2012
Externally publishedYes
Event13th International Society for Music Information Retrieval Conference, ISMIR 2012 - Porto, Portugal
Duration: 8 Oct 201212 Oct 2012

Publication series

NameProceedings of the 13th International Society for Music Information Retrieval Conference, ISMIR 2012

Conference

Conference13th International Society for Music Information Retrieval Conference, ISMIR 2012
Country/TerritoryPortugal
CityPorto
Period8/10/1212/10/12

All Science Journal Classification (ASJC) codes

  • Music
  • Information Systems

Fingerprint

Dive into the research topics of 'Real-time online singing voice separation from monaural recordings using robust low-rank modeling'. Together they form a unique fingerprint.

Cite this