Gain-robust multi-pitch tracking using sparse nonnegative matrix factorization

Robert Peharz, Michael Wohlmayr, Franz Pernkopf

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

5 Citations (Scopus)

Abstract

While nonnegative matrix factorization (NMF) has successfully been applied for gain-robust multi-pitch detection, a method to track pitch values over time was not provided. We embed NMF-based pitch detection into a recently proposed pitch-tracking system, based on a factorial hidden Markov model (FHMM). The original system models speech spectra with Gaussian mixture models, which is sensitive to a gain mismatch between training and test data. We therefore combine the advantages of these two approaches and derive a gain-adaptive observation model for the FHMM. As training algorithm we use a modification of ℓ0-sparse NMF, which represents the short-time spectrum with scalable basis vectors. In experiments we show that the new approach significantly increases the gain-robustness of the original tracking system.

Original languageEnglish
Title of host publication2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers
Pages5416-5419
Number of pages4
ISBN (Print)9781457705397
DOIs
Publication statusPublished - 18 Aug 2011
Externally publishedYes
Event36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Prague, Czech Republic
Duration: 22 May 201127 May 2011

Conference

Conference36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
CountryCzech Republic
CityPrague
Period22/05/1127/05/11

Keywords

  • factorial model
  • multi-pitch
  • sparse NMF

Fingerprint Dive into the research topics of 'Gain-robust multi-pitch tracking using sparse nonnegative matrix factorization'. Together they form a unique fingerprint.

  • Cite this

    Peharz, R., Wohlmayr, M., & Pernkopf, F. (2011). Gain-robust multi-pitch tracking using sparse nonnegative matrix factorization. In 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings (pp. 5416-5419). [5947583] Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/ICASSP.2011.5947583