Speech enhancement using emotion dependent codebooks

D.H.R. Naidu, S. Srinivasan

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)

Abstract

Several speech enhancement approaches utilize trained models of clean speech data, such as codebooks, Gaussian mixtures, and hidden Markov models. These models are typically trained on neutral clean speech data, without any emotion. However, in practical scenarios, emotional speech is a common occurrence, which brings into question the suitability of using models trained on neutral speech for enhancement of noisy emotional speech. We investigate this problem using the example of a codebook-based speech enhancement approach, which utilizes trained codebooks of linear prediction parameters. Anger and happiness are used as examples of emotions. Our experiments demonstrate that employing emotion-dependent speech codebooks results in a significant benefit over using emotion-independent codebooks for enhancing emotional noisy speech. We also present results using a Bayesian framework employing both emotiondependent and independent speech codebooks that exhibits a robust behavior when the type of emotion is not known a priori. Index Terms ?? Speech enhancement, codebook, emotional speech
Original languageEnglish
Title of host publicationProceedings of IWAENC 2012, International Workshop on Acoustic Signal Enhancement, September 4-6, 2012, Aachen, Germany
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers
Pages1-4
ISBN (Print)978-3-8007-3451-1
Publication statusPublished - 2012

Fingerprint

Dive into the research topics of 'Speech enhancement using emotion dependent codebooks'. Together they form a unique fingerprint.

Cite this