Speech enhancement using emotion dependent codebooks

D.H.R. Naidu, S. Srinivasan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

1 Citation (Scopus)


Several speech enhancement approaches utilize trained models of clean speech data, such as codebooks, Gaussian mixtures, and hidden Markov models. These models are typically trained on neutral clean speech data, without any emotion. In practical scenarios, however, emotional speech is common, which calls into question the suitability of models trained on neutral speech for enhancing noisy emotional speech. We investigate this problem using the example of a codebook-based speech enhancement approach, which utilizes trained codebooks of linear prediction parameters. Anger and happiness are used as examples of emotions. Our experiments demonstrate that employing emotion-dependent speech codebooks results in a significant benefit over using emotion-independent codebooks for enhancing emotional noisy speech. We also present results using a Bayesian framework that employs both emotion-dependent and emotion-independent speech codebooks and exhibits robust behavior when the type of emotion is not known a priori.

Index Terms: speech enhancement, codebook, emotional speech
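
The abstract does not spell out the Bayesian framework, so the following is only an illustrative sketch of one generic way to combine several codebooks when the emotion is unknown: posterior-weighted averaging of per-codebook estimates. It is not the authors' formulation. The function names, the three-codebook setup, and all numeric values are hypothetical and chosen purely for illustration.

```python
import math

def codebook_posteriors(log_likelihoods, priors):
    """Posterior probability of each codebook given per-codebook log-likelihoods."""
    # Work in the log domain and subtract the maximum for numerical stability.
    log_post = [ll + math.log(p) for ll, p in zip(log_likelihoods, priors)]
    peak = max(log_post)
    unnorm = [math.exp(v - peak) for v in log_post]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def combine_estimates(estimates, weights):
    """Posterior-weighted average of per-codebook spectral estimates."""
    n_bins = len(estimates[0])
    return [sum(w * est[k] for w, est in zip(weights, estimates))
            for k in range(n_bins)]

# Hypothetical values for one frame and three codebooks
# (assumed order: neutral, anger, happiness).
log_likelihoods = [-120.0, -115.0, -123.0]
priors = [1 / 3, 1 / 3, 1 / 3]
estimates = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

weights = codebook_posteriors(log_likelihoods, priors)
combined = combine_estimates(estimates, weights)
```

Under these assumed numbers the second codebook has the highest likelihood and therefore dominates the combined estimate, while the other codebooks still contribute in proportion to their posterior weight. That weighting is what makes such a scheme tolerant of an unknown emotion type.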
Original language: English
Title of host publication: Proceedings of IWAENC 2012, International Workshop on Acoustic Signal Enhancement, September 4-6, 2012, Aachen, Germany
Place of publication: Piscataway
Publisher: Institute of Electrical and Electronics Engineers
ISBN (print): 978-3-8007-3451-1
Status: Published - 2012

