A spectrum-based speech enhancement system estimates and tracks the noise spectrum of a mixed speech and noise signal. The system frames and windows a digitized signal and applies the frames to a fast Fourier transform processor to generate discrete Fourier transformed (DFT) signals representing the speech plus noise signal. The system calculates the power spectrum of each frame. The speech enhancement system employs a leaky integrator that is responsive to identified noise-only components of the signal. The leaky integrator has an adaptive time-constant which compensates for non-stationary environmental noise. In addition, the speech enhancement system identified noise-only intervals by using a technique that monitors the Teager energy of the signal. The transition between noise-only signals and speech plus noise signals is softened by being made non-binary. Once the noise spectrum has been estimated, it is used to generate gain factors that multiply the DFT signals to produce noise-reduced DFT signals. The gain factors are generated based on an audible noise threshold. The method generates audible a priori and a posteriori signal to noise ratio signals and then calculates audible gain signals from these values.
|IPC||G10L 15/ 20 A N|
|Publication status||Published - 22 Jun 2000|
- speech, signal, components, frame, noise