Doorgaan naar hoofdnavigatie Doorgaan naar zoeken Ga verder naar hoofdinhoud

Modeling binaural signal detection

  • D.J. Breebaart

Onderzoeksoutput: ScriptieDissertatie 1 (Onderzoek TU/e / Promotie TU/e)

561 Downloads (Pure)

Samenvatting

With the advent of multimedia technology and powerful signal processing systems, audio processing and reproduction has gained renewed interest. Examples of products that have been developed are audio coding algorithms to efficiently store and transmit music and speech, or audio reproduction systems that create virtual sound sources. Usually, these systems have to meet the high audio quality of e.g. the compact-disc standard. Engineers have become aware of the fact that signal-to-noise ratios and distortion measures do not tell the whole story when it comes to sound quality. As a consequence, new algorithms have to be evaluated by extensive listening tests. Drawbacks of this method of evaluation are that these tests are expensive and time consuming. Moreover, listening tests usually do not give any insight why a specific algorithm does or does not work. Hence there is a demand for objective and fast evaluation tools for new audio technologies. One way to meet these demands is to develop a model of the auditory system that can predict the perceived distortion and which can indicate the nature of these distortions. This thesis describes and validates a model for the binaural hearing system. In particular, it aims at predicting the audibility of changes in arbitrary binaural stimuli. Two important properties for binaural hearing are interaural intensity differences (IIDs) and interaural time differences (ITDs) present in the waveforms arriving at both ears. These interaural differences enable us to estimate the position of a sound source but also contribute to our ability to detect signals in noisy environments. Hence one of the most important objectives for a comprehensive model is its ability to describe the sensitivity for interaural differences in a large variety of conditions. The basis of the model relies on psychoacoustic experiments that were performed with human listeners. In one series of experiments, subjects had to detect the presence of interaural cues for various statistical distributions of the IIDs and ITDs. The results revealed that the energy of the difference of the signals arriving at both ears following a peripheral filtering stage can successfully describe the sensitivity for interaural time and intensity differences. This approach is very similar to Durlach’s EC theory. Furthermore, other listening experiments with varying degrees of stimulus uncertainty revealed that the detection process of the binaural auditory system may well be simulated as a template-matching procedure. The idea of template matching based on the energy of the difference signal was incorporated in a time-domain detection model. This model transforms arbitrary stimuli into an internal representation. This representation comprises four dimensions: time, frequency channel, internal interaural delay and internal interaural level adjustment. The internal model activity as a function of these dimensions entails both binaural and monaural properties of the presented stimuli. The accuracy of these properties is limited by the addition of internal noise and by the limited frequency and time resolution incorporated in various stages of the model. An important model feature is the ’optimal detector’. This optimal detector analyzes the internal representation of the presented waveforms and extracts information from it, for example the presence or absence of a signal added to a masker. This process entails a strategy that optimally reduces the internal noise by integrating information across time and frequency channels. The model was tested for its ability to predict thresholds as a function of spectral and temporal stimulus parameters. During all simulations, all model parameters were kept constant. The results revealed that the model can account for a large variety of experimental data that are described in the literature. The most prominent finding was that the model can quantitatively account for the wider effective critical bandwidth observed in band-widening NoS?? experiments. This wider effective bandwidth is found if the threshold of audibility is measured for interaurally out-of-phase signals (S??) added to band-limited interaurally in-phase noise (No). In our model, this phenomenon is the result of the fact that the cue for detection is available in a range of filters if the masker bandwidth is sufficiently small. The increased effective bandwidth does therefore not reflect a worse binaural spectral resolution compared the monaural spectral resolution but it follows from the ability to integrate information across frequency. It was also shown that the optimal detector can account for effects found by manipulating temporal stimulus properties. To be more precise, the model can account for the phenomenon that the temporal resolution of the binaural auditory system obtained from stimuli with time-varying interaural correlations seems to be worse for sinusoidally-varying cross correlation than for rectangular correlation modulations. To extend the model’s predictive scope towards more natural listening conditions, experiments were performed with virtual sound sources, which were generated by using head-related transfer functions (HRTFs). The complexity of these impulse responses was gradually decreased by a spectral smoothing operation. During listening tests, subjects had to rate the audibility of this operation. The results revealed that the fine structure of HRTF phase and magnitude spectra is relatively unimportant for the generation of virtual sound sources in the horizontal plane. The same experiment was subsequently simulated with the model. Comparisons between subject data and model predictions showed that the model could not only predict whether the HRTF smoothing was audible or not, but that it could also predict the amount of perceptual degradation for supra-threshold HRTF smoothing.
Originele taal-2Engels
KwalificatieDoctor in de Filosofie
Toekennende instantie
  • Industrial Engineering and Innovation Sciences
Begeleider(s)/adviseur
  • Kohlrausch, Armin, Promotor
  • Colburn, H.S., Promotor, Externe Persoon
  • van de Par, S.L.J.D.E., Co-Promotor
Datum van toekenning20 jun. 2001
Plaats van publicatieEindhoven
Uitgever
Gedrukte ISBN's90-386-0963-9
DOI's
StatusGepubliceerd - 2001

Vingerafdruk

Duik in de onderzoeksthema's van 'Modeling binaural signal detection'. Samen vormen ze een unieke vingerafdruk.

Citeer dit