Samenvatting
With the advent of multimedia technology and powerful signal processing
systems, audio processing and reproduction has gained renewed interest.
Examples of products that have been developed are audio coding algorithms
to efficiently store and transmit music and speech, or audio reproduction
systems that create virtual sound sources. Usually, these systems have to
meet the high audio quality of e.g. the compact-disc standard. Engineers have
become aware of the fact that signal-to-noise ratios and distortion measures
do not tell the whole story when it comes to sound quality. As a consequence,
new algorithms have to be evaluated by extensive listening tests. Drawbacks
of this method of evaluation are that these tests are expensive and time
consuming. Moreover, listening tests usually do not give any insight why a
specific algorithm does or does not work. Hence there is a demand for objective
and fast evaluation tools for new audio technologies. One way to meet
these demands is to develop a model of the auditory system that can predict
the perceived distortion and which can indicate the nature of these distortions.
This thesis describes and validates a model for the binaural hearing system.
In particular, it aims at predicting the audibility of changes in arbitrary
binaural stimuli. Two important properties for binaural hearing are interaural
intensity differences (IIDs) and interaural time differences (ITDs) present in
the waveforms arriving at both ears. These interaural differences enable us
to estimate the position of a sound source but also contribute to our ability
to detect signals in noisy environments. Hence one of the most important
objectives for a comprehensive model is its ability to describe the sensitivity
for interaural differences in a large variety of conditions.
The basis of the model relies on psychoacoustic experiments that were
performed with human listeners. In one series of experiments, subjects had
to detect the presence of interaural cues for various statistical distributions of
the IIDs and ITDs. The results revealed that the energy of the difference of the
signals arriving at both ears following a peripheral filtering stage can successfully
describe the sensitivity for interaural time and intensity differences. This
approach is very similar to Durlach’s EC theory. Furthermore, other listening
experiments with varying degrees of stimulus uncertainty revealed that the
detection process of the binaural auditory system may well be simulated as a
template-matching procedure.
The idea of template matching based on the energy of the difference signal
was incorporated in a time-domain detection model. This model transforms
arbitrary stimuli into an internal representation. This representation comprises
four dimensions: time, frequency channel, internal interaural delay
and internal interaural level adjustment. The internal model activity as a
function of these dimensions entails both binaural and monaural properties
of the presented stimuli. The accuracy of these properties is limited by the
addition of internal noise and by the limited frequency and time resolution
incorporated in various stages of the model. An important model feature is
the ’optimal detector’. This optimal detector analyzes the internal representation
of the presented waveforms and extracts information from it, for example
the presence or absence of a signal added to a masker. This process entails a
strategy that optimally reduces the internal noise by integrating information
across time and frequency channels.
The model was tested for its ability to predict thresholds as a function of
spectral and temporal stimulus parameters. During all simulations, all model
parameters were kept constant. The results revealed that the model can
account for a large variety of experimental data that are described in the
literature. The most prominent finding was that the model can quantitatively
account for the wider effective critical bandwidth observed in band-widening
NoS?? experiments. This wider effective bandwidth is found if the threshold
of audibility is measured for interaurally out-of-phase signals (S??) added
to band-limited interaurally in-phase noise (No). In our model, this phenomenon
is the result of the fact that the cue for detection is available in a
range of filters if the masker bandwidth is sufficiently small. The increased
effective bandwidth does therefore not reflect a worse binaural spectral
resolution compared the monaural spectral resolution but it follows from
the ability to integrate information across frequency. It was also shown
that the optimal detector can account for effects found by manipulating
temporal stimulus properties. To be more precise, the model can account
for the phenomenon that the temporal resolution of the binaural auditory
system obtained from stimuli with time-varying interaural correlations seems
to be worse for sinusoidally-varying cross correlation than for rectangular
correlation modulations.
To extend the model’s predictive scope towards more natural listening conditions,
experiments were performed with virtual sound sources, which were
generated by using head-related transfer functions (HRTFs). The complexity
of these impulse responses was gradually decreased by a spectral smoothing
operation. During listening tests, subjects had to rate the audibility of this
operation. The results revealed that the fine structure of HRTF phase and
magnitude spectra is relatively unimportant for the generation of virtual
sound sources in the horizontal plane. The same experiment was subsequently
simulated with the model. Comparisons between subject data and
model predictions showed that the model could not only predict whether the
HRTF smoothing was audible or not, but that it could also predict the amount
of perceptual degradation for supra-threshold HRTF smoothing.
| Originele taal-2 | Engels |
|---|---|
| Kwalificatie | Doctor in de Filosofie |
| Toekennende instantie |
|
| Begeleider(s)/adviseur |
|
| Datum van toekenning | 20 jun. 2001 |
| Plaats van publicatie | Eindhoven |
| Uitgever | |
| Gedrukte ISBN's | 90-386-0963-9 |
| DOI's | |
| Status | Gepubliceerd - 2001 |
Vingerafdruk
Duik in de onderzoeksthema's van 'Modeling binaural signal detection'. Samen vormen ze een unieke vingerafdruk.Citeer dit
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver