A Front-Back Confusion Metric in Horizontal Sound Localization: The FBC Score

Tim Fischer; Marco Caversaccio; Wilhelm Wimmer

doi:10.1101/2020.02.12.945303

Abstract

In sound localization experiments, currently used metrics for front-back confusion (FBC) analysis weight the occurring FBCs equally, regardless of their deviation from the cone of confusion. To overcome this limitation, we introduce the FBC Score. A sound localization experiment in the horizontal plane with 12 bilaterally implanted cochlear implants (CI) users and 12 normal hearing subjects was performed to validate the method with real clinical data. The overall FBC Rate of the CI users was twice as high as the FBC Score. For the control group, the FBC Rate was 4 times higher than the FBC Score. The results indicate that the FBC Rate is inflated by FBCs that show a considerable deviation from the corresponding value on the cone of confusion.

I. Introduction

SOUND source localization primarily relies on the evaluation of interaural time and level differences. Front-back confusions (FBCs) occur because sound sources on the cone of confusion are equidistant from the left ear and the right ear and thus provide identical interaural time and level differences for a listener [1]. An illustration for a sound source on the cone of confusion is given in Figure 1.

Fig. 1.

The cone of confusion is defined as an imaginary cone extending outward from the center of the head [2]. The axis of the cone is defined by the interaural axis. Due to the symmetry of this cone around the interaural axis, sound source (A) and (B) produce identical interaural time and level differences as they are equidistant from the left ear and the right ear. Sound sources on the cone of confusion are thus useless for binaural sound localization.

FBC errors are not simply outliers with extreme error amplitudes. The underlying cause is different from normal localization errors (see Figure 1) which is why FBCs should be analyzed seperately [3]–[7].

There is a continuing debate in the literature on the definition of an FBC error. In horizontal sound localization experiments, the most common definition classifies responses crossing the interaural axis as FBCs [3]–[6], [8], [9]. This definition, which defines the FBC Rate, is sufficient for experiments that only require a coarse angular resolution of the test setup or feedback method [7], [10], [11]. However, such setups limit the measurement resolution for sound localization accuracy.

A refined FBC definition requires FBC responses to fall within a specific range of a response-dependent regression line, which is mirrored on the interaural axis [11], [12].

Further definitions as to what constitutes a FBC error are defined by fixed threshold values. These thresholds refer either to a minimum required deviation of the given response in relation to the interaural axis or to a maximum allowed deviation between the response and the stimulus mirrored on the interaural axis [13]. There is no general consensus regarding the magnitude of the thresholds.

Apart from thresholds that are not standardized, the current limitation of FBC analysis approaches is that all FBCs are considered equally strong. The deviation from the measurement position to which the error refers, which is the deviation of the response from the stimulus position mirrored on the interaural axis, is not considered. In addition, special care should be taken with stimuli presenting close to the interaural axis, as localization errors overlap with FBC errors in this area.

Herein, we propose a metric that allows to quantify the impact of FBCs on sound localization outcomes in the horizontal plane more precise than with the commonly used FBC Rate. The FBC Score can be applied regardless of the number of available localization results, the localization performance of the subjects and the measurement setup.

II. Materials and Methods

A. Study design and participants

This study was designed in accordance with the Declaration of Helsinki and was approved by the local institutional review board (KEK-BE, No. 2018-00901). Twelve bilaterally implanted cochlear implants (CIs) users participated in the study, all using Sonnet (Med-El GmbH, Innsbruck, Austria) processors with an omnidirectional microphone setting. The CI users had a monosyllabic word recognition score in quiet of 70% or better at 60/65 dB_SPL. For comparison, a control group of 12 normal hearing (NH) adults was included.

B. Static Sound Source Localization Test

Static sound source localization was performed with 12 equally spaced loudspeakers arranged in a circle around the subject. The test stimulus consisted of pink noise with 200 ms length. To prevent the use of monaural level cues, level roving between 60 to 70 dB_SPL was applied. In total, 36 stimuli per subject were played, 3 stimuli per loudspeaker. The order of the stimuli with respect to the loudspeaker was randomized [10]. The loudspeakers were hidden behind a sound transparent curtain, no prior knowledge about possible stimuli directions was provided to the subjects. The subjects’ feedback on the perceived location of the stimulus was recorded via a graphical user interface showing a dial with a resolution scale of 1-degree angle and a login button.

C. Calculation of the FBC Score

We define the outcome of a sound localization experiment as a set of stimuli and corresponding subject responses , where denotes the total number of test items. In relation to the FBC Rate (see Eq. 1), only responses that do cross the interaural axis are considered for the FBC analysis. After exclusion, we obtain a reduced set of stimuli S = {s₁, s₂, …, s_N} and responses R = {r₁, r₂, …, r_N} with N items.

The ideal position of an FBC, i.e., the position of a stimulus s_i mirrored on the interaural axis, is defined as the FBC center c_i. We compute the deviation θ_i between the response r_i and the FBC center c_i as the shortest absolute angular difference (minor arc). An example for one stimulus-response pair is given in Figure 2. The maximum allowable deviation θ_max,i is defined by the interaural axis and the FBC center c_i (see Figure 3). The farther away a response r_i is from its corresponding FBC center and the closer to the interaural axis, the less likely it is to be an FBC. Therefore, we introduce a weighting factor w_i for each measured FBC: where θ_max,i is the maximum deviation in the direction of the response (clockwise or counterclockwise) under consideration of the interaural axis. The weighting w_i ranges from 0 (response on the interaural axis) to 1 (a perfect FBC). An illustration of the procedure is given in Figure 3.

Fig. 2.

The FBC center c₁ is the stimulus origin s₁ mirrored on the interaural axis. The interaural axis corresponds to the line from 270° to 90°. Valid FBCs can occur within the light shaded stripe. The deviation of the response r₁ from c₁ is denoted with θ₁. In this example, the response r₁ lies in counter clockwise direction with respect to c₁.

Fig. 3.

Weighting factor calculation for the example shown in Figure 2. The y-axis shows the front-back confusion weighting factor w and the x-axis shows the rear azimuth. The deviation of the response r₁ from c₁ is denoted by θ₁. The response r₁ lies in counter clockwise (ccw) direction with respect to s₁, therefore θ_max,1 is equal to θ_max,1(ccw). The areas θ_max,1(ccw) and θ_max,1(cw) are limited by the interaural axis as defined in Figure 2.

To provide a subject-level FBC metric that considers deviations of the responses from the cone of confusion and their proximity to the interaural axis, we propose the FBC Score as defined in Equation 3.

An FBC Score of 100% indicates that all responses were FBCs and exactly match the stimulus positions mirrored on the interaural axis. In contrast, an FBC Score of 0% would indicate that the responses did not contain any FBCs during the trial. For a calculation example with numeric data, a step-by-step guide for the calculation of the FBC Score with corresponding explanations is provided in the Appendix A. To facilitate the calculations, we implemented the procedure as a MATLAB function¹. The script takes the sets of stimuli Ŝ and responses as input parameters. In addition to the calculations described above, a separate analysis of front-back and back-front confusions is performed.

III. Results

For the CI users, the overall FBC Score is 27% in contrast to the almost twice as high FBC Rate of 47% (p< .001, two-sided Wilcoxon signed rank test). For the NH control group, the FBC Rate is 4 times as high as the FBC Score (FBC Score: 1%, FBC Rate: 4%; p= 0.062). Table I shows the results of the FBC Score analysis compared to the FBC Rate for each CI user and NH subject.

View this table:

TABLE I.

The table shows the results on subject-level for the front-back confusion (FBC) score analysis compared to the FBC Rate for each cochlear implant (CI) user and normal hearing (NH) subject.

IV. Conclusion and Discussion

For the evaluation of sound localization experiments, the errors are quantified in terms of their deviation from the stimulus position. However, for FBC analysis, thus far, only a rough distinction between “no FBC” and “FBC” is made, and the occurrences are counted. In this report, we propose a method for quantifying the severity of an FBC, allowing a more precise analysis of this phenomenon.

In the example presented here, the FBC Rate would indicate that the tested subjects are prone to FBCs (47%). However, the FBC Score shows that this assumption does not necessarily hold true since the impact of the FBCs is significantly smaller by a factor of 1.7 (27%). Therefore, the FBC Rate in our example includes many FBCs with a considerable deviation from the corresponding stimulus position on the cone of confusion, to which this phenomenon actually refers.

An alternative option to illustrate this influence on the phenomenon of FBCs would be to indicate, in addition to the FBC Rate, the localization error with respect to the stimulus’ position mirrored on the interaural axis. In our opinion, however, this approach may overestimate the importance of FBCs caused by stimuli near the interaural axis for two reasons: First, the influence of FBCs near the interaural axis is rather small in real life situations. Second, both error types, i.e., the FBC and localization errors, overlap for stimuli originating from close to the interaural axis, which can lead to distortions in the FBC analysis. By weighting the distance between the stimulus and the interaural axis and by considering the proximity of the stimulus to the interaural axis, this influence on the assessment of the FBC phenomenon can be mitigated. The effect of this weighting is particularly demonstrated by the 4 times lower FBC Score compared to the FBC Rate in the NH control group. Here, all FBC-causing stimuli originated from the measurement position with the smallest distance to the interaural axis.

The studies in [11], [12] defined responses as FBCs if the maximum deviation from the FBC center was less than or equal to a 40 or 45-degree angle. Such a static threshold value is not feasible in clinical studies with hearing-impaired subjects, due to the large differences in localization accuracy between the NH control and the test group [14]. An explanation of the underlying theoretical and psychoacoustic mechanism would be missing for the justification of such a threshold value. Besides the static threshold value, the iterative regression method applied in [11], [12] relies on the distribution and number of sound localization responses. The FBC Score does neither depend on a predefined threshold value nor on the quality or quantity of the localization results. This may be especially important when evaluating the performance of hearing-impaired patients in clinical trials. These localization results often show a wide individual variability [14] and consist of limited samples due to the limited testing time and subject attention.

Acknowledgment

The authors would like to thank Christoph Schmid from the Department of ENT, Head and Neck Surgery, Bern University Hospital (Inselspital), for the recruitment of the CI users of this study.

Appendix A

Example calculation with numeric data

In the following, we illustrate the calculation of the FBC Score using a small example data set with 5 stimuli Ŝ and responses from a sound localization test in the horizontal plane (see Figure 4). Table II summarizes the results for the computational steps involved.

View this table:

TABLE II.

Data of the calculation example.

Fig. 4.

Visualization of the data for the example calculation of the front-back confusion (FBC) Score. Stimuli are indicated with Ŝ and responses with . In this example, all stimuli lie inside the frontal azimuth and thus responses inside rear azimuth (light shaded area) are considered FBCs.

Step 1 - Calculate the FBC Rate:

First, the FBC Rate is calculated. It is defined as the rate of the number of responses crossing the interaural axis with respect to the number of presented stimuli (see Eq. 1). Only does not cross the interaural axis. Excluding ŝ₅ and results in N = 4 pairs of stimuli s_i and responses r_i and an FBC Rate of 80%.

Step 2 - Calculation of c_i and θ_i:

The FBC center c_i is obtained by mirroring the stimulus position s_i on the interaural axis. The deviation θ_i is calculated as the absolute difference between the response r_i and c_i measured over the minor arc. For example, for stimulus 3, we have θ₃ = |r₃ − c₃| = |95° − 115°| = 20°.

Step 3 - Calculation of θ_max,i:

The maximum deviation θ_max,i between r_iand c_i is limited by the interaural axis. If r_i lies in clockwise direction of c_i, the clockwise limit is applied (θ_max,i= θ_max,i(cw)). The same applies for the counter clockwise direction.

For example, with stimulus s₃, the minor arc for the response r₃ lies in counter clockwise direction to the FBC center c₃, therefore the maximum deviation is θ_max,3 = θ_max,3(ccw) = 25°.

Step 4 - Calculation of w_i:

With θ_i and θ_max,i we compute the weighting factor w_i for each stimulus using equation (2).

Step 5 - FBC Score:

We now calculate the FBC Score as defined in Equation (3):

To facilitate the calculations, we implemented the procedure as a MATLAB function¹.

Footnotes

References

[1].↵
L. Rayleigh, “Xii. on our perception of sound direction,” The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, vol. 13, no. 74, pp. 214–232, 1907.
OpenUrl
[2].↵
H. Wallach, “On Sound Localization,” J Acoust Soc Am, vol. 10, no. 1, pp. 83–83, 7 1938.
OpenUrl
[3].↵
S. Carlile, P. Leong, and S. Hyams, “The nature and distribution of errors in sound localization by human listeners,” Hear Res, vol. 114, no. 1-2, pp. 179–196, 12 1997.
OpenUrl CrossRef PubMed Web of Science
[4].
F. L. Wightman and D. J. Kistler, “Resolution of front–back ambiguity in spatial hearing by listener and source movement,” J Acoust Soc Am, vol. 105, no. 5, pp. 2841–2853, 5 1999.
OpenUrl CrossRef PubMed Web of Science
[5].
P. A. Hill, P. A. Nelson, O. Kirkeby, and H. Hamada, “Resolution of front–back confusion in virtual acoustic imaging systems,” J Acoust Soc Am, vol. 108, no. 6, pp. 2901–2910, 12 2000.
OpenUrl PubMed
[6].↵
E. H. A. Langendijk, D. J. Kistler, and F. L. Wightman, “Sound localization in the presence of one or two distracters,” J Acoust Soc Am, vol. 109, no. 5, pp. 2123–2134, 5 2001.
OpenUrl CrossRef PubMed Web of Science
[7].↵
M. T. Pastore, S. J. Natale, W. A. Yost, and M. Dorman, “Head Movements Allow Listeners Bilaterally Implanted With Cochlear Implants to Resolve Front-Back Confusions,” Ear Hear., vol. 39, no. 6, pp. 1224–1231, 2018.
OpenUrl CrossRef
[8].↵
P. Zahorik, P. Bangayan, V. Sundareswaran, K. Wang, and C. Tam, “Perceptual recalibration in human sound localization: Learning to remediate front-back reversals,” J Acoust Soc Am, vol. 120, no. 1, pp. 343–359, 7 2006.
OpenUrl CrossRef PubMed Web of Science
[9].↵
Y. Cai, G. Chen, X. Zhong, G. Yu, H. Mo, J. Jiang, X. Chen, F. Zhao, and Y. Zheng, “Influence of Audiovisual Training on Horizontal Sound Localization and Its Related ERP Response,” Front Hum Neurosci, vol. 12, p. 423, 10 2018.
OpenUrl
[10].↵
W. Wimmer, M. Kompis, C. Stieger, M. Caversaccio, and S. Weder, “Directional microphone contralateral routing of signals in cochlear implant users: A within-subjects comparison,” Ear Hear., vol. 38, no. 3, pp. 368–373, 2017.
OpenUrl CrossRef
[11].↵
C. Montagne and Y. Zhou, “Audiovisual Interactions in Front and Rear Space,” Front Psychol, vol. 9, 5 2018.
[12].↵
E. A. Macpherson and J. C. Middlebrooks, “Localization of brief sounds: Effects of level and background noise,” J Acoust Soc Am, vol. 108, no. 4, pp. 1834–1849, 10 2000.
OpenUrl CrossRef PubMed Web of Science
[13].↵
1. P. Strumillo
T. Letowski and S. Letowski, “Localization error: Accuracy and precision of auditory localization,” in Advances in Sound Localization, P. Strumillo, Ed. Rijeka, Croatia: IntechOpen, 2011, ch. 4. [Online]. Available: https://doi.org/10.5772/15652
[14].↵
M. A. Akeroyd, “An overview of the major phenomena of the localization of sound sources by normal-hearing, hearing-impaired, and aided listeners.” Trends Hear., vol. 18, 12 2014.