RT Journal Article
SR Electronic
T1 Decoding Speech and Music Stimuli from the Frequency Following Response
JF bioRxiv
FD Cold Spring Harbor Laboratory
SP 661066
DO 10.1101/661066
A1 Steven Losorelli
A1 Blair Kaneshiro
A1 Gabriella A. Musacchia
A1 Nikolas H. Blevins
A1 Matthew B. Fitzgerald
YR 2019
UL http://biorxiv.org/content/early/2019/06/05/661066.abstract
AB The ability to differentiate complex sounds is essential for communication. Here, we propose using a machine-learning approach, called classification, to objectively evaluate auditory perception. In this study, we recorded frequency following responses (FFRs) from 13 normal-hearing adult participants to six short music and speech stimuli sharing similar fundamental frequencies but varying in overall spectral and temporal characteristics. Each participant completed a perceptual identification test using the same stimuli. We used linear discriminant analysis to classify FFRs. Results showed statistically significant FFR classification accuracies using both the full response epoch in the time domain (72.3% accuracy, p < 0.001) and the real and imaginary Fourier coefficients up to 1 kHz (74.6%, p < 0.001). We classified decomposed versions of the responses in order to examine which response features contributed to successful decoding. Classifier accuracies using Fourier magnitude and phase alone in the same frequency range were lower but still significant (58.2% and 41.3%, respectively; p < 0.001). Classification of overlapping 20-msec subsets of the FFR in the time domain similarly produced reduced but significant accuracies (42.3%–62.8%, p < 0.001). Participants’ mean perceptual responses were most accurate (90.6%, p < 0.001). Confusion matrices from FFR classifications and perceptual responses were converted to distance matrices and visualized as dendrograms. FFR classifications and perceptual responses demonstrated similar patterns of confusion across the stimuli. Our results demonstrate that classification can differentiate auditory stimuli from FFRs with high accuracy. Moreover, the reduced accuracies obtained when the FFR is decomposed in the time and frequency domains suggest that different response features contribute complementary information, similar to how the human auditory system is thought to rely on both timing and frequency information to accurately process sound. Taken together, these results suggest that FFR classification is a promising approach for objective assessment of auditory perception.
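The linear discriminant analysis (LDA) classification described in the abstract can be sketched as follows. This is a minimal numpy-only linear discriminant (pooled within-class covariance, equal priors) run on synthetic two-class feature vectors; it is an illustrative stand-in under those assumptions, not a reproduction of the authors' pipeline, which classified six stimulus classes from real FFR epochs and Fourier coefficients.

```python
import numpy as np

def fit_lda(X, y):
    """Fit a classic linear discriminant: per-class means plus a shared,
    lightly regularized within-class covariance."""
    classes = np.unique(y)
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    # Pooled within-class covariance, with a small ridge for numerical stability.
    Xc = np.vstack([X[y == c] - means[i] for i, c in enumerate(classes)])
    cov = Xc.T @ Xc / (len(X) - len(classes))
    cov += 1e-6 * np.eye(X.shape[1])
    prec = np.linalg.inv(cov)
    return classes, means, prec

def predict_lda(model, X):
    classes, means, prec = model
    # Linear discriminant score per class (equal class priors assumed).
    scores = X @ prec @ means.T - 0.5 * np.sum((means @ prec) * means, axis=1)
    return classes[np.argmax(scores, axis=1)]

# Toy demo: two synthetic "stimulus classes" of 8-dimensional features.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (50, 8)),
               rng.normal(2.0, 1.0, (50, 8))])
y = np.array([0] * 50 + [1] * 50)
model = fit_lda(X, y)
acc = np.mean(predict_lda(model, X) == y)
```

With well-separated synthetic classes the training accuracy is near 1.0; on real FFR data the paper reports cross-stimulus accuracies in the 41–75% range depending on the feature set.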
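The confusion-to-dendrogram step can likewise be sketched. The 3x3 confusion matrix below is hypothetical (values invented for illustration; the study used six stimuli): it is symmetrized, converted from a confusability (similarity) matrix into a distance matrix, and passed to hierarchical clustering, whose linkage can then be drawn with scipy.cluster.hierarchy.dendrogram.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage
from scipy.spatial.distance import squareform

# Hypothetical confusion matrix (rows: true stimulus, cols: predicted),
# expressed as proportions. Values are illustrative only.
conf = np.array([
    [0.80, 0.15, 0.05],
    [0.12, 0.78, 0.10],
    [0.06, 0.09, 0.85],
])

# Symmetrize, then turn confusability (similarity) into distance.
sim = (conf + conf.T) / 2.0
dist = 1.0 - sim
np.fill_diagonal(dist, 0.0)

# Hierarchical clustering on the condensed distance matrix; the resulting
# linkage Z can be visualized with scipy.cluster.hierarchy.dendrogram(Z).
Z = linkage(squareform(dist, checks=False), method="average")
```

Average linkage is one reasonable choice here; the abstract does not specify which linkage method the authors used.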