Efficient behavior is supported by humans’ ability to rapidly recognize acoustically distinct sounds as members of a common category. Within the auditory cortex, critical unanswered questions remain regarding the organization and dynamics of sound categorization. We performed intracerebral recordings during epilepsy surgery evaluation as 20 patient-participants listened to natural sounds. We then built encoding models to predict neural responses using sound representations extracted from different layers within a deep neural network (DNN) pretrained to categorize sounds from acoustics. This approach yielded accurate models of neural responses throughout the auditory cortex. The complexity of a cortical site’s representation (measured by the depth of the DNN layer that produced the best-performing model) was closely related to its anatomical location, with shallow, middle, and deep layers associated with core (primary auditory cortex), lateral belt, and parabelt regions, respectively. Smoothly varying gradients of representational complexity existed within these regions, with complexity increasing along a posteromedial-to-anterolateral direction in core and lateral belt and along posterior-to-anterior and dorsal-to-ventral dimensions in parabelt. We then characterized the time (relative to sound onset) at which feature representations emerged; this measure of temporal dynamics increased across the auditory hierarchy. Finally, we found separable effects of region and temporal dynamics on representational complexity: sites that took longer to begin encoding stimulus features had higher representational complexity independent of region, and downstream regions encoded more complex features independent of temporal dynamics. These findings suggest that hierarchies of timescales and representational complexity constitute a functional organizing principle of the auditory processing stream, underlying our ability to rapidly categorize sounds.
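
As a concrete illustration of the layer-wise encoding approach, the sketch below fits one cross-validated ridge encoding model per DNN layer for each recording site and takes the depth of the best-predicting layer as that site’s representational complexity. This is a minimal sketch on random placeholder data: the array sizes, the choice of ridge regression, and the time-averaged features are all assumptions, not the study’s actual pipeline.

```python
import numpy as np
from sklearn.linear_model import RidgeCV
from sklearn.model_selection import cross_val_predict

n_sounds, n_sites, n_layers = 165, 8, 6      # hypothetical sizes
rng = np.random.default_rng(0)

# layer_feats[l]: (n_sounds, n_units_l) activations from DNN layer l,
# e.g. time-averaged unit responses to each natural sound (placeholder data).
layer_feats = [rng.standard_normal((n_sounds, 32 * (l + 1))) for l in range(n_layers)]
# responses: (n_sounds, n_sites) neural response magnitude at each cortical site.
responses = rng.standard_normal((n_sounds, n_sites))

def encoding_accuracy(X, y):
    """Cross-validated prediction accuracy (Pearson r) of a ridge encoding model."""
    model = RidgeCV(alphas=np.logspace(-2, 4, 13))
    y_hat = cross_val_predict(model, X, y, cv=5)
    return np.corrcoef(y_hat, y)[0, 1]

# Representational complexity of a site = depth of the layer whose features
# best predict that site's responses.
complexity = np.empty(n_sites, dtype=int)
for s in range(n_sites):
    accs = [encoding_accuracy(X, responses[:, s]) for X in layer_feats]
    complexity[s] = int(np.argmax(accs))
```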
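The temporal-dynamics measure can be sketched as an onset-detection step: compute time-resolved encoding accuracy in short windows aligned to sound onset, then take the first time it reliably exceeds a threshold. The window spacing, threshold, and consecutive-window criterion below are hypothetical choices for illustration, not the study’s parameters.

```python
import numpy as np

def encoding_onset(acc, times, threshold=0.1, n_consecutive=3):
    """First time (s, relative to sound onset) at which time-resolved
    encoding accuracy stays above `threshold` for `n_consecutive` windows."""
    above = acc > threshold
    for i in range(len(above) - n_consecutive + 1):
        if above[i:i + n_consecutive].all():
            return times[i]
    return np.nan  # this site never reliably encodes the features

# Toy example: accuracy evaluated in 20 ms steps from -0.1 to 0.5 s,
# rising above chance roughly 120 ms after sound onset.
times = np.arange(-0.1, 0.5, 0.02)
acc = np.where(times > 0.12, 0.3, 0.0) + np.random.default_rng(0).normal(0, 0.02, times.size)
print(f"encoding onset ~ {encoding_onset(acc, times):.2f} s")
```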
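Finally, the claim of separable effects can be illustrated with a linear model that includes both predictors at once; below is a minimal sketch on simulated data, assuming an ordinary least-squares formulation (the study’s actual statistical test is not specified here, and all effect sizes are invented).

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(1)
n_sites = 300
df = pd.DataFrame({
    "onset": rng.uniform(0.05, 0.40, n_sites),                      # encoding onset (s)
    "region": rng.choice(["core", "lateral_belt", "parabelt"], n_sites),
})
# Simulate complexity with independent contributions of onset latency and
# region, mirroring the separable effects described above.
region_effect = df["region"].map({"core": 0.0, "lateral_belt": 3.0, "parabelt": 6.0})
df["complexity"] = 10 * df["onset"] + region_effect + rng.normal(0, 1, n_sites)

# If both the onset term and the region terms remain significant when fit
# jointly, their effects on complexity are separable.
fit = smf.ols("complexity ~ onset + C(region)", data=df).fit()
print(fit.summary())
```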