分布语义学
计算机科学
自然语言处理
语义特征
人工智能
语义学(计算机科学)
特征(语言学)
聚类分析
代表(政治)
感知
语义属性
语义相似性
语言学
心理学
神经科学
法学
程序设计语言
哲学
政治
政治学
作者
Brian Riordan,Michael N. Jones
标识
DOI:10.1111/j.1756-8765.2010.01111.x
摘要
Abstract Since their inception, distributional models of semantics have been criticized as inadequate cognitive theories of human semantic learning and representation. A principal challenge is that the representations derived by distributional models are purely symbolic and are not grounded in perception and action; this challenge has led many to favor feature‐based models of semantic representation. We argue that the amount of perceptual and other semantic information that can be learned from purely distributional statistics has been underappreciated. We compare the representations of three feature‐based and nine distributional models using a semantic clustering task. Several distributional models demonstrated semantic clustering comparable with clustering‐based on feature‐based representations. Furthermore, when trained on child‐directed speech, the same distributional models perform as well as sensorimotor‐based feature representations of children’s lexical semantic knowledge. These results suggest that, to a large extent, information relevant for extracting semantic categories is redundantly coded in perceptual and linguistic experience. Detailed analyses of the semantic clusters of the feature‐based and distributional models also reveal that the models make use of complementary cues to semantic organization from the two data streams. Rather than conceptualizing feature‐based and distributional models as competing theories, we argue that future focus should be on understanding the cognitive mechanisms humans use to integrate the two sources.
科研通智能强力驱动
Strongly Powered by AbleSci AI