变化(天文学)
计算机科学
构造(python库)
管道(软件)
自然语言处理
空格(标点符号)
语言学
功能(生物学)
人工智能
语义相似性
社会学
天体物理学
进化生物学
生物
操作系统
物理
哲学
程序设计语言
作者
Naitian Zhou,David Jurgens,David Bamman
出处
期刊:Cornell University - arXiv
日期:2023-11-16
被引量:2
标识
DOI:10.48550/arxiv.2311.09130
摘要
Much work in the space of NLP has used computational methods to explore sociolinguistic variation in text. In this paper, we argue that memes, as multimodal forms of language comprised of visual templates and text, also exhibit meaningful social variation. We construct a computational pipeline to cluster individual instances of memes into templates and semantic variables, taking advantage of their multimodal structure in doing so. We apply this method to a large collection of meme images from Reddit and make available the resulting \textsc{SemanticMemes} dataset of 3.8M images clustered by their semantic function. We use these clusters to analyze linguistic variation in memes, discovering not only that socially meaningful variation in meme usage exists between subreddits, but that patterns of meme innovation and acculturation within these communities align with previous findings on written language.
科研通智能强力驱动
Strongly Powered by AbleSci AI