计算机科学
模棱两可
利用
情态动词
安全性令牌
编码器
加权
社会化媒体
模式
过程(计算)
情报检索
数据挖掘
人工智能
计算机安全
万维网
放射科
程序设计语言
高分子化学
化学
社会学
操作系统
医学
社会科学
作者
Yangming Zhou,Yuzhou Yang,Qichao Ying,Zhenxing Qian,Xinpeng Zhang
标识
DOI:10.1145/3591106.3592271
摘要
The easy sharing of multimedia content on social media has caused a rapid dissemination of fake news, which threatens society's stability and security. Therefore, fake news detection has garnered extensive research interest in the field of social forensics. Current methods primarily concentrate on the integration of textual and visual features but fail to effectively exploit multi-modal information at both fine-grained and coarse-grained levels. Furthermore, they suffer from an ambiguity problem due to a lack of correlation between modalities or a contradiction between the decisions made by each modality. To overcome these challenges, we present a Multi-grained Multi-modal Fusion Network (MMFN) for fake news detection. Inspired by the multi-grained process of human assessment of news authenticity, we respectively employ two Transformer-based pre-trained models to encode token-level features from text and images. The multi-modal module fuses fine-grained features, taking into account coarse-grained features encoded by the CLIP encoder. To address the ambiguity problem, we design uni-modal branches with similarity-based weighting to adaptively adjust the use of multi-modal features. Experimental results demonstrate that the proposed framework outperforms state-of-the-art methods on three prevalent datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI