计算机科学
水准点(测量)
人工智能
一般化
构造(python库)
机器学习
可转让性
资源(消歧)
深度学习
自然语言处理
判别式
标杆管理
特征(语言学)
卷积神经网络
实证研究
主流
领域(数学分析)
大数据
基线(sea)
透视图(图形)
作者
Xiaorong He,Wenhao You
出处
期刊:International Journal of Intelligent Computing and Cybernetics
[Emerald (MCB UP)]
日期:2025-12-31
卷期号:: 1-17
标识
DOI:10.1108/ijicc-10-2025-0655
摘要
Purpose To address the lack of Chinese benchmark resources for detecting AI-generated (AIGC) multimodal fake reviews, we construct and validate a large-scale text–image dataset and benchmark to support platform governance and consumer protection. Design/methodology/approach Authentic restaurant reviews were curated and paired with synthetic deceptive counterparts generated by a large language–vision model, yielding a balanced Chinese multimodal dataset of over 20,000 text–image samples. Mainstream unimodal (text-only, image-only) and multimodal pre-trained models were evaluated under a unified protocol. We further conducted generalization tests via information-perturbation stress tests and cross-lingual transfer scenarios, and compared fusion strategies (early, intermediate, deep). Findings Multimodal models employing deep fusion consistently outperform unimodal and shallow-fusion baselines in accuracy and robustness. They retain superior performance under feature perturbations and demonstrate stronger transferability across languages, confirming the benefit of jointly leveraging complementary textual and visual cues for fake-review detection. Originality/value This work presents, to our knowledge, the first Chinese multimodal AIGC fake review dataset accompanied by a comprehensive benchmark. It provides an open, reproducible resource and empirical evidence that deep multimodal fusion substantially improves detection effectiveness and robustness, offering practical guidance for future research and real-world deployment.
科研通智能强力驱动
Strongly Powered by AbleSci AI