计算机科学
假新闻
模型检查
语言模型
自然语言处理
人工智能
程序设计语言
互联网隐私
作者
Yangxiao Bai,Kaiqun Fu
标识
DOI:10.1109/bigdata62323.2024.10826000
摘要
The widespread dissemination of online misinformation poses significant threats to the public interest, highlighting the urgent need for effective fake news detection. In the era of Large Language Models (LLMs), the rise of AI-generated fake news has intensified this issue, making misinformation more pervasive and harder to control. While fact-checking offers a promising solution by leveraging external knowledge, efficiently linking claims within news articles to relevant external facts remains a significant challenge. To address this, we propose a mis-information detection framework FCRV (Full-Context Retrieval and Verification) that constructs a "full-context" for news articles by integrating LLM-based claim extraction with Retrieval-Augmented Generation (RAG) for fact-checking. We implemented an LLM pipeline for human-like extraction of key claims from datasets, significantly improving extraction quality over traditional methods. Our retrieval workflow effectively detects fictitious entities prevalent in AI-generated news by identifying claims lacking a basis in reality. Experiments across multiple human-generated and AI-generated datasets demonstrate that verifying news using this "full-context" approach leads to more stable and robust fake news detection, enhancing scalability, accuracy, and the model’s ability to handle AI-generated content.
科研通智能强力驱动
Strongly Powered by AbleSci AI