DelphiAgent: A trustworthy multi-agent verification framework for automated fact verification

Trustworthiness, Computer science, Computer security
Authors
Cheng Xiong, Ge Zheng, Xiao Ma, Chunlin Li, Jiangfeng Zeng
Source
Journal: Information Processing and Management [Elsevier BV]
Volume (issue): 62 (6): 104241-104241 Citations: 9
Identifier
DOI: 10.1016/j.ipm.2025.104241
Abstract

Large Language Models (LLMs) have been investigated for many reasoning-intensive tasks, including fact verification, and have exhibited outstanding performance by coupling their internal knowledge with external knowledge. However, non-agentic LLM-based methods produce responses from direct prompts in a one-off manner and therefore suffer from factuality problems and hallucinations. In this paper, we propose DelphiAgent, an agentic framework for trustworthy fact-checking that employs multiple LLMs to emulate the workflow of the Delphi method, aiming to enhance transparency in the decision-making procedure and to mitigate hallucinations when generating justifications. This is implemented through a dual-system framework that integrates an evidence mining module and a Delphi decision-making module. The evidence mining module extracts evidence from raw, uncensored reports and refines it, providing instructive rationales for the subsequent module. The decision-making module, drawing inspiration from the Delphi method, sets up multiple LLM-based agents with distinct personalities that each make a factuality judgment based on the claim and its verified evidence, and then reach a consensus through multiple rounds of feedback and synthesis. Experimental findings on two challenging datasets indicate that DelphiAgent not only surpasses current LLM-based approaches but is also on par with state-of-the-art LLM-enhanced supervised baselines without requiring any training, with macro-F1 (macF1) improvements of up to 6.84% on RAWFC and comparable performance on LIAR-RAW. Furthermore, the justifications generated throughout the workflow underscore the trustworthiness of the proposed framework. The official implementation of this paper is available at https://github.com/zjfgh2015/DelphiAgent.

Highlights
• Non-agentic LLMs suffer from challenges in factuality and hallucinations.
• DelphiAgent employs multiple LLMs to emulate the Delphi method.
• DelphiAgent enhances transparency and mitigates hallucinations.
• DelphiAgent competes with SOTA baselines without requiring any training.
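The multi-round consensus idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation (see the linked repository for that). Everything here is an assumption made for illustration: the `Agent` class, the `delphi_consensus` function, the label strings, the unanimity threshold, the round limit, and the plurality fallback are all hypothetical, and the agents are plain functions standing in for LLM calls.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

# Anonymised tally of the previous round, e.g. {"true": 2, "false": 1}.
Feedback = Optional[Dict[str, int]]
# A judge maps (claim, evidence, feedback) to a label.
# Hypothetical stand-in for an LLM call with a personality prompt.
JudgeFn = Callable[[str, List[str], Feedback], str]


@dataclass
class Agent:
    name: str       # illustrative stand-in for an agent "personality"
    judge: JudgeFn


def delphi_consensus(
    claim: str,
    evidence: List[str],
    agents: List[Agent],
    max_rounds: int = 3,       # assumed round limit
    threshold: float = 1.0,    # assumed: 1.0 means unanimity is required
) -> Tuple[str, List[Dict[str, str]]]:
    """Run Delphi-style rounds; return the final label and per-round votes."""
    if not agents or max_rounds < 1:
        raise ValueError("need at least one agent and one round")
    feedback: Feedback = None
    history: List[Dict[str, str]] = []
    label = ""
    for _ in range(max_rounds):
        # Each agent judges independently, seeing only the previous tally.
        votes = {a.name: a.judge(claim, evidence, feedback) for a in agents}
        history.append(votes)
        tally = Counter(votes.values())
        label, count = tally.most_common(1)[0]
        if count / len(agents) >= threshold:
            break                  # consensus reached
        feedback = dict(tally)     # synthesised feedback for the next round
    # If no consensus was reached, `label` is the last round's plurality label.
    return label, history


def fixed(label: str) -> JudgeFn:
    """Toy agent that always answers `label`."""
    return lambda claim, evidence, feedback: label


def conformist(initial: str) -> JudgeFn:
    """Toy agent that starts with `initial`, then follows the plurality."""
    def judge(claim: str, evidence: List[str], feedback: Feedback) -> str:
        if feedback:
            return max(feedback, key=feedback.get)
        return initial
    return judge


if __name__ == "__main__":
    panel = [
        Agent("analyst", fixed("true")),
        Agent("reviewer", fixed("true")),
        Agent("skeptic", conformist("false")),
    ]
    verdict, rounds = delphi_consensus("example claim", ["example evidence"], panel)
    print(verdict, len(rounds))
```

In this toy run the skeptic disagrees in the first round, sees the anonymised tally, and joins the majority in the second. A real system would replace the toy judges with prompted LLM calls and would likely pass textual justifications, not just vote counts, between rounds.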