Atomic Fact Decomposition Helps Attributed Question Answering

计算机科学 分解 集合(抽象数据类型) 答疑 公制(单位) 可信赖性 过程(计算) 情报检索 自然语言处理 秩(图论) 比例(比率) 人工智能 自然语言 查询扩展 查询语言 排名(信息检索) 数据挖掘 搜索引擎 相似性(几何) 数据集 任务(项目管理) 信息抽取 Atom(片上系统) 数据集成 计算语言学 理论计算机科学 语言模型 知识抽取 归属 文献检索
作者
Zhichao Yan,Jiapu Wang,Jiaoyan Chen,Xiaoli Li,Jiye Liang,Ru Li,Jeff Z. Pan
出处
期刊:IEEE Transactions on Knowledge and Data Engineering [IEEE Computer Society]
卷期号:37 (12): 6959-6972 被引量:2
标识
DOI:10.1109/tkde.2025.3608716
摘要

Attributed Question Answering (AQA) aims to provide both a trustworthy answer and a reliable attribution report for a given question. Retrieval is a widely adopted approach, including two general paradigms: Retrieval-Then-Read (RTR) and post-hoc retrieval. Recently, Large Language Models (LLMs) have shown remarkable proficiency, prompting growing interest in AQA among researchers. However, RTR-based AQA often suffers from irrelevant knowledge and rapidly changing information, even when LLMs are adopted, while post-hoc retrievalbased AQA struggles with comprehending long-form answers with complex logic, and precisely identifying the content needing revision and preserving the original intent. To tackle these problems, this paper proposes an Atomic fact decompositionbased Retrieval and Editing (ARE) framework, which decomposes the generated long-form answers into molecular clauses and atomic facts by the instruction-tuned LLMs. Notably, the instruction-tuned LLMs are fine-tuned using a well-constructed dataset, generated from large scale Knowledge Graphs (KGs). This process involves extracting one-hop neighbors from a given set of entities and transforming the result into coherent long-form text. Subsequently, ARE leverages a search engine to retrieve evidences related to atomic facts, inputting these evidences into an LLM-based verifier to determine whether the facts require expansion for re-retrieval or editing. Furthermore, the edited facts are backtracked into the original answer, with evidence aggregated based on the relationship between molecular clauses and atomic facts. Extensive evaluations demonstrate the superior performance of our proposed method over the state-of-the-arts on several datasets, with an additionally proposed new metric Attrp for evaluating the precision of evidence attribution.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
默默的惜霜完成签到,获得积分10
1秒前
1秒前
1秒前
2秒前
小林完成签到,获得积分20
2秒前
Hello应助浪而而采纳,获得10
2秒前
4秒前
科研通AI6.4应助直率豆芽采纳,获得30
5秒前
5秒前
发一篇sci发布了新的文献求助10
6秒前
称心寒松发布了新的文献求助10
6秒前
竹蜻蜓完成签到,获得积分10
6秒前
7秒前
happy发布了新的文献求助10
7秒前
1234354346完成签到,获得积分10
8秒前
只只应助yxl采纳,获得10
8秒前
neurospine完成签到,获得积分10
9秒前
9秒前
9秒前
佩格同学完成签到,获得积分10
10秒前
奇思妙想完成签到,获得积分10
11秒前
BulingBuling发布了新的文献求助10
11秒前
windli发布了新的文献求助10
11秒前
香蕉觅云应助不会看文献采纳,获得10
12秒前
13秒前
立青发布了新的文献求助10
13秒前
可爱的函函应助happy采纳,获得10
15秒前
丘比特应助智圆行方采纳,获得10
16秒前
上官若男应助鲸鱼采纳,获得10
17秒前
Z66完成签到,获得积分10
18秒前
18秒前
donk666完成签到,获得积分10
20秒前
顾君如完成签到,获得积分10
21秒前
瞬光完成签到,获得积分10
21秒前
斯文的芹菜完成签到,获得积分10
21秒前
21秒前
22秒前
一抹阳光完成签到 ,获得积分10
23秒前
haha完成签到,获得积分10
25秒前
25秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Quality by Design - An Indispensable Approach to Accelerate Biopharmaceutical Product Development 800
Pulse width control of a 3-phase inverter with non sinusoidal phase voltages 777
Signals, Systems, and Signal Processing 610
Research Methods for Applied Linguistics: A Practical Guide 600
Research Methods for Applied Linguistics 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6403304
求助须知:如何正确求助?哪些是违规求助? 8222086
关于积分的说明 17425457
捐赠科研通 5455848
什么是DOI,文献DOI怎么找? 2883301
邀请新用户注册赠送积分活动 1859531
关于科研通互助平台的介绍 1701023