Enhancing the Completeness of Rationales for Multi-Step Question Answering

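Completeness (order theory), Question answering, Computer science, Information retrieval, Programming language, Mathematics, Mathematical analysis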
Authors
Shangzi Xue,Zhenya Huang,Xin Lin,Jiayu Liu,Longhu Qin,Tianhuang Su,Haifeng Liu,Qi Liu
Identifier
DOI:10.1145/3627673.3679660
Abstract

Answering complex multi-step questions requires machines to think and reason step by step as humans do, which is one of the core abilities of a question answering system. Recent advancements have revealed that large language models exhibit remarkable reasoning capabilities by generating intermediate chain-of-thought rationales. However, the completeness of these rationales is not assured, as the models are prone to omitting steps and making factual errors. In this paper, drawing inspiration from human-like reasoning processes in answering multi-step questions, we explicitly plan the rationales to ensure their completeness. We propose a two-stage Decomposition-Evaluation (Dec-Eval) framework comprising a step decomposition stage and a rationale generation stage. Specifically, in the first stage, we decompose the complex question into simpler sub-questions and simulate a human's ability to grasp logical clues to ensure the integrity of step planning. Then, in the second stage, based on the sub-questions, we generate and evaluate rationales step by step. The two stages work together, improving the completeness of the rationales and the accuracy of the answer. To further control the question answering process, we propose a novel knowledge injection mechanism that incorporates external knowledge to guide both stages. Extensive experiments on three challenging multi-step QA datasets demonstrate that Dec-Eval can explicitly generate more logical rationales and significantly improve the reasoning performance of different backbone models.
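
The two-stage flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `dec_eval_sketch`, the three callables, and the retry-on-failed-evaluation policy are all assumptions made for illustration, and the knowledge injection mechanism is omitted. In practice the callables would wrap a language model; here they are toy stand-ins so the example runs on its own.

```python
from typing import Callable, List


def dec_eval_sketch(
    question: str,
    decompose: Callable[[str], List[str]],
    generate: Callable[[str, str, List[str]], str],
    evaluate: Callable[[str, str], bool],
    max_retries: int = 2,  # assumption: the abstract does not specify a retry policy
) -> List[str]:
    """Hypothetical decompose-then-evaluate loop, loosely following the abstract."""
    # Stage 1: plan the reasoning by splitting the question into sub-questions.
    sub_questions = decompose(question)

    # Stage 2: generate one rationale step per sub-question and check each step.
    rationale: List[str] = []
    for sub_q in sub_questions:
        step = generate(question, sub_q, rationale)
        for _ in range(max_retries):
            if evaluate(sub_q, step):
                break
            # Assumed behaviour: regenerate a step that fails evaluation.
            step = generate(question, sub_q, rationale)
        rationale.append(step)
    return rationale


# Toy stand-ins for the model-backed components (purely illustrative).
def toy_decompose(question: str) -> List[str]:
    return [part.strip() for part in question.split(" and then ")]


def toy_generate(question: str, sub_q: str, previous: List[str]) -> str:
    return f"Step {len(previous) + 1}: resolve '{sub_q}'"


def toy_evaluate(sub_q: str, step: str) -> bool:
    # Accept a step only if it actually addresses its sub-question.
    return sub_q in step


steps = dec_eval_sketch(
    "find the author and then find their birthplace",
    toy_decompose,
    toy_generate,
    toy_evaluate,
)
for line in steps:
    print(line)
```

Passing the three components as callables keeps the control flow separate from any particular backbone model, which matches the abstract's claim that the framework applies to different backbones.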