Reinforcement learning
Shortest path problem
Path (computing)
Table (database)
Computer science
K shortest path routing
Artificial intelligence
Algorithm
Mathematical optimization
Mathematics
Theoretical computer science
Data mining
Graph
Programming language
Authors
Nitchakun Kantasewi,Sanparith Marukatat,Somying Thainimit,Manabu Okumura
Identifiers
DOI:10.1109/ictemsys.2019.8695963
Abstract
Q-learning is a popular reinforcement learning technique for solving the shortest path (STP) problem. In a maze with multiple sub-tasks, such as collecting treasures and avoiding traps, Q-learning has been observed to converge to the optimal path; however, the average sum of rewards obtained along that path is only moderate. This paper proposes Multi-Q-Table Q-learning to address this problem of a low average sum of rewards. The proposed method constructs a new Q-table whenever a sub-goal is reached. This modification lets the agent learn that a sub-reward has already been collected and can be obtained only once. Our experimental results show that the modified algorithm can find an optimal solution that collects all treasures (positive rewards), avoids the pit, and reaches the goal along the shortest path. For a small maze, the proposed algorithm requires more time than conventional Q-learning to reach the optimal solution.
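The abstract's core idea (a separate Q-table for each stage of collected sub-goals, so the agent's value estimates reflect which one-time rewards are already gone) can be illustrated with a minimal sketch. The sketch below keys Q-tables by the set of treasures collected so far, which is one possible reading of "constructs a new Q-table whenever a sub-goal is reached"; the maze layout, reward values, and hyperparameters are illustrative assumptions, not the paper's experimental setup.

```python
# Minimal Multi-Q-Table Q-learning sketch (assumptions, not the paper's exact setup).
import random
from collections import defaultdict

GRID = 5                       # 5x5 maze (assumption)
START, GOAL = (0, 0), (4, 4)
TREASURES = {(1, 3), (3, 1)}   # sub-goals paying a one-time reward
PIT = (2, 2)                   # stepping here ends the episode with a penalty
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]
ALPHA, GAMMA, EPS, EPISODES = 0.1, 0.95, 0.1, 5000

# One Q-table per set of already-collected treasures; a fresh table is created
# lazily the first time a new sub-goal combination is reached.
q_tables = defaultdict(lambda: defaultdict(lambda: [0.0] * len(ACTIONS)))

def step(pos, collected, a):
    """Apply action a; return next position, updated collected set, reward, done."""
    nxt = (min(max(pos[0] + ACTIONS[a][0], 0), GRID - 1),
           min(max(pos[1] + ACTIONS[a][1], 0), GRID - 1))
    reward, done = -1.0, False             # step cost encourages short paths
    if nxt == PIT:
        return nxt, collected, -100.0, True
    if nxt in TREASURES and nxt not in collected:
        collected = collected | {nxt}      # sub-reward can be obtained only once
        reward += 50.0
    if nxt == GOAL:
        reward += 100.0
        done = True
    return nxt, collected, reward, done

for _ in range(EPISODES):
    pos, collected, done = START, frozenset(), False
    while not done:
        q = q_tables[collected][pos]
        a = random.randrange(len(ACTIONS)) if random.random() < EPS \
            else max(range(len(ACTIONS)), key=lambda i: q[i])
        nxt, new_collected, r, done = step(pos, collected, a)
        # Bootstrap from the Q-table of the *next* sub-goal state, so values
        # reflect which treasures are already collected.
        target = 0.0 if done else max(q_tables[new_collected][nxt])
        q[a] += ALPHA * (r + GAMMA * target - q[a])
        pos, collected = nxt, new_collected

print(len(q_tables), "Q-tables learned (one per collected-treasure set)")
```

Keying the tables by the collected-treasure set is what lets the greedy policy differ before and after a treasure is picked up, which a single shared Q-table cannot represent.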