Weapon–Target Assignment by Reinforcement Learning with Pointer Network

航空航天 指针(用户界面) 图书馆学 航空学 运筹学 工程类 计算机科学 人工智能 航空航天工程
作者
Hyungho Na,Jaemyung Ahn,Il‐Chul Moon
出处
期刊:Journal of aerospace information systems [American Institute of Aeronautics and Astronautics]
卷期号:20 (1): 53-59 被引量:8
标识
DOI:10.2514/1.i011150
摘要

No AccessTechnical NotesWeapon–Target Assignment by Reinforcement Learning with Pointer NetworkHyungho Na, Jaemyung Ahn and Il-Chul MoonHyungho Na https://orcid.org/0000-0002-7687-2513Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea, Jaemyung Ahn https://orcid.org/0000-0003-4971-5130Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea and Il-Chul Moon https://orcid.org/0000-0002-1798-1306Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of KoreaPublished Online:27 Nov 2022https://doi.org/10.2514/1.I011150SectionsRead Now ToolsAdd to favoritesDownload citationTrack citations ShareShare onFacebookTwitterLinked InRedditEmail About References [1] Ponda S. S., Johnson L. B., Geramifard A. and How J. P., "Cooperative Mission Planning for Multi-UAV Teams," Handbook of Unmanned Aerial Vehicles, Vol. 2, Aug. 2015, pp. 1447–1490. https://doi.org/10.1007/978-90-481-9707-1_16 CrossrefGoogle Scholar[2] Johnson L. B., Choi H.-L., Ponda S. S. and How J. P., "Decentralized Task Allocation Using Local Information Consistency Assumptions," Journal of Aerospace Information Systems, Vol. 14, No. 2, 2017, pp. 103–122. https://doi.org/10.2514/1.I010461 LinkGoogle Scholar[3] Wang Z., Delahaye D., Farges J.-L. and Alam S., "Air Traffic Assignment for Intensive Urban Air Mobility Operations," Journal of Aerospace Information Systems, Vol. 18, No. 11, 2021, pp. 860–875. https://doi.org/10.2514/1.I010954 LinkGoogle Scholar[4] Sheu J.-B., "A Novel Dynamic Resource Allocation Model for Demand-Responsive City Logistics Distribution Operations," Transportation Research Part E: Logistics and Transportation Review, Vol. 42, No. 6, 2006, pp. 445–472. https://doi.org/10.1016/j.tre.2005.05.004 CrossrefGoogle Scholar[5] Krichman M., Ghose D., Speyer J. L. and Shamma J. S., "Theater Level Campaign Resource Allocation," Proceedings of the 2001 American Control Conference (Cat. No. 01CH37148), Inst. of Electrical and Electronics Engineers, New York, Vol. 6, 2001, pp. 4716–4721. https://doi.org/10.1109/acc.2001.945727 Google Scholar[6] Ahuja R. K., Kumar A., Jha K. C. and Orlin J. B., "Exact and Heuristic Algorithms for the Weapon-Target Assignment Problem," Operations Research, Vol. 55, No. 6, 2007, pp. 1136–1146. https://doi.org/10.1287/opre.1070.0440 CrossrefGoogle Scholar[7] Lloyd S. P. and Witsenhausen H. S., "Weapons Allocation is NP-Complete," 1986 Summer Computer Simulation Conference, Soc. for Modelling and Simulation International (SCS), San Diego, CA, 1986, pp. 1054–1058. Google Scholar[8] Lee Z.-J., Lee C.-Y. and Su S.-F., "An Immunity-Based Ant Colony Optimization Algorithm for Solving Weapon–Target Assignment Problem," Applied Soft Computing, Vol. 2, No. 1, 2002, pp. 39–47. https://doi.org/10.1016/S1568-4946(02)00027-3 CrossrefGoogle Scholar[9] Bo Z., Feng-xing Z. and Jia-hua W., "A Novel Approach to Solving Weapon-Target Assignment Problem Based on Hybrid Particle Swarm Optimization Algorithm," Proceedings of 2011 International Conference on Electronic & Mechanical Engineering and Information Technology, Inst. of Electrical and Electronics Engineers, New York, Vol. 3, 2011, pp. 1385–1387. https://doi.org/10.1109/EMEIT.2011.6023352 Google Scholar[10] Wang S. and Chen W., "Solving Weapon-Target Assignment Problems by Cultural Particle Swarm Optimization," 2012 4th International Conference on Intelligent Human-Machine Systems and Cybernetics, Inst. of Electrical and Electronics Engineers, New York, Vol. 1, 2012, pp. 141–144. https://doi.org/10.1109/IHMSC.2012.41 Google Scholar[11] Lee Z.-J., Su S.-F. and Lee C.-Y., "Efficiently Solving General Weapon-Target Assignment Problem by Genetic Algorithms with Greedy Eugenics," IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol. 33, No. 1, 2003, pp. 113–121. https://doi.org/10.1109/TSMCB.2003.808174 CrossrefGoogle Scholar[12] Cho D.-H. and Choi H.-L., "Greedy Maximization for Asset-Based Weapon-Target Assignment with Timedependent Rewards," Cooperative Control of Multi-Agent Systems: Theory and Applications, Wiley, Hoboken, NJ, 2017, pp. 115–139. https://doi.org/10.1002/9781119266235.ch5 CrossrefGoogle Scholar[13] Shin M.-K., Lee D. and Choi H.-L., "Weapon-Target Assignment Problem with Interference Constraints Using Mixed-Integer Linear Programming," arXiv preprint arXiv:1911.12567, 2019. https://doi.org/10.48550/arXiv.1911.12567 Google Scholar[14] Lee D., Shin M. K. and Choi H.-L., "Weapon Target Assignment Problem with Interference Constraints," AIAA Scitech 2020 Forum, AIAA Paper 2020-0388, 2020. https://doi.org/10.2514/6.2020-0388 Google Scholar[15] Bello I., Pham H., Le Q. V., Norouzi M. and Bengio S., "Neural Combinatorial Optimization with Reinforcement Learning," arXiv preprint arXiv:1611.09940, 2016. https://doi.org/10.48550/arXiv.1611.09940 Google Scholar[16] Nazari M., Oroojlooy A., Snyder L. and Takac M., "Reinforcement Learning for Solving the Vehicle Routing Problem," Advances in Neural Information Processing Systems, Vol. 31, Curran Associates, Red Hook, NY, Dec. 2018, pp. 9861–9871. https://doi.org/10.48550/arXiv.1802.04240 Google Scholar[17] Grondman I., Busoniu L., Lopes G. A. and Babuska R., "A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients," IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), Vol. 42, No. 6, 2012, pp. 1291–1307. https://doi.org/10.1109/TSMCC.2012.2218595 CrossrefGoogle Scholar[18] Vinyals O., Fortunato M. and Jaitly N., "Pointer Networks," Advances in Neural Information Processing Systems, Vol. 28, Curran Associates, Red Hook, NY, Dec. 2015, pp. 2692–2700. https://doi.org/10.48550/arXiv.1506.03134 Google Scholar[19] Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A. N., Kaiser Ł. and Polosukhin I., "Attention is All You Need," Advances in Neural Information Processing Systems, Vol. 30, Curran Assoc., Red Hook, NY, Dec. 2017, pp. 6000–6010. https://doi.org/10.48550/arXiv.1706.03762 Google Scholar[20] Bello I., Zoph B., Vaswani A., Shlens J. and Le Q. V., "Attention Augmented Convolutional Networks," Proceedings of the IEEE/CVF International Conference on Computer Vision, Oct. 2019, pp. 3286–3295. https://doi.org/10.48550/arXiv.1904.09925 Google Scholar[21] Kool W., van Hoof H. and Welling M., "Attention, Learn to Solve Routing Problems!" International Conference on Learning Representations, May 2019. https://doi.org/10.48550/arXiv.1803.08475 Google Scholar[22] Sutskever I., Vinyals O. and Le Q. V., "Sequence to Sequence Learning with Neural Networks," Advances in Neural Information Processing Systems, Vol. 27, Dec. 2014, pp. 3104–3112. https://doi.org/10.48550/arXiv.1409.3215 Google Scholar[23] Sutton R. S. and Barto A. G., Reinforcement Learning: An Introduction, MIT Press, Cambridge, MA, 2018, pp. 329–332, Chap. 13. https://doi.org/0.1109/TNN.1998.712192 Google Scholar Previous article FiguresReferencesRelatedDetails What's Popular Volume 20, Number 1January 2023 Metrics CrossmarkInformationCopyright © 2022 by the American Institute of Aeronautics and Astronautics, Inc. All rights reserved. All requests for copying and permission to reprint should be submitted to CCC at www.copyright.com; employ the eISSN 2327-3097 to initiate your request. See also AIAA Rights and Permissions www.aiaa.org/randp. TopicsAlgorithms and Data StructuresArtificial IntelligenceArtificial Neural NetworkComputing SystemComputing and InformaticsComputing, Information, and CommunicationControl SystemsData ScienceEvolutionary AlgorithmGenetic AlgorithmGuidance, Navigation, and Control SystemsMachine LearningMilitary ScienceMilitary TechnologyMissile Systems, Dynamics and TechnologyRoboticsRobotics SystemsWeapon Systems KeywordsWeapon Target AssignmentReinforcement LearningMarkov Decision ProcessArtificial Neural NetworkProbability DistributionParticle Swarm OptimizationStochastic Gradient DescentGenetic AlgorithmMixed Integer Linear ProgrammingMulti Agent SystemAcknowledgmentThis work was supported by Theater Defense Research Center funded by Defense Acquisition Program Administration under Grant UD200043CD.PDF Received17 May 2022Accepted7 November 2022Published online27 November 2022
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Yina完成签到 ,获得积分10
1秒前
happy发布了新的文献求助10
3秒前
holi完成签到 ,获得积分10
4秒前
和谐的冬莲完成签到 ,获得积分10
6秒前
phoenix001完成签到,获得积分10
6秒前
HEIKU应助啦啦啦啦采纳,获得10
7秒前
jenningseastera应助啦啦啦啦采纳,获得10
7秒前
Superman完成签到 ,获得积分10
9秒前
皮哈哈完成签到,获得积分10
11秒前
啦啦啦啦完成签到,获得积分10
14秒前
年轻千愁完成签到 ,获得积分10
19秒前
春景当思完成签到,获得积分10
21秒前
Hello应助快乐慕灵采纳,获得10
21秒前
满意外套完成签到 ,获得积分10
22秒前
Oracle应助虞无声采纳,获得50
28秒前
28秒前
满天星辰独览完成签到 ,获得积分10
31秒前
贝贝贝完成签到,获得积分10
31秒前
gao发布了新的文献求助10
32秒前
YC完成签到,获得积分10
33秒前
happy完成签到,获得积分10
33秒前
xybjt完成签到 ,获得积分10
37秒前
slx0410完成签到,获得积分10
41秒前
开心就吃猕猴桃完成签到,获得积分10
41秒前
42秒前
流浪完成签到,获得积分10
43秒前
李梦瑾完成签到,获得积分10
43秒前
砳熠完成签到 ,获得积分10
44秒前
蔡翌文完成签到 ,获得积分10
45秒前
ning_qing完成签到 ,获得积分10
46秒前
致远发布了新的文献求助10
48秒前
酷波er应助科研通管家采纳,获得10
50秒前
英俊的铭应助科研通管家采纳,获得10
50秒前
典雅三颜完成签到 ,获得积分10
53秒前
科研通AI5应助虚心念桃采纳,获得10
54秒前
烟花应助Smiles采纳,获得10
59秒前
59秒前
1分钟前
液晶屏99发布了新的文献求助10
1分钟前
ALLon完成签到 ,获得积分10
1分钟前
高分求助中
【此为提示信息,请勿应助】请按要求发布求助,避免被关 20000
Continuum Thermodynamics and Material Modelling 2000
Encyclopedia of Geology (2nd Edition) 2000
105th Edition CRC Handbook of Chemistry and Physics 1600
Maneuvering of a Damaged Navy Combatant 650
Периодизация спортивной тренировки. Общая теория и её практическое применение 310
Mixing the elements of mass customisation 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3779327
求助须知:如何正确求助?哪些是违规求助? 3324815
关于积分的说明 10220149
捐赠科研通 3039982
什么是DOI,文献DOI怎么找? 1668528
邀请新用户注册赠送积分活动 798717
科研通“疑难数据库(出版商)”最低求助积分说明 758503