Weapon–Target Assignment by Reinforcement Learning with Pointer Network

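Aerospace; Pointer (user interface); Library science; Aeronautics; Operations research; Engineering; Computer science; Artificial intelligence; Aerospace engineering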
Authors
Hyungho Na, Jaemyung Ahn, Il-Chul Moon
Source
Journal: Journal of Aerospace Information Systems [American Institute of Aeronautics and Astronautics]
Volume (issue): 20 (1): 53-59. Citations: 18
Identifier
DOI:10.2514/1.i011150
Abstract

Technical Note: Weapon–Target Assignment by Reinforcement Learning with Pointer Network
Hyungho Na (https://orcid.org/0000-0002-7687-2513), Jaemyung Ahn (https://orcid.org/0000-0003-4971-5130), and Il-Chul Moon (https://orcid.org/0000-0002-1798-1306)
Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
Published online: 27 Nov 2022. https://doi.org/10.2514/1.I011150
Volume 20, Number 1, January 2023
Copyright © 2022 by the American Institute of Aeronautics and Astronautics, Inc. All rights reserved. All requests for copying and permission to reprint should be submitted to CCC at www.copyright.com; employ the eISSN 2327-3097 to initiate your request. See also AIAA Rights and Permissions www.aiaa.org/randp.
Topics: Algorithms and Data Structures; Artificial Intelligence; Artificial Neural Network; Computing System; Computing and Informatics; Computing, Information, and Communication; Control Systems; Data Science; Evolutionary Algorithm; Genetic Algorithm; Guidance, Navigation, and Control Systems; Machine Learning; Military Science; Military Technology; Missile Systems, Dynamics and Technology; Robotics; Robotics Systems; Weapon Systems
Keywords: Weapon Target Assignment; Reinforcement Learning; Markov Decision Process; Artificial Neural Network; Probability Distribution; Particle Swarm Optimization; Stochastic Gradient Descent; Genetic Algorithm; Mixed Integer Linear Programming; Multi Agent System
Acknowledgment: This work was supported by Theater Defense Research Center funded by Defense Acquisition Program Administration under Grant UD200043CD.
Received 17 May 2022. Accepted 7 November 2022. Published online 27 November 2022.