Training reinforcement learning-based controller using performance simulation of the laser remelting process

计算机科学强化学习过程（计算）培训（气象学）控制器（灌溉）人工智能模拟机器学习操作系统农学物理生物气象学

作者

Honghe Wu,Evgueni V. Bordatchev,O. Remus Tutunea‐Fatan

出处

期刊：Procedia Computer Science [Elsevier]
日期：2024-01-01 卷期号：232: 1849-1858 被引量：2

标识

DOI：10.1016/j.procs.2024.02.007

摘要

Surface topography modification is a common post-processing step used to enhance surface quality or to add micro/nano surface structures for various functionalities. Laser remelting (LRM) is a new technology aimed to resolve some of the shortcomings of traditional post-processing technologies. In LRM, a high-energy laser beam melts the top surface and offers control of the molten material flow through the control of laser power, scanning speed, melt pool size, and scan trajectory. By controlling the molten material flow, surface polishing by laser remelting (SP-LRM) can be used to produce finishes characterized by areal arithmetical mean height - Sa < 0.1 µm as well as surface microstructures. LRM is faster than traditional methods, creates no harmful byproducts, and allows for polishing high aspect ratio freeform surfaces without harming surrounding areas. The primary limiting factor for wide-rage industrial use of the LRM is represented by thermodynamic process instabilities that are associated with rapid melting and solidification. For instance, deep trenches or grinding marks in the initial surface can cause small deviations of local surface absorptivity to lead to laser melt pool instabilities. The high-speed nature of the LRM process combined with the complex thermodynamic phenomena makes process instabilities difficult to model, detect, predict, and control. This study aimed to use reinforcement learning (RL) in order to build a 'self-learning' controller capable of adapting to any material and control instabilities on-line and in real-time. Along these lines, the study will detail a preliminary application of two RL implementations involving SP-LRM performance simulation to demonstrate the implementation of a double dueling deep Q-network (DDDQN) controller for melt pool temperature distribution control. The results obtained showed that the 3-action implementation achieved 48% less variance and 35% increase in the mean value of the total reward collected compared with the 31-action implementation.

求助该文献

最长约 10秒，即可获得该文献文件

Training reinforcement learning-based controller using performance simulation of the laser remelting process

今日热心研友