序列(生物学)
表(数据库)
马尔可夫决策过程
过程(计算)
约束(计算机辅助设计)
马尔可夫链
算法
动作(物理)
价值(数学)
工程类
国家(计算机科学)
计算机科学
功能(生物学)
基质(化学分析)
人工智能
马尔可夫过程
机器学习
数据挖掘
数学
物理
材料科学
复合材料
进化生物学
操作系统
统计
遗传学
量子力学
生物
机械工程
作者
Zepeng Chen,Lin Li,Fu Zhao,John W. Sutherland,Fengfu Yin
出处
期刊:Procedia CIRP
[Elsevier]
日期:2023-01-01
卷期号:116: 684-689
被引量:19
标识
DOI:10.1016/j.procir.2023.02.115
摘要
Owing to increasing environmental concerns, disassembly has become an important step in value recovery from end-of-life (EoL) products. As one of the prevalent electronic devices, smartphones are especially challenging to manage at the end of life. They have a diverse internal structure that leads to difficulties in modeling and developing general plans for disassembly. Given this background, an improved method that uses a Q-learning algorithm is proposed to optimize the disassembly sequence of EoL smartphones. A constraint relationship is first developed of EoL smartphone parts. The disassembly sequence planning problem is then formalized with a Markov Decision Process. The optimization objective is the disassembly time to obtain the target parts. The disassembly time and the target parts are converted into a reward value for the Q-learning algorithm. A State-Action-Reward Matrix is established based on converted reward value and disassembly sequence planning problem. A Q-table is trained to find the best action for each state using the State-Action-Reward Matrix. The disassembly sequence with the maximum reward value is secured through the trained Q-table and objective function. A case study of the 'Xiaomi 5' is conducted to verify the applicability of the proposed method. The experimental results demonstrate that the proposed method can provide feasible sequence planning for targeted parts in the disassembly of EoL smartphones.
科研通智能强力驱动
Strongly Powered by AbleSci AI