计算机科学
选择(遗传算法)
样品(材料)
人工智能
机器学习
理论(学习稳定性)
马尔可夫决策过程
强化学习
集合(抽象数据类型)
过程(计算)
最优化问题
模式识别(心理学)
马尔可夫过程
数学
算法
统计
化学
色谱法
程序设计语言
操作系统
作者
Saite Fan,Xinmin Zhang,Zhihuan Song
标识
DOI:10.1109/tii.2021.3100284
摘要
An imbalanced number of faulty and normal samples causes serious damage to the performance of the conventional diagnosis methods. To settle the data-imbalance fault diagnosis problem, this article presents a novel general imbalanced sample selection strategy (DiagSelect) based on deep reinforcement learning. In DiagSelect, the problem of imbalanced sample selection from the training set is formulated as a multiarmed bandit problem of deep reinforcement learning. The nondifferentiable optimization problem of imbalanced sample selection can be solved by the Markov decision process. The parameters of DiagSelect can be optimized by REINFORCE with the feedback of the validation set. DiagSelect performs intelligent imbalanced sample selection to obtain better diagnosis performance autonomously. As a data-level technique, DiagSelect can be easily used in conjunction with the conventional diagnosis models. DiagSelect is validated in a synthetic dataset and an industrial process dataset. The results have shown the effectiveness, stability, and transferability of DiagSelect.
科研通智能强力驱动
Strongly Powered by AbleSci AI