计算机科学
元学习(计算机科学)
初始化
人工智能
特征(语言学)
任务(项目管理)
特征学习
机器学习
财产(哲学)
断层(地质)
干扰(通信)
模式识别(心理学)
学习迁移
代表(政治)
频道(广播)
政治
政治学
法学
计算机网络
语言学
哲学
管理
认识论
地震学
地质学
经济
程序设计语言
作者
Zhiqian Zhao,Runchao Zhao,Xiaofeng Wu,X.S. Hu,Renwei Che,Xiang Zhang,Yinghou Jiao
出处
期刊:Neurocomputing
[Elsevier]
日期:2023-10-01
卷期号:552: 126551-126551
被引量:2
标识
DOI:10.1016/j.neucom.2023.126551
摘要
Considering the changing working conditions of rotating machinery in operation, it is often difficult to collect data accurately in some severe fault states, and the lack of data can lead to poor performance of deep learning-based fault diagnosis models. In the few-shot scenario, traditional meta-learning methods yield superior predictions by identifying ideal initialization parameters. When the update gradients of tasks are in different directions, meta-learning modifies the weights of samples in the current tasks to average them out, which may lead to negative transfer between tasks and poor generalization of biases. Based on the above problems, we propose a meta-learning network with anti-interference (AIML) for few-shot fault diagnosis, which is obtained by combining a dynamic fine-tuning technique to increase the gradient agreement of the tasks. AIML consists of two functions: the feature encoding network (FEN) and the base network. AIML uses the property of shared parameters about transfer learning to learn shared feature representations of different tasks, while FEN dynamically adjusts the loss weights of conflicting tasks due to meta-learning with two-level optimization techniques. First, in the internal loop, AIML updates the gradient of the base network while the network parameters of FEN are fixed. Instead of immediately computing the gradient of the optimal parameters for the new task, the FEN is then integrated in the outer loop to learn the meta-representation, and the weights obtained for the various tasks relate to the gradient for meta-optimization. Only during the outer loop stage of the meta-training is the FEN updated. Three publicly available datasets are used to assess the performance of AIML, and the results show that it is more effective at resolving problems involving few-shot fault diagnosis.
科研通智能强力驱动
Strongly Powered by AbleSci AI