Mechanical Engineering

Small-Sample Fault Diagnosis Method Based on Multi-Head Convolution and Differential Self-Attention

Expand
  • 1.School of Mechanical and Electrical Engineering,Guangdong University of Technology,Guangzhou 510006,Guangdong,China
    2.Intelligent Manufacturing Research Institute,Guangdong Academy of Sciences,Guangzhou 510030,Guangdong,China
    3.Guangdong Provincial Key Laboratory of Modern Control Technology,Guangzhou 510030,Guangdong,China
陈新度(1967-),男,博士,教授,博士生导师,主要从事智能装备、制造系统建模以及仿真优化等研究。

Received date: 2022-09-26

  Online published: 2023-02-20

Supported by

the Guangdong Province Key Field R&D Program(2019B090917004)

Abstract

Bearing is one of the most widely used rotating parts in industrial equipment. If the bearing runs in fault condition for a long time, it will cause huge economic loss and threaten personal safety, so that the investigation of bearing fault diagnosis is of great significance. Fault diagnosis technology based on deep learning is becoming more and more mature, but there are problems such as over-fitting, unstable effect and low accuracy in the case of small samples. In order to solve these problems, this paper proposes a Transformer variant model MDT (Multi-Head Convolution and Differential Self-Attention Transformer) to realize end-to-end few-shot fault diagnosis. This model combines the new data embedding algorithm of MC (Multi-Head Convolution) and the DSA (Differential Self-Attention) mechanism. The MC algorithm performs multi-path one-dimension convolution on the sample, extends the sample from one dimension to two dimensions by multi-channel output, and extracts rich fault information in each frequency domain in the original sample through multiple convolution kernel sizes. As compared with the original dot product self-attention in Transformer, the DSA mechanism obtains the corresponding attention weight vector for each feature through the difference, so as to extract deeper fault features from the sample. MDT inherits the powerful ability of Transformer to process sequence data, which can extract richer fault information from time-domain signals and avoid the overfitting problem common in small-sample models. Experimental results show that the proposed method can stably obtain more than 99% test accuracy in the bearing fault diagnosis task with only 100 training samples per fault type, and has strong anti-overfitting ability and strong robustness.

Cite this article

CHEN Xindu, FU Zhisen, WU Zhiheng, et al. . Small-Sample Fault Diagnosis Method Based on Multi-Head Convolution and Differential Self-Attention[J]. Journal of South China University of Technology(Natural Science), 2023 , 51(7) : 21 -33 . DOI: 10.12141/j.issn.1000-565X.220626

References

1 赵玉成,陈荣华,马占国 .旋转机械动力辨识与故障诊断技术[M].徐州:中国矿业大学出版社,2008
2 SHAO H, JIANG H, ZHANG X,et al .Rolling bearing fault diagnosis using an optimization deep belief network[J].Measurement Science and Technology201526(11):115002.
3 GOODFELLOW I, POUGET-ABADIE J, MIRZA M,et al .Generative adversarial networks[J].Communications of the ACM202063(11):139-144.
4 RADFORD A, METZ L, CHINTALA S .Unsupervised representation learning with deep convolutional generative adversarial networks[DB/OL].(2016-01-17)[2022-08-04]..
5 GULRAJANI I, AHMED F, ARJOVSKY M,et al .Improved training of Wasserstein GANs[C]∥NIPS’17:Proceedings of the 31st International Conference on Neural Information Processing Systems,2017.[S. l.]:[s. n.],2017:5767-5777.
6 SHEN S, JIN G, GAO K,et al .AE-GAN:adversarial eliminating with GAN[DB/OL].(2017-09-26)[2022-08-04]..
7 MIRZA M, OSINDERO S .Conditional generative adversarial nets[DB/OL].(2014-11-06)[2022-08-04]..
8 SAUFI S R, AHMAD Z A B, LEONG M S,et al .Gearbox fault diagnosis using a deep learning model with limited data sample[J].IEEE Transactions on Industrial Informatics202016(10):6263-6271.
9 LI X, ZHANG W, DING Q .Understanding and improving deep learning-based rolling bearing fault diagnosis with attention mechanism[J].Signal Processing2019161:136-154.
10 ZHANG X, HE C, LU Y,et al .Fault diagnosis for small samples based on attention mechanism[J].Measurement2022187:110242.
11 XIE Z, CHEN J, FENG Y,et al .End to end multi-task learning with attention for multi-objective fault diagnosis under small sample[J].Journal of Manufacturing Systems202262:301-316.
12 VASWANI A, SHAZEER N, PARMAR N,et al .Attention is all you need[C]∥ Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017).Long Beach:[s. n.],2017
13 SHAZEER N, MIRHOSEINI A, MAZIARZ K,et al .Outrageously large neural networks:the sparsely-gated mixture-of-experts layer[DB/OL].(2017-03-04)[2022-08-04]..
14 DING Y, JIA M, MIAO Q,et al .A novel time-frequency Transformer based on self-attention mechanism and its application in fault diagnosis of rolling bearings[J].Mechanical Systems and Signal Processing2022168:108616.
15 郑英,金淼,张洪,等 .一种基于一维多路卷积神经网络的故障分类方法:CN110033021A[P].2019-07-19.
16 DAUBECHIES I, LU J, WU H T .Synchrosqueezed wavelet transforms:an empirical mode decomposition-like tool[J].Applied and Computational Harmonic Analysis201130(2):243-261.
17 HUANG N E, SHEN Z, LONG S R,et al .The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis[J].Proceedings of the Royal Society of London. Series A:Mathematical,Physical and Engineering Sciences1998454(1971):903-995.
Outlines

/