环境科学与技术

基于改进EfficientNetV2的铝液泄漏声音识别与预警机制

展开
  • 1.华南理工大学 机械与汽车工程学院, 广东 广州 510640;

    2.广州现代产业技术研究院, 广东 广州 510640;

    3.人工智能与数字经济广东省实验室(广州), 广东 广州 510640

网络出版日期: 2025-07-18

Sound Recognition and Early Warning Mechanism for Liquid Aluminum Leakage Based on Improved EfficientNetV2

Expand
  • 1. School of Mechanical & Automotive Engineering, South China University of Technology, Guangzhou 510640, Guangdong, China; 2. Guangzhou Modern Industrial Technology Research Institute, Guangzhou 510640, Guangdong, China;

    3. Artificial Intelligence and Digital Economy Guangdong Provincial Laboratory (Guangzhou), Guangzhou 510640, Guangdong, China

Online published: 2025-07-18

摘要

铝液泄漏是导致铝加工深井铸造爆炸事故的直接原因。解决实际工程中铝液泄漏判断方法存在的滞后性大、准确率低、监测范围受限问题,提出了基于改进EfficientNetV2的铝液泄漏声音识别方法。该方法以铝液泄漏的声音特征作为判断依据来改善监测范围,通过简化堆叠因子与引入ECA注意力机制改进EfficientNetV2结构来提高识别速率与准确率。先利用拾音器采集声音数据,构建了7类声音场景,再从声音信号中提取对数梅尔语谱图作为特征集,输入到改进的EfficientNetV2中进行训练与验证,得到铝液泄漏声音识别模型。实验结果表明:改进的EfficientNetV2识别准确率为95.48%原始的EfficientNetV2、ResNet、RegNet和DenseNet相比,指标FLOPs分别降至上述模型的12.34%、8.64%、11.14%和10.80%,指标Params分别降至上述模型的11.37%、9.55%、15.95%和17.24%,指标FPS(CPU)分别提升为上述模型的6.53倍、6.14倍、4.41倍和8.00倍;说明改进的EfficientNetV2具有准确快速的识别性能。此外,基于声音识别方法提出了铝液泄漏风险预警机制,并将其应用于铸造单元的实时风险监测,验证了所提方法的有效性,可为铝加工深井铸造爆炸事故预防提供参考。

本文引用格式

梁艳辉, 温承杰, 闫军威, 等 . 基于改进EfficientNetV2的铝液泄漏声音识别与预警机制[J]. 华南理工大学学报(自然科学版), 0 : 1 . DOI: 10.12141/j.issn.1000-565X.250006

Abstract

Liquid aluminum leakage was the direct cause that resulted in explosion accidents in aluminum deep well-casting. In order to solve the problems that the judgment methods of liquid aluminum leakage had large lag, low accuracy and limited monitoring range in practical engineering, a sound recognition method for liquid aluminum leakage based on improved EfficientNetV2 was proposed. This method improved the monitoring range by using the sound characteristics of liquid aluminum leakage as the judgment basis, and improved the EfficientNetV2 structure by simplifying the stacking factor and introducing the ECA attention mechanism to improve the recognition rate and accuracy. First, the sound data was collected by the pickup, and seven kinds of sound scenes were constructed. Then, the logarithmic Mel-spectrogram was extracted from the sound signal as the feature set, which was input into the improved EfficientNetV2 for training and verification to obtain the sound recognition model of aluminum liquid leakage. The experimental results showed that the recognition accuracy of the improved EfficientNetV2 reached 95.48%. Compared with the original EfficientNetV2、ResNet、RegNet and DenseNet, the PLOPs were reduced to 12.34%, 8.64%, 11.14%, and 10.80% of the above models, the Params were reduced to 11.37%, 9.55%, 15.95%, and 17.24% of the above models, and FPS (CPU) was increased to 6.53 times, 6.14 times, 4.41 times, and 8.00 times of the above models. It showed that the improved EfficientNetV2 had accurate and fast recognition performance. In addition, based on the sound recognition method, an early warning mechanism for liquid aluminum leakage was proposed and applied to the real-time risk monitoring of the casting unit, which verified the effectiveness of the proposed method and provided a reference for the prevention of explosion accidents in aluminum deep well-casting.

Options
文章导航

/