Journal of South China University of Technology(Natural Science Edition) ›› 2025, Vol. 53 ›› Issue (5): 94-108.doi: 10.12141/j.issn.1000-565X.240439

• Computer Science & Technology • Previous Articles     Next Articles

A Liver Tumor Image Segmentation Method Based on Multi-Scale Feature Fusion and Reconstruction Convolution

MA Jinlin1,2, JIU Zhiqing2, MA Ziping3, XIA Mingge2, ZHANG Kai2, CHENG Yexia4, MA Ruishi2   

  1. 1.Key Laboratory of Intelligent Information Processing of Image and Graphics,North Minzu University,Yinchuan 750021,Ningxia,China
    2.School of Computer Science and Engineering,North Minzu University,Yinchuan 750021,Ningxia,China
    3.School of Mathematics and Information Science,North Minzu University,Yinchuan 750021,Ningxia,China
    4.China Mobile Communications Corporation,Beijing 100033,China
  • Received:2024-09-02 Online:2025-05-25 Published:2024-12-02
  • Contact: 酒志青(2000—),女,硕士生,主要从事图像识别研究。 E-mail:2313795280@qq.com
  • About author:马金林(1976—),男,博士,副教授,主要从事计算机视觉、图像识别、计算机图形学等研究。E-mail: 624160@qq.com
  • Supported by:
    the National Natural Science Foundation of China(62462001);the Natural Science Foundation of Ningxia Hui Autonomous Region(2024AAC03147)

Abstract:

Aiming at the problem of insufficient expression ability of liver tumor image features and limited global contextual information transmission, an improved U-Net liver tumor image segmentation method is proposed. Firstly, a low-rank reconstruction convolution is designed to optimize the large number of parameter problems caused by traditional convolution operations, and is used to construct a convolution kernel reconstruction module that uses residual structure to improve the encoder decoder, so that the encoder retains more detailed information and the decoder recovers information more effectively, thereby enhancing the expression ability of liver tumor image features. Then, to enrich the transmission of global contextual information, a three-branch spatial pyramid pooling module is designed to optimize the bottleneck structure of information transmission and to break the limitation of a single path. Secondly, a multi-scale feature fusion module is designed to optimize the reuse mechanism of encoder information, enhance the modeling ability of the model for global contextual information, and improve its efficiency in extracting liver tumor image features in different scales. Finally, the performance of the proposed method is tested on LiTS2017 and 3DIRCADb datasets. Experimental results show that the method achieves a Dice coefficient and an IoU value of 97.56% and 95.25% in the liver image segmentation task on LiTS2017 dataset, and 89.71% and 81.58% in the liver tumor image segmentation task. Moreover, the Dice coefficient and IoU value in the liver image segmentation task on 3DIRCADb dataset respectively reach 97.63% and 95.39%, while respectively reach 89.62% and 81.63% in the liver tumor image segmentation task.

Key words: liver tumor image segmentation, convolutional kernel reconstruction, spatial pyramid pooling, multi-scale feature fusion

CLC Number: