Journal of South China University of Technology(Natural Science Edition) ›› 2025, Vol. 53 ›› Issue (3): 31-39.doi: 10.12141/j.issn.1000-565X.240155

• Computer Science & Technology • Previous Articles     Next Articles

Image Inpainting Algorithm Based on Hybrid Encoding and Mask Space Modulation

XIAN Jin1, XU Xiaoru2, XIAN Yunting1, XIAN Chuhua1   

  1. 1.School of Computer Science and Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China
    2.School of Law and Humanities,Jiangsu Vocational College of Finance and Economics,Huaian 223003,Jiangsu,China
  • Received:2024-04-03 Online:2025-03-10 Published:2024-10-11
  • Contact: XU Xiaoru
  • Supported by:
    the Key-Areas R & D Program of Guangdong Province(2022B0101070001)

Abstract:

Image inpainting refers to the process of filling in missing regions of an image with plausible content, which is one of the significant issues in the fields of computer vision and image processing research. Current research on image inpainting algorithms has made substantial progress. However, when dealing with complex images with large missing areas, existing algorithms still face challenges in generating high-quality complete images due to the lack of effective network structures to capture long-range dependencies and high-level semantic information in the images. To address the issue of large-scale missing image inpainting, this paper proposed an image inpainting algorithm based on hybrid encoding and mask spatial modulation. The aim is to expand the limited receptive field of image inpainting networks, effectively obtain global information from the visible regions of the image, and fully utilize the effective information from the visible regions. Firstly, a hybrid encoding network was used to extract local and global information features from the visible regions of the image. Then, a mask spatial modulation module dynamically adjusted the diversity in generating missing regions based on the size of the missing area. Finally, a method based on StyleGAN2 was used to generate complete images. Experimental results show that the proposed algorithm can effectively handle images with large-scale missing areas, generating high-quality images with diversity, and can be applied to data augmentation in visual saliency models.

Key words: image inpainting, image enhancement, hybrid encoding, mask space modulation

CLC Number: