华南理工大学学报(自然科学版) ›› 2023, Vol. 51 ›› Issue (9): 99-109.doi: 10.12141/j.issn.1000-565X.220420

• 计算机科学与技术 • 上一篇    下一篇

基于密集特征推理及混合损失函数的修复算法

李海燕1 尹浩林1 李鹏2 周丽萍2   

  1. 1.云南大学 信息学院,云南 昆明 650500
    2.云南大学 云南大学学报(自然科学版)编辑部,云南 昆明 650500
  • 收稿日期:2022-07-04 出版日期:2023-09-25 发布日期:2023-02-08
  • 通信作者: 李海燕(1976-),女,教授,博士生导师,主要从事人工智能、图像处理研究。 E-mail:leehy@ynu.edu.cn
  • 作者简介:李海燕(1976-),女,教授,博士生导师,主要从事人工智能、图像处理研究。
  • 基金资助:
    国家自然科学基金资助项目(62266049);云南省万人计划“云岭教学名师”(2019010015)

Image Inpainting Algorithm Based on Dense Feature Reasoning and Mix Loss Function

LI Haiyan1 YIN Haolin1 LI Peng2 ZHOU Liping2   

  1. 1.School of Information Science and Engineering,Yunnan University,Kunming 650500,Yunnan,China
    2.Editorial Department of Journal of Yunnan University(Natural Science Edition),Yunnan University,Kunming 650500,Yunnan,China
  • Received:2022-07-04 Online:2023-09-25 Published:2023-02-08
  • Contact: 李海燕(1976-),女,教授,博士生导师,主要从事人工智能、图像处理研究。 E-mail:leehy@ynu.edu.cn
  • About author:李海燕(1976-),女,教授,博士生导师,主要从事人工智能、图像处理研究。
  • Supported by:
    the National Natural Science Foundation of China(62266049)

摘要:

为有效解决现有算法修复大面积不规则缺失图像时存在特征利用率低、图像结构连贯性差的问题,提出基于密集特征推理(DFR)及混合损失函数的图像修复算法。修复网络由多个特征推理(FR)模块密集连接组成,首先将待修复图像输入第1个推理模块中进行特征推理,之后将输出特征图通道合并送入下一个推理模块,后续推理的每一个模块的输入都是来自前面所有推理模块的推理特征,如此循环,以充分利用每个推理模块捕获的特征信息;然后提出一个传播一致性注意力机制(PCA),提高修补区域与已知区域的整体一致性;最后,提出混合损失函数(ML)优化修复结果的结构连贯性。整个DFR网络使用组归一化(GN),小批量训练也可达到优异的修复效果。在国际公认的Paris StreetView巴黎街景数据集和CelebA人脸数据集上验证文中所提算法的性能,主客观的实验结果表明:所提算法能有效修复大面积不规则缺失图像,提升特征利用率与结构连贯性,其平均峰值信噪比(PSNR)、平均结构相似度(SSIM)、均方误差(MSE)、弗雷歇距离(FID)及学习感知图像块相似度(LPIPS)指标优于对比算法。

关键词: 图像修复, 密集特征推理, 注意力机制, 混合损失函数, 组归一化

Abstract:

To effectively solve the problems of low feature utilization and poor image structure coherence occurred when existing algorithms are used to repair large irregularly missing images, this study proposed an image repair algorithm based on dense feature inference (DFR) and hybrid loss function. The repair network consists of multiple inference modules (FRs) densely connected. Firstly, after the image to be restored was fed into the first inference module for feature inference, the output feature map channels were merged and sent to the next inference module. The input of each subsequent inference module was the inferred features from all the previous inference modules and so on, so as to make full use of the feature information captured by each reasoning module. Subsequently, a propagation consistent attention (PCA) mechanism was proposed to improve the overall consistency of the patched regions with the known regions. Finally, a hybrid loss function (ML) was proposed to optimize the structural coherence of the repair results. The whole DFR network adopted group normalization (GN), and excellent repair results can be achieved even using small training batches. The performance of the proposed algorithm was verified on Paris StreetView and CelebA face datasets, which are internationally recognized datasets. The objective and subjective experimental results show that the proposed algorithm can effectively repair large irregular missing images, improve feature utilization and structural coherence. Its average peak signal-to-noise ratio (PSNR), average structural similarity ( SSIM), mean square error (MSE), Fréchet distance (FID) and learning perceptual image block similarity (LPIPS) metrics all outperform the comparison algorithms.

Key words: image inpainting, dense feature reasoning, attention mechanism, hybrid loss function, group normalization

中图分类号: