Journal of South China University of Technology (Natural Science Edition)

• Computer Science & Technology •     Next Articles

Normal Estimation from Single Monocular Images based on Multi-Scale Convolution Network

XIAN Chuhua1 LIU Xin1 LI Guiqing1 JIN Shuo2   

  1. 1. School of Computer Science and Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China; 2. Tricorn (Beijing) Technology Co. ,Ltd. ,Beijing 100029,China
  • Received:2018-10-31 Online:2018-12-25 Published:2018-11-01
  • Contact: 李桂清(1966-),男,博士,教授,博士生导师,主要从事计算机图形学、计算机动画、虚拟现实 CAGD 研究. E-mail:ligq@scut.edu.cn
  • About author:冼楚华(1982-),男,博士,副教授,主要从事几何建模与处理、计算机图形学、智能图形处理、CAD/CAE 集成学研 究. E-mail:chhxian@ scut. edu. cn
  • Supported by:
    Supported by the National Natural Science Foundation of China(61572202) and the Natural Sciene Foundation of Guangdong Province of China(2015A030313220,2017A030313347)

Abstract: Normal estimation from monocular images is one of the most important issues in computer graphics and computer vision research. Short of three-dimensional information,the corresponding normal is predicted from the monocular images,which is of great significance for 3D scene reconstruction,3D model recognition,3D semantic segmentation,etc. In order to find the solution to the problem,this paper adopts a multi-scale convolutional net- work structure to predict an end-to-end output of the image. The network consists of two scales,the first layer uses the DenseNet classification network with the best performance in ImageNet to process the input globally. The second level uses a fully convolutional network to further fine-tune the output obtained from the first level. The experimen- tal results show that the network proposed in this paper can achieve better results in normal prediction of monocular image even without using other pre-processing or post-processing steps.

Key words: normal estimation, monocular image, convolutional neural network