Journal of South China University of Technology (Natural Science Edition) ›› 2015, Vol. 43 ›› Issue (5): 120-125. doi: 10.3969/j.issn.1000-565X.2015.05.019

• Computer Science and Technology •

Spatial Non-Negative Matrix Factorization Based on Max-Margin Coding

Liu Da-kun, Tan Xiao-yang

  1. College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, Jiangsu, China
  • Received: 2014-12-15 Online: 2015-05-25 Published: 2015-05-07
  • Contact / About author: Liu Da-kun (born 1984), male, Ph.D. candidate, working mainly on machine learning, pattern recognition, and computer vision. E-mail: liudakun315@nuaa.edu.cn
  • Supported by:
    the National Natural Science Foundation of China (61073112, 61373060), the Natural Science Foundation of Jiangsu Province, China (BK2012793), and the Ph.D. Program Foundation of the Ministry of Education of China (20123218110033)

Abstract: Although parts-based representations are robust in image processing, non-negative matrix factorization (NMF) imposes only an implicit locality constraint, so the factorization is not unique and the basis images are not sufficiently local. Moreover, locality and discriminability, two important properties of a feature representation, are seldom considered simultaneously in NMF. To address these problems, a spatial NMF based on max-margin coding is proposed. In this method, image data are regarded as a 2D network of pixels and, drawing on network knowledge, spatial information is embedded into the basis images, which not only imposes an explicit locality constraint but also compensates for the spatial information lost when the data are vectorized. In addition, an extra 1D space learned from the max-margin constraint is used to balance the effects of the reconstruction error and the discriminative constraint on the basis images. Face recognition experiments on the AR and extended YaleB databases show that the proposed max-margin coding-based spatial NMF is more robust than NMF and several of its typical variants.

Key words: pattern classification, non-negative matrix factorization, spatial constraint, discriminative subspace representation, max-margin constraint
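
For orientation, the baseline that the abstract builds on can be sketched in a few lines. The block below is plain NMF with the standard multiplicative updates for the Frobenius reconstruction error, not the proposed method: it contains neither the spatial constraint nor the max-margin term. The function names, the rank, the iteration count, and the random stand-in data are all assumptions made for illustration.

```python
import numpy as np


def nmf(V, rank, n_iter=200, eps=1e-10, seed=0):
    """Baseline NMF (illustrative only): factor V ~= W @ H with W, H >= 0.

    V    : (n_pixels, n_samples) non-negative matrix, one vectorized image per column
    rank : number of basis images (columns of W)
    """
    rng = np.random.default_rng(seed)
    n_pixels, n_samples = V.shape
    # Strictly positive random initialization keeps the updates well defined.
    W = rng.random((n_pixels, rank)) + eps
    H = rng.random((rank, n_samples)) + eps
    for _ in range(n_iter):
        # Multiplicative updates preserve non-negativity by construction.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def reconstruction_error(V, W, H):
    """Frobenius norm of the residual V - W @ H."""
    return float(np.linalg.norm(V - W @ H, "fro"))


# Hypothetical data: 30 random "images" of 8 x 8 pixels, vectorized to length 64.
rng = np.random.default_rng(1)
V = rng.random((64, 30))
W, H = nmf(V, rank=5)
err = reconstruction_error(V, W, H)
```

Each column of `W` is one basis image in vectorized form. Nothing in these updates knows which pixels are neighbors, which illustrates the spatial information that the abstract says is lost through vectorization.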