计算机科学与技术

基于锚点图的低秩缺失多视图子空间聚类

展开
  • 华南理工大学 数学学院,广东 广州 510640
刘小兰(1979-),女,博士,副教授,主要从事优化算法与机器学习研究.E-mail:liuxl@scut.edu.cn.

收稿日期: 2022-02-21

  网络出版日期: 2022-07-15

基金资助

广东省自然科学基金资助项目(2020A1515010699)

Anchor Graph Based Low-Rank Incomplete Multi-View Subspace Clustering

Expand
  • School of Mathematics,South China University of Technology,Guangzhou 510460,Guangdong,China
刘小兰(1979-),女,博士,副教授,主要从事优化算法与机器学习研究.E-mail:liuxl@scut.edu.cn.

Received date: 2022-02-21

  Online published: 2022-07-15

Supported by

the Natural Science Foundation of Guangdong Province(2020A1515010699)

摘要

经典的多视图聚类任务针对的是完整数据,然而实际任务中受限于信息的获取渠道,某些视图往往含有缺失数据,由此引出了缺失多视图聚类问题。针对此问题,现有的聚类模型大多基于非负矩阵分解或距离构造聚类图,存在着联合优化使解的性能得不到保障以及无法对全局结构进行充分刻画的不足。为了提升聚类图的性能,基于低秩子空间聚类和锚点图,文中提出了一种低秩缺失多视图子空间聚类算法ALIMSC。该算法先通过基于锚点图的缺失多视图聚类(APMC)算法得到数据的基准相似矩阵,将其嵌入低秩子空间聚类模型,通过升维对齐和加权融合的方法得到相似矩阵,再通过让相似矩阵与基准相似矩阵尽可能的一致来求得最终的聚类图。ALIMSC算法通过对每个视图的相似矩阵施加秩最小化约束来刻画高维数据的低维子空间分布,在原有锚点图的基础上进一步强调了数据的子空间算法结构,即聚类图中所体现的块对角性。在多个公开数据集上的实验结果表明,本文算法的聚类性能优于所对比的缺失多视图聚类算法。

本文引用格式

刘小兰, 石宗宇, 叶泽慧, 等 . 基于锚点图的低秩缺失多视图子空间聚类[J]. 华南理工大学学报(自然科学版), 2022 , 50(12) : 60 -70 . DOI: 10.12141/j.issn.1000-565X.220069

Abstract

Traditional multi-view clustering task is for complete data. However, in practical tasks, due to the limitation of the information acquisition method, some views tend to contain missing data, and this leads to the problem of incomplete multi-view clustering. In view of this problem, most of the existing clustering models are based on non-negative matrix factorization or distance graph, and their co-optimization strategy can easily make the performance of the solution insecure and the global structure can’t be fully characterized. In order to improve the performance of clustering graph, this paper proposed an incomplete multi-view clustering algorithm ALIMSC based on low-rank subspace clustering and anchor graph. The algorithm first obtained the benchmark similarity matrix of data by incomplete multi-view subspace clustering algorithm APMC based on anchor graph, which was embedded in the low-rank subspace clustering model. The similarity matrix was obtained by dimensionality ascending alignment and weighted fusion, and the final clustering graph was obtained by making the similarity matrix as consistent as possible with the benchmark similarity matrix. ALIMSC algorithm characterized the low-dimensional subspace distribution of high-dimensional data by imposing rank minimization constraint on the similarity matrix of each view and emphasized the subspace structure of the data on the basis of the original anchor graph, that is, the block diagonality reflected in the cluster graph. Experimental results on several public datasets show that the proposed algorithm outperforms the classical incomplete multi-view algorithms.

参考文献

1 KUMAR A, DAUMé H .A co-training approach for multi-view spectral clustering [C]∥ Proceedings of the 28th International Conference on Machine Learning.New York:ACM,2011:393-400.
2 YANG Y, WANG H .Multi-view clustering:a survey [J].Big Data Mining and Analytics,2018,1(2):83-107.
3 GAO H, NIE F, LI X,et al .Multi-view subspace clustering [C]∥ Proceedings of 2015 IEEE International Conference on Computer Vision.Santiago:IEEE,2015:4238-4246.
4 LI S Y, JIANG Y, ZHOU Z H .Partial multi-view clustering [C]∥ Proceedings of the 28th AAAI Conference on Artificial Intelligence.Québec City:AAAI,2014:1968-1974.
5 SHAO W, HE L, YU P S .Multiple incomplete views clustering via weighted nonnegative matrix factorization with L2,1 regularization [C]∥ Proceedings of 2015 European Conference on Machine Learning and Knowledge Discovery in Databases.Porto:Springer,2015:318-334.
6 ZHAO H, LIU H, FU Y .Incomplete multi-modal visual data grouping [C]∥ Proceedings of the 25th International Joint Conference on Artificial Intelligence.New York:AAAI,2016:2392-2398.
7 HU M, CHEN S .Doubly aligned incomplete multi-view clustering [C]∥ Proceedings of the 27th International Joint Conference on Artificial Intelligence.Stockholm:IJCAI Organization,2018:2262-2268.
8 WANG Q, DING Z, TAO Z,et al .Partial multi-view clustering via consistent GAN [C]∥ Proceedings of 2018 IEEE International Conference on Data Mining.New York: IEEE,2018:1290-1295.
9 HU M, CHEN S .One-pass incomplete multi-view clustering [C]∥ Proceedings of the 33rd AAAI Conference on Artificial Intelligence.Honolulu:AAAI,2019:3838-3845.
10 WEN J, XU Y, LIU H .Incomplete multiview spectral clustering with adaptive graph learning [J].IEEE Transactions on Cybernetics,2018,50(4):1418-1429.
11 WEN J, SUN H, FEI L,et al .Consensus guided incomplete multi-view spectral clustering [J].Neural Networks,2021,133:207-219.
12 LI Z, TANG C, ZHENG X,et al .High-order correlation preserved incomplete multi-view subspace clustering [J].IEEE Transactions on Image Processing,2022,31:2067-2080.
13 LIU W, WANG J, KUMAR S,et al .Hashing with graphs [C]∥ Proceedings of the 28th International Conference on Machine Learning.New York:ACM,2011:1-8.
14 KANG Z, LIN Z, ZHU X,et al .Structured graph learning for scalable subspace clustering:from single view to multiview [J].IEEE Transactions on Cybernetics,2022,52(9):8976-8986.
15 GUO J, YE J .Anchors bring ease:an embarrassingly simple approach to partial multi-view clustering [C]∥ Proceedings of the 33rd AAAI Conference on Artificial Intelligence.Honolulu:AAAI,2019:118-125.
16 LIU G, LIN Z, YAN S,et al .Robust recovery of subspace structures by low-rank representation [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2012,35(1):171-184.
17 BOYD S, PARIKH N, CHU E .Distributed optimization and statistical learning via the alternating direction method of multipliers [M].Norwell:Now Publishers Inc.,2011.
18 CAI J F, CANDèS E J, SHEN Z .A singular value thresholding algorithm for matrix completion [J].SIAM Journal on Optimization,2010,20(4):1956-1982.
19 NIE F, WANG X, JORDAN M,et al .The constrained Laplacian rank algorithm for graph-based clustering [C]∥ Proceedings of the 30th AAAI Conference on Artificial Intelligence.Phoenix:AAAI,2016:1969-1976.
20 LV J, KANG Z, WANG B,et al .Multi-view subspace clustering via partition fusion [J].Information Sciences,2021,560:410-423.
21 GREENE D, CUNNINGHAM P .A matrix factorization approach for integrating multiple data views [C]∥ Proceedings of 2009 European Conference on Machine Learning and Knowledge Discovery in Databases.Bled:Springer,2009:423-438.
22 HULL J J .A database for handwritten text recognition research [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1994,16(5):550-554.
23 LECUN Y, BOTTOU L, BENGIO Y,et al .Gradient-based learning applied to document recognition [J].Proceedings of the IEEE,1998,86(11):2278-2324.
24 JAIN A K, DUIN R P W, MAO J .Statistical pattern recognition:a review [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2000,22(1):4-37.
25 YIN Q, WU S, WANG L .Incomplete multi-view clustering via subspace learning [C]∥ Proceedings of the 24th ACM International on Conference on Information and Knowledge Management.New York:ACM,2015:383-392.
26 XIA R, PAN Y, DU L,et al .Robust multi-view spectral clustering via low-rank and sparse decomposition [C]∥ Proceedings of the 28th AAAI Conference on Artificial Intelligence.Québec City:AAAI,2014:2149-2155.
27 刘小兰,叶泽慧 .基于StarGAN和子空间学习的缺失多视图聚类 [J].华南理工大学学报(自然科学版),2020,48(11):87-98.
27 LIU Xiaolan, YE Zehui .Partial multi-view clustering based on StarGAN and subspace learning [J].Journal of South China University of Technology(Natural Science Edition),2020,48(11):87-98.
文章导航

/