Journal of South China University of Technology(Natural Science Edition) ›› 2023, Vol. 51 ›› Issue (9): 110-119.doi: 10.12141/j.issn.1000-565X.220751

• Computer Science & Technology • Previous Articles     Next Articles

Fast Multi-View Clustering Based on Uniform Label Matrix

LIU Yijun1 WANG Jiada2 ZHONG Shijie3 YANG Xiaojun3 YE Wujian1   

  1. 1.School of Integrated Circuits,Guangdong University of Technology,Guangzhou 510006,Guangdong,China
    2.School of Advanced Manufacturing,Guangdong University of Technology,Jieyang 515200,Guangdong,China
    3.School of Information Engineering,Guangdong University of Technology,Guangzhou 510006,Guangdong,China
  • Received:2022-11-15 Online:2023-09-25 Published:2023-02-20
  • Contact: 杨晓君(1983-),男,博士,副教授,主要从事机器学习、聚类算法研究。 E-mail:yangxj18@gdut.edu.cn
  • About author:刘怡俊(1977-),男,教授,博士生导师,主要从事类脑计算机、深度学习研究。E-mail:yjliu@gdut.edu.cn
  • Supported by:
    the National Key R&D Program of China(2018YFB1802100);the Key-Area R&D Program of Guangdong Province(2018B030338001)

Abstract:

In the field of multi-view clustering, many methods learn the similarity matrix directly from the original data, but this ignores the effect of noise in the original data. In addition, some methods must perform a feature decomposition on the graph Laplacian matrix, which leads to reduced interpretability and requires post-processing such as k-means. To address these issues, this paper proposed a fast multi-view clustering based on a unified label matrix. Firstly, a non-negative constraint was added to the objective function from the unified viewpoint of the normalized cut of the relaxation and the ratio cut. Then, a structured graph reconstruction was performed on the similarity matrix by the indicator matrix to ensure that the obtained graph has strong intra-cluster connections and weak inter-cluster connections. In addition, the number of iterations was reduced by setting a unified label matrix, thus further improving the speed of the method. Finally, the problem was solved optimally based on an alternating direction multiplication strategy. The algorithm aligns the multi-view dataset by randomly selecting the anchor addresses, and aligning the views can significantly improve the accuracy of clustering. The problem of the high computational complexity of traditional spectral clustering algorithms was effectively solved by using singular value decomposition instead of feature decomposition in the iterative process. Labels were obtained directly by indicating the column labels of the largest element of the matrix by row index. Experimental results on four real datasets demonstrate the effectiveness of the algorithm, and show that its clustering performance outperformed the nine existing benchmark algorithms.

Key words: multi-view clustering, label matrix, graph reconstruction, interpretability

CLC Number: