计算机科学与技术

基于样本互补锚点图的缺失多视图聚类算法

  • 刘小兰 ,
  • 徐宇鸿
展开
  • 1.华南理工大学 数学学院,广东 广州 510640
    2.华南理工大学 计算机科学与工程学院,广东 广州 510006
刘小兰(1979—),女,博士,教授,主要从事优化算法与机器学习研究。E-mail: liuxl@scut.edu.cn

收稿日期: 2025-05-19

  网络出版日期: 2025-07-17

基金资助

国家社会科学基金项目(21BTJ069);广东省线上线下混合一流课程(粤教高函[2023]33号)

Incomplete Multi-View Clustering Algorithm Based on Sample Complementary Anchor Graph

  • LIU Xiaolan ,
  • XU Yuhong
Expand
  • 1.School of Mathematics,South China University of Technology,Guangzhou 510460,Guangdong,China
    2.School of Computer Science and Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China

Received date: 2025-05-19

  Online published: 2025-07-17

Supported by

the National Social Science Foundation of China(21BTJ069)

摘要

随着多视图数据在现实场景中得到广泛应用,如何处理缺失视图下的聚类问题已成为机器学习领域的重要挑战。传统锚点图聚类算法依赖完整实例构建锚点图,这导致其在高缺失率下因锚点不足难以表征数据结构,在低缺失率时又无法发挥锚点的优势。针对传统锚点图聚类算法中存在的锚点选择受限、权重分配僵化和计算复杂度高的问题,该文提出了一种基于样本互补锚点图的缺失多视图聚类算法(IMVC-SAC)。该算法首先设计跨视图锚点互补机制,通过在共有样本与视图特有样本中自适应选取锚点,以解决高缺失率下数据结构表征不足的问题;然后建立缺失模式感知的权重模型,依据样本的缺失模式与程度调整视图对相似矩阵的贡献度;最后利用双随机非负矩阵可分解特性,将谱聚类的时间复杂度从样本规模的立方阶复杂度优化至线性阶复杂度。在5个公开数据集上的实验结果表明,该算法的聚类性能优于目前主流算法,尤其在高缺失率下仍能保持较好的聚类效果,验证了其鲁棒性与有效性。

本文引用格式

刘小兰 , 徐宇鸿 . 基于样本互补锚点图的缺失多视图聚类算法[J]. 华南理工大学学报(自然科学版), 2026 , 54(2) : 16 -24 . DOI: 10.12141/j.issn.1000-565X.250145

Abstract

With the widespread application of multi-view data in real-world scenarios, clustering with incomplete views has emerged as a significant challenge in machine learning. Traditional anchor graph-based clustering algorithms rely on complete instances to build the anchor graphs. This dependency leads to insufficient anchors for capturing the underlying data structure under high missing rates, while failing to fully leverage the benefits of anchors when missing rate is low. To address the limitations of traditional methods, including restricted anchor selection, inflexible weight assignment, and high computational complexity, this paper proposed an incomplete multi-view clustering algorithm based on a Sample-Complementary Anchor Graphs (IMVC-SAC). First, the algorithm introduces a cross-view anchor complementation mechanism, which adaptively selects anchors from both shared samples and view-specific samples to enhance data structure representation, particularly under high missing rates. Second, it establishes a missing pattern-aware weighting model that dynamically adjusts the contribution of each view to the similarity matrix based on the missing pattern and degree of the samples. Finally, by leveraging the properties of doubly stochastic non-negative matrix factorization, the time complexity of spectral clustering is reduced from cubic to linear with respect to the sample size. Experimental results on five public datasets demonstrate that the proposed IMVC-SAC algorithm outperforms state-of-the-art methods in clustering performance. Notably, it maintains robust and effective clustering even under high missing rates, validating its superiority.

参考文献

[1] WANG Z, LI L, NING X,et al .Incomplete multi-view clustering via structure exploration and missing-view infe-rence[J].Information Fusion2024103:102123/1-12.
[2] LI S Y, JIANG Y, ZHOU Z H .Partial multi-view clustering[C]∥ Proceedings of the 28th AAAI Conference on Artificial Intelligence.Québec City:AAAI,2014:1968-1974.
[3] SHAO W, HE L, YU P S .Multiple incomplete views clustering via weighted nonnegative matrix factorization with regularization[C]∥ Proceedings of 2015 European Conference on Machine Learning and Knowledge Disco-very in Databases.Porto:Springer,2015:318-334.
[4] WEN J, XU Y, LIU H .Incomplete multiview spectral clustering with adaptive graph learning[J].IEEE Tran-sactions on Cybernetics201850(4):1418-1429.
[5] WEN J, SUN H, FEI L,et al .Consensus guided incomplete multi-view spectral clustering[J].Neural Networks2021133:207-219.
[6] GUO J, YE J .Anchors bring ease:an embarrassingly simple approach to partial multi-view clustering[C]∥ Proceedings of the 33rd AAAI Conference on Artificial Intelligence.Honolulu:AAAI,2019:118-125.
[7] YU X, JIANG Y, CHAO G,et al .Deep contrastive multi-view subspace clustering with representation and cluster interactive learning[J].IEEE Transactions on Knowledge and Data Engineering202437(1):188-199.
[8] CHAO G, XU K, XIE X,et al .Global graph propagation with hierarchical information transfer for incomplete contrastive multi-view clustering[C]∥ Proceedings of the 39th AAAI Conference on Artificial Intelligence.Philadelphia:AAAI,2025:15713-15721.
[9] SHEN Q, GUO Z, WANG H,et al .Reliable entropy-induced anchor learning for incomplete multi-view subspace clustering[J].IEEE Transactions on Circuits and Systems for Video Technology202535(6):5293-5306.
[10] WEI K, LI H, LIU Q,et al .Self-supervised,multi-view,semantics-aware anchor clustering[J].Electronics202413(23):4782/1-18.
[11] YU S, WANG S, ZHANG P,et al .DVSAI:diverse view-shared anchors based incomplete multi-view clustering[C]∥ Proceedings of the 38th AAAI Conference on Artificial Intelligence.Vancouver:AAAI,2024:16568-16577.
[12] MI Y, CHEN H, YUAN Z,et al .Fast multi-view subspace clustering with balance anchors guidance[J].Pattern Recognition2024145:109895/1-11.
[13] YANG B, WU J, ZHANG X,et al .Discrete correntropy-based multi-view anchor-graph clustering[J].Information Fusion2024103:102097/1-11.
[14] XIA W, GAO Q, WANG Q,et al .Tensorized bipartite graph learning for multi-view clustering[J].IEEE Transactions on Pattern Analysis and Machine Intelligence202245(4):5187-5202.
[15] HUANG D, WANG C D, LAI J H .Fast multi-view clustering via ensembles:towards scalability,superio-rity,and simplicity[J].IEEE Transactions on Know-ledge and Data Engineering202335(11):11388-11402.
[16] 赵兴旺,王淑君,刘晓琳,等 .基于二部图的联合谱嵌入多视图聚类算法[J].软件学报202435(9):4408-4424.
  ZHAO Xing-wang, WANG Shu-jun, LIU Xiao-lin,et al .Joint spectral embedding multi-view clustering algorithm based on bipartite graphs[J].Journal of Software202435(9):4408-4424.
[17] LIU W, HE J, CHANG S F .Large graph construction for scalable semi-supervised learning[C]∥ Proceedings of the 27th International Conference on Machine Learning.Haifa:Omni Press,2010:679-686.
[18] WANG S, LIU X, LIU L,et al .Highly-efficient incomplete large-scale multi-view clustering with consensus bipartite graph[C]∥ Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition.New Orleans:IEEE,2022:9776-9785.
[19] ZHANG R, HANG S, SUN Z,et al .Anchor-based fast spectral ensemble clustering[J].Information Fusion2025113:102587/1-13.
[20] LI Z, TANG C, ZHENG X,et al .High-order correlation preserved incomplete multi-view subspace cluste-ring[J].IEEE Transactions on Image Processing202231:2067-2080.
[21] HU M, CHEN S .One-pass incomplete multi-view clustering[C]∥ Proceedings of the 33rd AAAI Confe-rence on Artificial Intelligence.Honolulu:AAAI,2019:3838-3845.
[22] WEN J, ZHANG Z, ZHANG Z,et al .Generalized incomplete multiview clustering with flexible locality structure diffusion[J].IEEE Transactions on Cybernetics202051(1):101-114.
[23] GREENE D, CUNNINGHAM P .A matrix factorization approach for integrating multiple data views[C]∥ Proceedings of 2009 European Conference on Machine Learning and Knowledge Discovery in Databases.Bled:Springer,2009:423-438.
[24] JAIN A K, DUIN R P W, MAO J .Statistical pattern recognition:a review[J].IEEE Transactions on Pattern Analysis and Machine Intelligence200022(1):4-37.
[25] XIA R, PAN Y, DU L,et al .Robust multi-view spectral clustering via low-rank and sparse decomposition[C]∥ Proceedings of the 28th AAAI Conference on Artificial Intelligence.Palo Alto:AAAI,2014:2149-2155.
[26] LI F-F, ANDREETO M, RANZATO M,et al .Caltech 101(Version 1.0)[DB/OL].(2022-04-06)[2025-05-10]..
[27] 刘小兰,叶泽慧 .基于StarGAN和子空间学习的缺失多视图聚类[J].华南理工大学学报(自然科学版)202048(11):87-98.
  LIU Xiaolan, YE Zehui .Partial multi-view clustering based on StarGAN and subspace learning[J].Journal of South China University of Technology (Natural Science Edition)202048(11):87-98.
[28] ZHU J, WAN M, YANG G,et al .INCOMPLETE multi-view clustering based on low-rank adaptive graph learning[J].Knowledge-Based Systems2024305:112562/1-13.
文章导航

/