华南理工大学学报(自然科学版) ›› 2020, Vol. 48 ›› Issue (11): 87-98.doi: 10.12141/j.issn.1000-565X.200128

• 计算机科学与技术 • 上一篇    下一篇

基于 StarGAN 和子空间学习的缺失多视图聚类

刘小兰 叶泽慧   

  1. 华南理工大学 数学学院,广东 广州 510640
  • 收稿日期:2020-03-24 修回日期:2020-07-04 出版日期:2020-11-25 发布日期:2020-11-05
  • 通信作者: 刘小兰(1979-),女,博士,副教授,主要从事优化算法与机器学习研究。 E-mail:liuxl@scut.edu.cn
  • 作者简介:刘小兰(1979-),女,博士,副教授,主要从事优化算法与机器学习研究。
  • 基金资助:
    国家自然科学基金资助项目 (61502175); 广东省自然科学基金资助项目 (2020A1515010699)

Partial Multi-view Clustering Based on StarGAN and Subspace Learning

LIU Xiaolan YE Zehui   

  1. School of Mathematics,South China University of Technology,Guangzhou 510640,Guangdong,China
  • Received:2020-03-24 Revised:2020-07-04 Online:2020-11-25 Published:2020-11-05
  • Contact: 刘小兰(1979-),女,博士,副教授,主要从事优化算法与机器学习研究。 E-mail:liuxl@scut.edu.cn
  • About author:刘小兰(1979-),女,博士,副教授,主要从事优化算法与机器学习研究。
  • Supported by:
    Supported by the National Natural Science Foundation of China (61502175) and the Natural Science Founda-tion of Guangdong Province (2020A1515010699)

摘要: 传统多视图聚类方法假设每一个视图的数据都是完整的。然而,现实生活中有些数据在某些视图上缺失,由此产生了缺失多视图聚类问题。现有的缺失多视图聚类方法大多是基于核矩阵和非负矩阵分解提出的。这些方法大多只是学习一个共有的聚类结构,没有充分利用已有的数据信息推断缺失的数据。基于星型生成对抗网络 (Star-GAN) 和子空间学习,本文中提出了一种缺失多视图聚类算法 SSPMVC。SSPMVC 充分利用已有的数据信息,用基于 StarGAN 的生成模型生成视图缺失的数据,捕获了视图数据的完整性和一致性全局结构,然后将补全的多视图数据在子空间进行聚类。SSPMVC将生成模型和聚类模型联合训练,交替优化生成模型和聚类模型。实验结果表明,论文提出的算法优于与之比较的经典的多视图聚类方法。

关键词: 缺失多视图, 子空间聚类, 星型生成对抗网络

Abstract: The traditional multi-view clustering method assumes that the data of each view is complete. However,there is a lack of data in some views in real life,and thus leads to the problem of partial multi-view clustering.Most of the existing partial multi-view clustering methods are based on kernel matrix and non-negative matrix de-composition,and most of them are just learning a common clustering structure,rather than making full use of the existing data information to infer the missing data. Based on StarGAN and subspace learning,a partial multi-view clustering algorithm (SSPMVC) was proposed in this study. SSPMVC makes full use of the existing data informa-tion to generate the missing data with the generation model based on StarGAN,captures the integrity and consisten-cy global structure of the data,and then clusters the completed multi-view data in the subspace. The generation model and clustering model were trained jointly by SSPMVC,and the generation model and clustering model were alternately optimized. The experimental results show that the algorithm proposed in this paper is superior to the classical multi-view clustering method.

Key words: partial multi-view, subspace learning, StarGAN

中图分类号: