收稿日期: 2013-04-22
修回日期: 2013-09-27
网络出版日期: 2013-11-19
基金资助
教育部中国网格计划生物信息网格平台子项目(B12137040130)
Modularity- Based Graph Clustering for Analysis of Gene Microarray Data
Received date: 2013-04-22
Revised date: 2013-09-27
Online published: 2013-11-19
Supported by
教育部中国网格计划生物信息网格平台子项目(B12137040130)
李力 曹以诚 毛晓帆 . 用于基因芯片数据分析的模块性图聚类方法[J]. 华南理工大学学报(自然科学版), 2013 , 41(12) : 101 -106 . DOI: 10.3969/j.issn.1000-565X.2013.12.017
As an important clustering algorithm,graph clustering can be effectively applied to protein interactionnetworks and microarray data clustering.In this paper,to overcome the shortcomings of the existing graph cluste-ring methods for gene microarray data,a global graph clustering method based on the modularity and the subgraphsmoothness is proposed.In this algorithm,subgraph smoothness is introduced to avoid the local optimal solution,subgraphs with low smoothness values in the clustering results are split into singletons,and those newly- generatedsingletons are used in the next clustering step.After several iterations,the global optimal clustering result can beobtained.The proposed method is then compared with four commonly- used clustering methods (the classic graphclustering,the k- means algorithm,the SOM algorithm,and the Fuzzy algorithm) on a group of genome expressiondata,and the results show that (1) the proposed method is superior to the other four methods in terms of averagenon- overlap proportion and FOM' value; (2) when the dataset is divided into 39 clusters,the FOM' value of theproposed method is respectively 28.41%,19.21%,9.84% and 24.67% lower than those of the other four me-thods; and (3) the proposed method is of a classification accuracy,which is higher than that of the Fuzzy algorithmand the SOM algorithm,with an execution efficiency similar to that of the SOM algorithm but 5.94% higher thanthat of the Fuzzy method.
Key words: gene microarray; graph clustering; modularity; smoothness; algorithm
/
| 〈 |
|
〉 |