Journal of South China University of Technology (Natural Science Edition), 2019, Vol. 47, Issue (11): 71-77. doi: 10.12141/j.issn.1000-565X.190224

• Biotechnology •

Application of Sequence Alignment-Free Comparison-Based SeqDistK to Microbial Flora Clustering

LIU Xuemei, HUANG Guanda, HUANG Tianlai

  1. School of Physics, South China University of Technology, Guangzhou 510640, Guangdong, China
  • Received: 2019-05-05 Revised: 2019-05-26 Online: 2019-11-25 Published: 2019-10-02
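  • Contact: LIU Xuemei (born 1975), female, Ph.D., associate professor; her research focuses on bioinformatics. E-mail: liuxm@scut.edu.cn
  • About author: LIU Xuemei (born 1975), female, Ph.D., associate professor; her research focuses on bioinformatics.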
  • Supported by:
    Supported by the National Natural Science Foundation of China (11722546, 11675226)

Abstract: Alignment-free sequence comparison is an active research topic in bioinformatics for the classification of microbial flora. This paper presents SeqDistK, an alignment-free sequence comparison software package based on k-mer statistics. The open-source package is available at https://github.com/htczero/SeqDistK. SeqDistK offers fast computation and high accuracy in microbial flora classification, and has the potential to scale to large data sets. When SeqDistK is used to cluster 63 distance matrices of 16S rRNA gene sequences, the clustering results are largely consistent with existing classifications, which indicates that SeqDistK can accurately classify microbial flora samples and provides an effective tool for phylogenetic analysis in molecular biology.

Key words: k-mer, sequence alignment-free comparison, SeqDistK, 16S rRNA, clustering
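
The general idea behind k-mer-based alignment-free comparison can be illustrated with a minimal Python sketch. It is not SeqDistK's implementation: the function names, the choice of Manhattan distance, and the restriction to the A/C/G/T alphabet are all assumptions made for illustration. Each sequence is reduced to a vector of relative k-mer frequencies, and pairwise distances between these vectors form the distance matrix that a clustering method would then consume.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(seq, k):
    """Relative frequencies of all 4**k DNA k-mers in seq (hypothetical helper)."""
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    # Only k-mers over A/C/G/T are counted; windows containing other
    # symbols (for example N) are ignored. This is an assumption.
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]


def manhattan(p, q):
    """Manhattan (L1) distance between two frequency vectors."""
    return sum(abs(a - b) for a, b in zip(p, q))


def distance_matrix(seqs, k):
    """Pairwise distance matrix over the k-mer profiles of seqs."""
    profiles = [kmer_frequencies(s, k) for s in seqs]
    n = len(profiles)
    return [[manhattan(profiles[i], profiles[j]) for j in range(n)]
            for i in range(n)]
```

A matrix produced this way could be passed to any standard hierarchical clustering routine. Which distance measures and values of k SeqDistK itself uses is described in the full paper, not in this sketch.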
