Journal of South China University of Technology(Natural Science Edition) ›› 2022, Vol. 50 ›› Issue (4): 1-9.doi: 10.12141/j.issn.1000-565X.210427

Special Issue: 2022年计算机科学与技术

• Computer Science & Technology • Previous Articles     Next Articles

Semantic Textual Similarity Justification based on Multi-Model Ensemble

SU JindianHONG XiaobinYU Shanshan3   

  1. 1. School of Computer Science & Engineering,South China University of Technology,Guangzhou 510640,Guangdong,China;
    2. School of Mechanical & Automotive Engineering,South China University of Technology,Guangzhou 510640,Guangdong,China;
    3. College of Medical Information Engineering,Guangdong Pharmaceutical University,Guangzhou 510006,Guangdong,China
  • Received:2021-06-29 Revised:2021-09-16 Online:2022-04-25 Published:2021-09-24
  • Contact: 洪晓斌 (1979-),男,博士,教授,主要从事网络化智能测控技术及应用等研究 E-mail: mexbhong@ scut. edu. cn
  • About author:苏锦钿 (1980-),男,博士,副教授,主要从事自然语言处理、深度学习和程序语言设计等研究

Abstract: As the mainstream and typical methods in current natural language processing and artificial intelligence, various pre-trained language models perform differently on the downstream tasks, due to their different language modeling, feature representation, model structure, training tasks and pre-training corpus, et al. In order to better ensemble the knowledge in different pre-trained language models and utilize their learning abilities on the downstream tasks, we propose a multi-model ensemble method MME-STS (Multi-Model Ensemble for Semantic Textual Similarity) for semantic textual similarity justification tasks. The model structure and the corresponding feature representations are presented, and three different ensemble strategies based on average values, full-connected layer training and Adaboost algorithm with respect to model ensemble are also proposed. Experimental results show that MME-STS outperforms significantly over single pre-trained language model-based approaches on the two benchmark datasets of SemEval 2014 task 4 SICK and SemEval 2017 STS-B corpus in terms of Pearson correlation coefficient and Spearman coefficient metrics.

Key words: Deep learning, Semantic Textual Similarity, Natural Language Processing, Pre-trained Language Model, Model Ensemble