Computer Science & Technology

Semantic Textual Similarity Justification based on Multi-Model Ensemble

Expand
  • 1. School of Computer Science & Engineering,South China University of Technology,Guangzhou 510640,Guangdong,China;
    2. School of Mechanical & Automotive Engineering,South China University of Technology,Guangzhou 510640,Guangdong,China;
    3. College of Medical Information Engineering,Guangdong Pharmaceutical University,Guangzhou 510006,Guangdong,China
苏锦钿 (1980-),男,博士,副教授,主要从事自然语言处理、深度学习和程序语言设计等研究

Received date: 2021-06-29

  Revised date: 2021-09-16

  Online published: 2021-09-24

Abstract

As the mainstream and typical methods in current natural language processing and artificial intelligence, various pre-trained language models perform differently on the downstream tasks, due to their different language modeling, feature representation, model structure, training tasks and pre-training corpus, et al. In order to better ensemble the knowledge in different pre-trained language models and utilize their learning abilities on the downstream tasks, we propose a multi-model ensemble method MME-STS (Multi-Model Ensemble for Semantic Textual Similarity) for semantic textual similarity justification tasks. The model structure and the corresponding feature representations are presented, and three different ensemble strategies based on average values, full-connected layer training and Adaboost algorithm with respect to model ensemble are also proposed. Experimental results show that MME-STS outperforms significantly over single pre-trained language model-based approaches on the two benchmark datasets of SemEval 2014 task 4 SICK and SemEval 2017 STS-B corpus in terms of Pearson correlation coefficient and Spearman coefficient metrics.

Cite this article

SU Jindian, HONG Xiaobin, YU Shanshan . Semantic Textual Similarity Justification based on Multi-Model Ensemble[J]. Journal of South China University of Technology(Natural Science), 2022 , 50(4) : 1 -9 . DOI: 10.12141/j.issn.1000-565X.210427

Outlines

/