Computer Science & Technology

An Imbalanced Classification Method Based on Adaptive Sampling

  • CHEN Qiong ,
  • XIE Jia-Liang
Expand
  • 1.School of Management Science and Engineering,Anhui University of Finance and Economics,Bengbu 233030,Anhui,China;

     2.School of Electronics and Information Engineering,Anhui University,Hefei 230601,Anhui,China


陈琼 (1966-),女,博士,副教授,主要从事人工智能、机器学习、智能计算等研究

Received date: 2021-04-28

  Revised date: 2021-11-07

  Online published: 2021-11-22

Supported by

Key-Area Research and Development Program of Guangdong Province

Abstract

In view of the problem that traditional resampling methods mostly use fixed sampling strategies and cannot change the sampling strategy according to the optimization requirements of the model, this paper proposes an adaptive sampling-based imbalanced classification method (Adaptive Sampling Imbalanced Classification, ASIC). This method dynamically adjusts the sampling probabilities of samples of different classes on the training set according to the performance of the classification model on the validation set, so that the sampling probabilities of different classes are dynamically determined by the requirements of the current classification model. At the same time, this method pays extra attention to the minority classes, and gives the minority classes a higher sampling probability under the same other conditions, so as to compensate for the negative impact of the insufficient example number of the minority class itself on the classification model, thereby improving the classification model's ability to recognize minority classes. The experimental results show that the classification model trained with the ASIC method is better than the comparison methods in terms of balanced accuracy and geometric mean, and the more imbalanced the data distribution, the more obvious the superiority of the ASIC method.

Cite this article

CHEN Qiong , XIE Jia-Liang . An Imbalanced Classification Method Based on Adaptive Sampling[J]. Journal of South China University of Technology(Natural Science), 2022 , 50(4) : 26 -34,45 . DOI: 10.12141/j.issn.1000-565X.210267

Outlines

/