Journal of South China University of Technology(Natural Science Edition) ›› 2026, Vol. 54 ›› Issue (2): 91-101.doi: 10.12141/j.issn.1000-565X.250020

• Biological Engineering • Previous Articles     Next Articles

Prediction of Target Inhibitor Activity by Integrating Machine Learning and Metaheuristic Algorithms

LING Fei(), GU Xuerong()   

  1. School of Biology and Biological Engineering/ Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China
  • Received:2025-01-17 Online:2026-02-25 Published:2025-09-19
  • Contact: GU Xuerong E-mail:fling@scut.edu.cn;202120124398@mail.scut.edu.cn
  • Supported by:
    the National Natural Science Foundation of China(12322119)

Abstract:

Traditional machine learning (ML) and deep learning (DL) play a key role in predicting the activity of target inhibitors. Many models based on existing datasets can predict compound bioactivity. However, debate persists regarding whether ML or DL performs better for such prediction tasks. In this study, datasets were constructed based on different molecular representations. Ten metaheuristic algorithms were applied to optimize the hyperparameters of eleven ML and DL models, aiming to systematically compare their predictive performance and identify the optimal ones. The results show that ML and DL models whose hyperparameters were optimized by metaheuristic algorithms significantly outperformed those optimized using the traditional grid search method. Furthermore, in low-dimensional feature spaces, graph-based DL models, such as SSA-GAT and SSA-Attentive FP, can automatically extract informative features from data via an end-to-end learning mechanism, yielding better performance than ML models. In contrast, in high-dimensional feature spaces (e.g., the feature space formed by combining RDKit descriptors with ECFP, AtomPairs, and MACCS fingerprints), ML methods, leveraging the complementary information in molecular features and the powerful optimization capability of metaheuristic algorithms, can effectively capture complex feature interactions. Consequently, ML methods often demonstrate higher accuracy and robustness in high-dimensional modeling. These findings provide valuable guidance for selecting between ML and DL approaches for target inhibitor activity prediction.

Key words: metaheuristic optimization algorithm, machine learning, deep learning, target inhibitor activity, molecular fingerprints, molecular graph

CLC Number: