Advanced Genetic Programming vs. State-of-the-Art AutoML in Imbalanced Binary Classification

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)
32 Downloads (Pure)

Abstract

This article compares two novel genetic programming (GP) techniques, differentiable Cartesian genetic programming for artificial neural networks (DCGPANN) and geometric semantic genetic programming (GSGP), with three state-of-the-art automated machine learning (AutoML) tools: Auto-Keras, Auto-PyTorch, and Auto-Sklearn. Although each of these techniques was compared with several baseline algorithms when it was introduced, direct comparisons between them are still lacking, especially between the GP approaches and state-of-the-art AutoML. This study aims to fill that gap in order to analyze the true potential of GP for AutoML. The tools are assessed on 20 benchmark datasets from imbalanced binary classification, a frequent and challenging problem area. They are compared across four categories: average performance, maximum performance, standard deviation of performance, and generalization ability. F1-score, G-mean, and AUC serve as the evaluation metrics. The analysis finds that the GP techniques, while unable to completely outperform state-of-the-art AutoML, are already a very competitive alternative. These advanced GP tools therefore offer a new and promising approach for practitioners developing machine learning (ML) models.
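As a rough illustration of the three evaluation metrics named in the abstract, the following sketch computes F1-score, G-mean, and AUC for a small imbalanced toy problem. It is not taken from the article: the toy labels and scores are invented, the 0.5 decision threshold is an assumption, and G-mean is assumed to follow its common definition as the geometric mean of sensitivity and specificity, which the abstract itself does not spell out. AUC is computed here through its pairwise-ranking interpretation rather than by integrating a ROC curve.

```python
from math import sqrt


def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn), treating label 1 as the (minority) positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def f1_score(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is zero."""
    tp, _, fp, fn = confusion_counts(y_true, y_pred)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def g_mean(y_true, y_pred):
    """Assumed definition: sqrt(sensitivity * specificity)."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sqrt(sensitivity * specificity)


def auc(y_true, scores):
    """Probability that a random positive is scored above a random negative.

    Ties count as half a win (the Mann-Whitney formulation of ROC AUC).
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Invented toy data: 8 majority-class (0) and 2 minority-class (1) samples.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
scores = [0.1, 0.2, 0.3, 0.4, 0.2, 0.1, 0.6, 0.3, 0.7, 0.4]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

print("F1-score:", f1_score(y_true, y_pred))
print("G-mean:  ", g_mean(y_true, y_pred))
print("AUC:     ", auc(y_true, scores))
```

On this toy data, 8 of 10 predictions are correct even though only one of the two minority samples is found. This illustrates why metrics such as these, which are sensitive to minority-class performance, are commonly preferred over plain accuracy for imbalanced problems.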
Original language: English
Pages (from-to): 1349-1363
Number of pages: 15
Journal: Emerging Science Journal
Volume: 7
Issue number: 4
Early online date: 12 Jul 2023
Publication status: Published - 1 Aug 2023

Keywords

  • Genetic Programming
  • Automated Machine Learning
  • AutoML
  • Imbalanced Binary Classification
