Bloat free Genetic Programming: Application to human oral bioavailability prediction

Research output: Contribution to journalReview articlepeer-review

5 Citations (Scopus)

Abstract

Being able to predict the human oral bioavailability for a potential new drug is extremely important for the drug discovery process. This problem has been addressed by several prediction tools, with Genetic Programming providing some of the best results ever achieved. In this paper we use the newest developments of Genetic Programming, in particular the latest bloat control method, Operator Equalisation, to find out how much improvement we can achieve on this problem. We show examples of some actual solutions and discuss their quality, comparing them with previously published results. We identify some unexpected behaviours related to overfitting, and discuss the way for further improving the practical usage of the Genetic Programming approach.

Original languageEnglish
Pages (from-to)585-601
Number of pages17
JournalInternational Journal Of Data Mining And Bioinformatics
Volume6
Issue number6
DOIs
Publication statusPublished - 2012

Keywords

  • Bloat
  • Code growth
  • Data mining
  • Drug discovery
  • Feature selection
  • Genetic programming
  • Human oral bioavailability
  • Operator equalisation
  • Overfitting
  • Prediction
  • Solution length
  • Symbolic regression

Fingerprint Dive into the research topics of 'Bloat free Genetic Programming: Application to human oral bioavailability prediction'. Together they form a unique fingerprint.

Cite this