A multidimensional genetic programming approach for identifying epsistatic gene interactions

William La Cava, Lee Spector, Sara Silva, Leonardo Vanneschi, Kourosh Danai, Jason H. Moore

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

We propose a novel methodology for binary and multiclass classification that uses genetic programming to construct features for a nearest centroid classifier. The method, coined M4GP, improves upon earlier approaches in this vein (M2GP and M3GP) by simplifying the program encoding, using advanced selection methods, and archiving solutions during the run. In our recent paper, we test this stategy against traditional GP formulations of the classification problem, showing that this framework outperforms boolean and floating point encodings. In comparison to several machine learning techniques, M4GP achieves the best overall ranking on benchmark problems. We then compare our algorithm against state-ofthe-art machine learning approaches to the task of disease classification using simulated genetics datasets with up to 5000 features. The results suggest that our proposed approach performs on par with the best results in literature with less computation time, while producing simpler models.

Original languageEnglish
Title of host publicationGECCO 2018 Companion
Subtitle of host publication Proceedings of the 2018 Genetic and Evolutionary Computation Conference Companion
PublisherAssociation for Computing Machinery, Inc
Pages23-24
Number of pages2
ISBN (Electronic)9781450357647
DOIs
Publication statusPublished - 6 Jul 2018
Event2018 Genetic and Evolutionary Computation Conference, GECCO 2018 - Kyoto, Japan
Duration: 15 Jul 201819 Jul 2018

Conference

Conference2018 Genetic and Evolutionary Computation Conference, GECCO 2018
Country/TerritoryJapan
CityKyoto
Period15/07/1819/07/18

Keywords

  • Classification
  • Feature construction
  • Genetics

Fingerprint

Dive into the research topics of 'A multidimensional genetic programming approach for identifying epsistatic gene interactions'. Together they form a unique fingerprint.

Cite this