Abstract
We propose a novel methodology for binary and multiclass classification that uses genetic programming to construct features for a nearest centroid classifier. The method, coined M4GP, improves upon earlier approaches in this vein (M2GP and M3GP) by simplifying the program encoding, using advanced selection methods, and archiving solutions during the run. In our recent paper, we test this stategy against traditional GP formulations of the classification problem, showing that this framework outperforms boolean and floating point encodings. In comparison to several machine learning techniques, M4GP achieves the best overall ranking on benchmark problems. We then compare our algorithm against state-ofthe-art machine learning approaches to the task of disease classification using simulated genetics datasets with up to 5000 features. The results suggest that our proposed approach performs on par with the best results in literature with less computation time, while producing simpler models.
Original language | English |
---|---|
Title of host publication | GECCO 2018 Companion |
Subtitle of host publication | Proceedings of the 2018 Genetic and Evolutionary Computation Conference Companion |
Publisher | Association for Computing Machinery, Inc |
Pages | 23-24 |
Number of pages | 2 |
ISBN (Electronic) | 9781450357647 |
DOIs | |
Publication status | Published - 6 Jul 2018 |
Event | 2018 Genetic and Evolutionary Computation Conference, GECCO 2018 - Kyoto, Japan Duration: 15 Jul 2018 → 19 Jul 2018 |
Conference
Conference | 2018 Genetic and Evolutionary Computation Conference, GECCO 2018 |
---|---|
Country/Territory | Japan |
City | Kyoto |
Period | 15/07/18 → 19/07/18 |
Keywords
- Classification
- Feature construction
- Genetics