On the generalization ability of geometric semantic genetic programming

Ivo Gonçalves, Sara Silva, Carlos M. Fonseca

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

40 Citations (Scopus)

Abstract

Geometric Semantic Genetic Programming (GSGP) is a recently proposed form of Genetic Programming (GP) that searches directly the space of the underlying semantics of the programs. The fitness landscape seen by the GSGP variation operators is unimodal with a linear slope by construction and, consequently, easy to search. Despite this advantage, the offspring produced by these operators grow very quickly. A new implementation of the same operators was proposed that computes the semantics of the offspring without having to explicitly build their syntax. This allowed GSGP to be used for the first time in real-life multidimensional datasets. GSGP presented a surprisingly good generalization ability, which was justified by some properties of the geometric semantic operators. In this paper, we show that the good generalization ability of GSGP was the result of a small implementation deviation from the original formulation of the mutation operator, and that without it the generalization results would be significantly worse. We explain the reason for this difference, and then we propose two variants of the geometric semantic mutation that deterministically and optimally adapt the mutation step. They reveal to be more efficient in learning the training data, and they also achieve a competitive generalization in only a single operator application. This provides a competitive alternative when performing semantic search, particularly since they produce small individuals and compute fast.

Original languageEnglish
Title of host publicationGenetic Programming - 18th European Conference, EuroGP 2015, Proceedings
PublisherSpringer Verlag
Pages41-52
Number of pages12
Volume9025
ISBN (Electronic)9783319165004
DOIs
Publication statusPublished - 2015
Event18th European Conference on Genetic Programming, EuroGP 2015 - Copenhagen, Denmark
Duration: 8 Apr 201510 Apr 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9025
ISSN (Print)03029743
ISSN (Electronic)16113349

Conference

Conference18th European Conference on Genetic Programming, EuroGP 2015
Country/TerritoryDenmark
CityCopenhagen
Period8/04/1510/04/15

Keywords

  • Drug discovery
  • Generalization
  • Geometric semantic genetic programming
  • Overfitting
  • Pharmacokinetics

Fingerprint

Dive into the research topics of 'On the generalization ability of geometric semantic genetic programming'. Together they form a unique fingerprint.

Cite this