Abstract
In iterative supervised learning algorithms it is common to reach a point in the search where no further induction seems to be possible with the available data. If the search is continued beyond this point, the risk of overfitting increases significantly. Following the recent developments in inductive semantic stochastic methods, this paper studies the feasibility of using information gathered from the semantic neighborhood to decide when to stop the search. Two semantic stopping criteria are proposed and experimentally assessed in Geometric Semantic Genetic Programming (GSGP) and in the Semantic Learning Machine (SLM) algorithm (the equivalent algorithm for neural networks). The experiments are performed on real-world high-dimensional regression datasets. he results show that the proposed semantic stopping criteria are able to detect stopping points that result in a competitive generalization for both GSGP and SLM. h is approach also yields computationally efficient algorithms as it allows the evolution of neural networks in less than 3 seconds on average, and of GP trees in at most 10 seconds. he usage of the proposed semantic stopping criteria in conjunction with the computation of optimal mutation/learning steps also results in small trees and neural networks.
Original language | English |
---|---|
Title of host publication | GECCO 2017 - Proceedings of the 2017 Genetic and Evolutionary Computation Conference |
Publisher | ACM - Association for Computing Machinery |
Pages | 929-936 |
Number of pages | 8 |
ISBN (Electronic) | 9781450349208 |
DOIs | |
Publication status | Published - 1 Jul 2017 |
Event | 2017 Genetic and Evolutionary Computation Conference, GECCO 2017 - Berlin, Germany Duration: 15 Jul 2017 → 19 Jul 2017 |
Conference
Conference | 2017 Genetic and Evolutionary Computation Conference, GECCO 2017 |
---|---|
Country/Territory | Germany |
City | Berlin |
Period | 15/07/17 → 19/07/17 |
Keywords
- Generalization
- Geometric semantic genetic programming
- Overfitting
- Semantic learning machine
- Stopping criteria