TY - JOUR
T1 - Semantic genetic programming for fast and accurate data knowledge discovery
AU - Castelli, Mauro
AU - Vanneschi, Leonardo
AU - Manzoni, Luca
AU - Popovič, Aleš
N1 - info:eu-repo/grantAgreement/FCT/3599-PPCDT/129038/PT#
Castelli, M., Vanneschi, L., Manzoni, L., & Popovič, A. (2016). Semantic genetic programming for fast and accurate data knowledge discovery. Swarm and Evolutionary Computation, 26, 1-7. https://doi.org/10.1016/j.swevo.2015.07.001-------The authors acknowledge project MassGP (PTDC/EEI-CTP/2975/2012), FCT, Portugal.
PY - 2016/2/1
Y1 - 2016/2/1
N2 - Big data knowledge discovery emerged as an important factor contributing to advancements in society at large. Still, researchers continuously seek to advance existing methods and provide novel ones for analysing vast data sets to make sense of the data, extract useful information, and build knowledge to inform decision making. In the last few years, a very promising variant of genetic programming was proposed: geometric semantic genetic programming. Its difference with the standard version of genetic programming consists in the fact that it uses new genetic operators, called geometric semantic operators, that, acting directly on the semantics of the candidate solutions, induce by definition a unimodal error surface on any supervised learning problem, independently from the complexity and size of the underlying data set. This property should improve the evolvability of genetic programming in presence of big data and thus makes geometric semantic genetic programming an extremely promising method for mining vast amounts of data. Nevertheless, to the best of our knowledge, no contribution has appeared so far to employ this new technology to big data problems. This paper intends to fill this gap. For the first time, in fact, we show the effectiveness of geometric semantic genetic programming on several complex real-life problems, characterized by vast amounts of data, coming from several different application domains.
AB - Big data knowledge discovery emerged as an important factor contributing to advancements in society at large. Still, researchers continuously seek to advance existing methods and provide novel ones for analysing vast data sets to make sense of the data, extract useful information, and build knowledge to inform decision making. In the last few years, a very promising variant of genetic programming was proposed: geometric semantic genetic programming. Its difference with the standard version of genetic programming consists in the fact that it uses new genetic operators, called geometric semantic operators, that, acting directly on the semantics of the candidate solutions, induce by definition a unimodal error surface on any supervised learning problem, independently from the complexity and size of the underlying data set. This property should improve the evolvability of genetic programming in presence of big data and thus makes geometric semantic genetic programming an extremely promising method for mining vast amounts of data. Nevertheless, to the best of our knowledge, no contribution has appeared so far to employ this new technology to big data problems. This paper intends to fill this gap. For the first time, in fact, we show the effectiveness of geometric semantic genetic programming on several complex real-life problems, characterized by vast amounts of data, coming from several different application domains.
KW - Genetic programming
KW - Knowledge discovery
KW - Semantics
UR - http://www.scopus.com/inward/record.url?scp=84959566827&partnerID=8YFLogxK
U2 - 10.1016/j.swevo.2015.07.001
DO - 10.1016/j.swevo.2015.07.001
M3 - Article
AN - SCOPUS:84959566827
SN - 2210-6502
VL - 26
SP - 1
EP - 7
JO - Swarm and Evolutionary Computation
JF - Swarm and Evolutionary Computation
ER -