TY - JOUR
T1 - A scalable genetic programming approach to integrate miRNA-target predictions
T2 - Comparing different parallel implementations of M3GP
AU - Beretta, Stefano
AU - Castelli, Mauro
AU - Muñoz, Luis
AU - Trujillo, Leonardo
AU - Martínez, Yuliana
AU - Popovič, Aleš
AU - Milanesi, Luciano
AU - Merelli, Ivan
N1 - Beretta, S., Castelli, M., Munoz, L., Trujillo, L., Martinez, Y., Popovic, A., ... Merelli, I. (2018). A Scalable Genetic Programming Approach to Integrate miRNA-Target Predictions: Comparing Different Parallel Implementations of M3GP. Complexity, [4963139]. DOI: 10.1155/2018/4963139
PY - 2018/1/1
Y1 - 2018/1/1
N2 - There are many molecular biology approaches to the analysis of microRNA (miRNA) and target interactions, but the experiments are complex and expensive. For this reason, in silico computational approaches able to model these molecular interactions are highly desirable. Although several computational methods have been developed for predicting the interactions between miRNA and target genes, there are substantial differences in the results achieved since most algorithms provide a large number of false positives. Accordingly, machine learning approaches are widely used to integrate predictions obtained from different tools. In this work, we adopt a method called multidimensional multiclass GP with multidimensional populations (M3GP), which relies on a genetic programming approach, to integrate and classify results from different miRNA-target prediction tools. The results are compared with those obtained with other classifiers, showing competitive accuracy. Since we aim to provide genome-wide predictions with M3GP and, considering the high number of miRNA-target interactions to test (also in different species), a parallel implementation of this algorithm is recommended. In this paper, we discuss the theoretical aspects of this algorithm and propose three different parallel implementations. We show that M3GP is highly parallelizable, it can be used to achieve genome-wide predictions, and its adoption provides great advantages when handling big datasets.
AB - There are many molecular biology approaches to the analysis of microRNA (miRNA) and target interactions, but the experiments are complex and expensive. For this reason, in silico computational approaches able to model these molecular interactions are highly desirable. Although several computational methods have been developed for predicting the interactions between miRNA and target genes, there are substantial differences in the results achieved since most algorithms provide a large number of false positives. Accordingly, machine learning approaches are widely used to integrate predictions obtained from different tools. In this work, we adopt a method called multidimensional multiclass GP with multidimensional populations (M3GP), which relies on a genetic programming approach, to integrate and classify results from different miRNA-target prediction tools. The results are compared with those obtained with other classifiers, showing competitive accuracy. Since we aim to provide genome-wide predictions with M3GP and, considering the high number of miRNA-target interactions to test (also in different species), a parallel implementation of this algorithm is recommended. In this paper, we discuss the theoretical aspects of this algorithm and propose three different parallel implementations. We show that M3GP is highly parallelizable, it can be used to achieve genome-wide predictions, and its adoption provides great advantages when handling big datasets.
UR - http://www.scopus.com/inward/record.url?scp=85062830331&partnerID=8YFLogxK
U2 - 10.1155/2018/4963139
DO - 10.1155/2018/4963139
M3 - Article
AN - SCOPUS:85062830331
SN - 1076-2787
VL - 2018
JO - Complexity
JF - Complexity
M1 - 4963139
ER -