Swarm intelligence for optimizing the parameters of multiple sequence aligners

Álvaro Rubio-Largo, Leonardo Vanneschi, Mauro Castelli, Miguel A. Vega-Rodríguez

Research output: Contribution to journalArticle

  • 1 Citations

Abstract

Different aligner heuristics can be found in the literature to solve the Multiple Sequence Alignment problem. These aligners rely on the parameter configuration proposed by their authors (also known as default parameter configuration), that tried to obtain good results (alignments with high accuracy and conservation) for any input set of unaligned sequences. However, the default parameter configuration is not always the best parameter configuration for every input set; namely, depending on the biological characteristics of the input set, one may be able to find a better parameter configuration that outputs a more accurate and conservative alignment. This work's main contributions include: to study the input set's biological characteristics and to then apply the best parameter configuration found depending on those characteristics. The framework uses a pre-computed file to take the best parameter configuration found for a dataset with similar biological characteristics. In order to create this file, we use a Particle Swarm Optimization (PSO) algorithm, that is, an algorithm based on swarm intelligence. To test the effectiveness of the characteristic-based framework, we employ five well-known aligners: Clustal W, DIALIGN-TX, Kalign2, MAFFT, and MUSCLE. The results of these aligners see clear improvements when using the proposed characteristic-based framework.

LanguageEnglish
Pages16-28
JournalSwarm and Evolutionary Computation
Volume42
Early online date1 Jan 2018
DOIs
StatePublished - 1 Oct 2018

Fingerprint

Swarm Intelligence
Configuration
Particle swarm optimization (PSO)
Conservation
Alignment
Multiple Sequence Alignment
Swarm intelligence
Particle Swarm Optimization Algorithm
High Accuracy
Heuristics
Output
Framework

Keywords

  • Characteristics-based framework
  • Evolutionary algorithms
  • Multiple sequence alignment
  • Swarm intelligence

Cite this

@article{f05815f232a64b8d989edea264e31d87,
title = "Swarm intelligence for optimizing the parameters of multiple sequence aligners",
abstract = "Different aligner heuristics can be found in the literature to solve the Multiple Sequence Alignment problem. These aligners rely on the parameter configuration proposed by their authors (also known as default parameter configuration), that tried to obtain good results (alignments with high accuracy and conservation) for any input set of unaligned sequences. However, the default parameter configuration is not always the best parameter configuration for every input set; namely, depending on the biological characteristics of the input set, one may be able to find a better parameter configuration that outputs a more accurate and conservative alignment. This work's main contributions include: to study the input set's biological characteristics and to then apply the best parameter configuration found depending on those characteristics. The framework uses a pre-computed file to take the best parameter configuration found for a dataset with similar biological characteristics. In order to create this file, we use a Particle Swarm Optimization (PSO) algorithm, that is, an algorithm based on swarm intelligence. To test the effectiveness of the characteristic-based framework, we employ five well-known aligners: Clustal W, DIALIGN-TX, Kalign2, MAFFT, and MUSCLE. The results of these aligners see clear improvements when using the proposed characteristic-based framework.",
keywords = "Characteristics-based framework, Evolutionary algorithms, Multiple sequence alignment, Swarm intelligence",
author = "{\'A}lvaro Rubio-Largo and Leonardo Vanneschi and Mauro Castelli and Vega-Rodr{\'i}guez, {Miguel A.}",
note = "Rubio-Largo, {\'A}., Vanneschi, L., Castelli, M., & Vega-Rodr{\'i}guez, M. A. (2018). Swarm intelligence for optimizing the parameters of multiple sequence aligners. Swarm and Evolutionary Computation. DOI: 10.1016/j.swevo.2018.04.003",
year = "2018",
month = "10",
day = "1",
doi = "10.1016/j.swevo.2018.04.003",
language = "English",
volume = "42",
pages = "16--28",
journal = "Swarm and Evolutionary Computation",
issn = "2210-6502",
publisher = "Elsevier",

}

Swarm intelligence for optimizing the parameters of multiple sequence aligners. / Rubio-Largo, Álvaro; Vanneschi, Leonardo; Castelli, Mauro; Vega-Rodríguez, Miguel A.

In: Swarm and Evolutionary Computation, Vol. 42, 01.10.2018, p. 16-28.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Swarm intelligence for optimizing the parameters of multiple sequence aligners

AU - Rubio-Largo,Álvaro

AU - Vanneschi,Leonardo

AU - Castelli,Mauro

AU - Vega-Rodríguez,Miguel A.

N1 - Rubio-Largo, Á., Vanneschi, L., Castelli, M., & Vega-Rodríguez, M. A. (2018). Swarm intelligence for optimizing the parameters of multiple sequence aligners. Swarm and Evolutionary Computation. DOI: 10.1016/j.swevo.2018.04.003

PY - 2018/10/1

Y1 - 2018/10/1

N2 - Different aligner heuristics can be found in the literature to solve the Multiple Sequence Alignment problem. These aligners rely on the parameter configuration proposed by their authors (also known as default parameter configuration), that tried to obtain good results (alignments with high accuracy and conservation) for any input set of unaligned sequences. However, the default parameter configuration is not always the best parameter configuration for every input set; namely, depending on the biological characteristics of the input set, one may be able to find a better parameter configuration that outputs a more accurate and conservative alignment. This work's main contributions include: to study the input set's biological characteristics and to then apply the best parameter configuration found depending on those characteristics. The framework uses a pre-computed file to take the best parameter configuration found for a dataset with similar biological characteristics. In order to create this file, we use a Particle Swarm Optimization (PSO) algorithm, that is, an algorithm based on swarm intelligence. To test the effectiveness of the characteristic-based framework, we employ five well-known aligners: Clustal W, DIALIGN-TX, Kalign2, MAFFT, and MUSCLE. The results of these aligners see clear improvements when using the proposed characteristic-based framework.

AB - Different aligner heuristics can be found in the literature to solve the Multiple Sequence Alignment problem. These aligners rely on the parameter configuration proposed by their authors (also known as default parameter configuration), that tried to obtain good results (alignments with high accuracy and conservation) for any input set of unaligned sequences. However, the default parameter configuration is not always the best parameter configuration for every input set; namely, depending on the biological characteristics of the input set, one may be able to find a better parameter configuration that outputs a more accurate and conservative alignment. This work's main contributions include: to study the input set's biological characteristics and to then apply the best parameter configuration found depending on those characteristics. The framework uses a pre-computed file to take the best parameter configuration found for a dataset with similar biological characteristics. In order to create this file, we use a Particle Swarm Optimization (PSO) algorithm, that is, an algorithm based on swarm intelligence. To test the effectiveness of the characteristic-based framework, we employ five well-known aligners: Clustal W, DIALIGN-TX, Kalign2, MAFFT, and MUSCLE. The results of these aligners see clear improvements when using the proposed characteristic-based framework.

KW - Characteristics-based framework

KW - Evolutionary algorithms

KW - Multiple sequence alignment

KW - Swarm intelligence

UR - http://www.scopus.com/inward/record.url?scp=85046159910&partnerID=8YFLogxK

UR - http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=Alerting&SrcApp=Alerting&DestApp=WOS_CPL&DestLinkType=FullRecord&UT=WOS:000445716200002

U2 - 10.1016/j.swevo.2018.04.003

DO - 10.1016/j.swevo.2018.04.003

M3 - Article

VL - 42

SP - 16

EP - 28

JO - Swarm and Evolutionary Computation

T2 - Swarm and Evolutionary Computation

JF - Swarm and Evolutionary Computation

SN - 2210-6502

ER -