X-BIO: Cross-Architecture and Energy-Efficient Parallel Metaheuristics for Bioinformatics

Problems

Descripción del problema

Phylogenetic inference deals with the identification of the evolutionary events that motivated the diversity observed in the molecular characteristics of current organisms or genes [War17]. Evolutionary relationships are illustrated in the shape of tree structures T = (V,E), known as phylogenetic trees. The node set V contains: 1) internal nodes describing hypothetical ancestral organisms, and 2) leaf nodes representing the extant organisms under study. The phylogenetic topology is consequently defined by the node-to-node relationships specified in the branch set E. Multiple applications (e.g. conservation biology, forensics, drug discovery, and epidemiology [ABA+21]) shows the relevance of the information discovered by phylogenetic analyses. In fact, this problem is considered not only a key topic in molecular biology but also a major computational challenge in computer science.

Finding optimal phylogenetic trees implies the exploration of a complex search space that grows exponentially with the number N of input sequences. More specifically, the number of possible candidate solutions is given by the double factorial (2N−5)!!, thus surpassing Eddington number for only N = 50. Nowadays, the application of third-generation sequencing tools has given rise to complex genome-scale datasets with several orders of magnitude larger than traditional datasets. The need to find evolutionary relationships derived from such genome-scale data has motivated a major turning point in the field - from phylogenetics to phylogenomics, where a new variety of computational challenges arise [YG20]. We will herein focus on two main subproblems: phylogenetic placement and missing data imputation.

Phylogenetic placement: Given a reference phylogenetic tree T, inferred from N aligned sequences, and a set of K query sequences, phylogenetic placement consists of determining the most likely location of the organisms characterized in the query sequences within the reference tree. The output of a placement algorithm is a probability distribution that denotes the degree of evolutionary proximity between the clades of the reference tree and the query sequence. Applications in ecological profile estimation and studies of microbial diseases and the identification of low-coverage viral strain genomes denote the importance of this problem [CSD+22]. In fact, phylogenetic placement provided the means to update the viral phylogeny with SARS-CoV-2 strains [TTH+21].

The attainment of optimal placements is a complex issue, since the search algorithms must deal with a high number of query sequences (9,437 for SARS-CoV-2 [TTH+21]) and large reference trees with hundreds of thousands of leaves [CSD+22]. The likelihood function L(T, θ) can be employed to measure the quality of the attachments. However, the use of this criterion in large-scale scenarios is constrained by its intensive computational costs and memory requirements [BS21]. Alternatively, distance-based criteria e.g. least squares can be adopted to soften such computational restrictions. The goal is to minimize the weighted least-square difference between the distances across organisms in the tree and the distances observed in the sequences. Although faster placements can be attained through this method, distance-based criteria are very sensitive to the presence of missing data, the second subproblem to tackle.

Missing data imputation: Data missingness occurs when the nucleotides associated to a genetic locus for an organism are not successfully sequenced. Incomplete data assembly, assembly errors, and redundancy are some of the problems that give rise to missing data in sequence alignments, a situation that is commonly encountered in real-world analyses [KSM+17]. In the case of distance-based phylogenetic methods, the presence of unknown data in the sequences can lead to failures in the calculation of the sequence distance matrix. If the data available for two organisms i,j do not overlap in any subsequence, the entry of the associated dissimilarity matrix for i,j cannot be directly calculated, leading to missing distances. Figure 2 shows an example of this situation.

The probability of inferring wrong evolutionary relationships increases when missing data is encountered in large sparse matrices [RBP13], thus justifying the interest in applying data imputation and optimization approaches. Likelihood and least-squares measurements can be adopted to guide the search towards satisfying imputations. However, the performance and convergence speed of current phylogenetic imputation methods highly depends on the percentage of missing data [BB20], which accounts up to 80% in phylogenomic scenarios. Novel approaches are therefore needed to address this issue in current real-world analyses.

State of the Art

There exist some good references giving a general perspective about different aspects of bioinformatics, metaheuristics, or parallel computing. However, we focus on the background and state of the art tightly related with the bioinformatics problems that we will tackle.

In the general case of phylogenetic inference, the literature gives account of the strong relationship between HPC and phylogenetic search methods. Two of the most important methods for maximum likelihood inference, RAxML-NG [KDF+19] and IQ-TREE 2 [MSC+20], implement parallel heuristic schemes for CPU architectures, combining POSIX or OpenMP threads with MPI message passing and AVX SIMD extensions. The matOptimize tool mixes MPI with Thread Building Blocks to enable phylogenetic analyses on Intel CPU clusters [YTH+22]. FPGA implementations of the phylogenetic likelihood function were proposed in [ABA+21]. FPGAs were also employed to define near-memory processing models for phylogenetics, using Xilinx Vivado HLS [ASP+21]. NVIDIA GPUs were explored to accelerate popular software like MrBayes [PSR+15], which implements parallel Bayesian inference using CUDA. Furthermore, in [SVS22] we explored the definition of metaheuristic approaches for phylogenetic reconstruction, enabling heterogeneous computing supported on NVIDIA GPUs through different parallel schemes based on MPI, OpenMP, and CUDA.

These previous methods are tailored to specific hardware (CPU, GPU, FPGA…) or vendor technology (Intel, NVIDIA, Xilinx…), so they do not provide a unified solution to exploit the current spectrum of heterogeneous resources. Moreover, since cross-architecture approaches have not been implemented, the state-of-the-art parallel tools are not portable to other configurations or platforms (e.g. AMD GPUs), thus imposing restrictions on the specific hardware to be used. To the best of our knowledge, the only research that considers the cross-architecture question in the field is due to Ayres et al. [ACB+19], who proposed a library to calculate the likelihood function with OpenCL. However, this library only exploits the data parallelism exhibited by this particular function at the sequence level, so other sources of parallelism (e.g. parallel topological rearrangements, parallel independent runs and bootstrapping) are not considered. Moreover, energy efficiency has played a secondary role in this area (only investigated in the context of FPGAs [ASP+21]). In-depth research on cross-architecture and energy-aware algorithms for phylogenetics is therefore required.

Regarding phylogenetic placement, the vast majority of approaches in the literature are devised for execution in CPUs. For example, EPA-NG [BKC+19] conducts phylogenetic placements based on likelihood for multicore CPUs and clusters by implementing two levels of parallelism: MPI to distribute independent query sequences and OpenMP to parallelize the placement calculations for a given query sequence. This approach was refined in [BS21] to reduce memory footprint, observing memory savings up to 96% at the expense of increasing execution times. The RAPPAS tool [LSP19] offered an alternative approach based on the identification of k-mers in the reference sequences, without implementing explicit parallel strategies. APPLES-2 [BJR+22] implements a least-squares algorithm to handle placements in large trees, integrating job-level parallelization in which each query is independently handled by a different CPU thread. A similar job-level parallel approach is implemented in App-SpaM, an alignment-free placement algorithm for short sequencing reads [BM21]. Alternatively, the UShER tool [TTH+21] adopts the phylogenetic parsimony function to guide the method, using CPU threads to parallelize independent Fitch-Sankoff computations. Finally, SCAMPP [WCW22] defines a serial heuristic based on Hamming distances to identify promising attachment points prior to the execution of a user-defined placement tool.

It can be concluded that most placement algorithms only incorporate basic parallelization strategies (or no parallelization at all), targeting CPU devices for execution. An exception to this rule is observed in [JBZ+22], where neural networks were adopted to learn patterns from the reference tree and the sequences, calculating embeddings that were later employed for query placement. To accelerate the network training, NVIDIA 2080Ti GPUs were used. Other hardware platforms (e.g. FPGAs) and energy-aware heterogeneous methods that efficiently orchestrate multiple devices are still unexplored. An additional interesting aspect is given by the fact that multiobjective optimization has never been investigated in this problem, thus opening the door for the definition of parallel multiobjective metaheuristics that perform placements attending to different criteria (e.g. likelihood and least-squares) simultaneously.

As for the missing data imputation problem, two main classes of methods can be distinguished in the state of the art: 1) optimization methods and 2) machine learning approaches. Among the proposed optimization methods, we can highlight tools like LASSO [KDR+15], which defines heuristic strategies to infer rooted phylogenetic trees from distance matrices with missing values, under the molecular clock assumption. Rphylopars imputes missing observations by optimizing the log-likelihood of trait covariance using the available data [GBA17]. DAMBE [Xia18] uses a downhill simplex method based on the least-squares criterion for imputing distances. Mixed integer non-linear programming is employed in imPhy to infer missing distances in a set of gene trees [YVY+20]. Regarding machine learning approaches, it can be highlighted the use of matrix factorization, which was initially proposed to fill gaps in sparse trait matrices [SKS+15] and later extended to handle other sorts of missing data, such as missing distances [BB20]. In this last work, it was also explored the use of autoencoders to reconstruct phylogenies from partial distance matrices, showing that machine learning approaches are able to handle matrices with over 50% of missing entries. We proposed in [PSI22] a random forest method guided by the least-squares criterion, which allowed the imputation of lost phylogenetic distances in scenarios with >60% of missing data.

These two classes of methods have different advantages and drawbacks, as shown in previous research [PSI22]. The optimization algorithms allow fast imputation of missing data. However, some of these methods are constrained by assumptions that may not be verified in real-world data (e.g. molecular clocks in LASSO). More importantly, the current optimization methods cannot lead to satisfying solutions in scenarios with >20% of missing data. On the other side, the machine learning tools can effectively conduct imputations with larger missing data percentages, at the expense of prohibitive execution times. For a moderate dataset with 201 sequences, the times required vary from 1.5h (autoencoder) to more than 48h (matrix factorization). It is worth noting that these state-of-the-art tools do not incorporate explicit parallelization strategies. Consequently, the development of cross-architecture and energy-aware parallel methods represents an open research issue in this problem. The combination of metaheuristics, specialized operators, and other search techniques is a promising strategy to overcome the deficiencies of the optimization methods. Finally, the use of multiobjective optimization to handle imputations under several criteria is also yet to be investigated.

References

General Research Aspects and Phylogenetic Inference

[ABA+21]	Alachiotis, N., Brokalakis, A., Amourgianos, V., Ioannidis, S., Malakonakis, P., Bokalidis, T. (2021) Accelerating Phylogenetics Using FPGAs in the Cloud. IEEE Micro 41(4), 24-30 (http://doi.org/10.1109/MM.2021.3075848)
[ASP+21]	Alachiotis, N., Skrimponis, P., Pissadakis, M., Pnevmatikatos, D. (2021) Scalable Phylogeny Reconstruction with Disaggregated Near-memory Processing. ACM Trans. Reconfigurable Technol. Syst. 15(3), Art. 25, 1-32 (http://doi.org/10.1145/3484983)
[ACB+19]	Ayres, D. L., Cummings, M. P., Baele, G. et al. (2019) BEAGLE 3: Improved performance, scaling, and usability for a high-performance computing library for statistical phylogenetics. Systematic Biology 68(6), 1052–1061 (http://doi.org/10.1093/sysbio/syz020)
[KDF+19]	Kozlov, A. M., Darriba, D., Flouri, T., Morel, B., Stamatakis, A. (2019) RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics 35(21), 4453-4455 (http://doi.org/10.1093/bioinformatics/btz305)
[MSC+20]	Minh, B. Q., Schmidt, H. A., Chernomor, O., Schrempf, D., Woodhams, M. D., von Haeseler, A., Lanfear, R. (2020) IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 37(5), 1530-1534 (http://doi.org/10.1093/molbev/msaa015)
[MBB22]	Muralidhar, R., Borovica-Gajic, R., Buyya, R. (2022) Energy Efficient Computing Systems: Architectures, Abstractions and Modeling to Techniques and Standards. ACM Computing Surveys 54(11s), Art. 236, 1-37 (http://doi.org/10.1145/3511094)
[NB21]	Nozal, R., Bosque, J. L. (2021) Exploiting Co-execution with OneAPI: Heterogeneity from a Modern Perspective. In: L. Sousa et al. (Eds): Euro-Par 2021, LNCS 12820, 501-516 (http://doi.org/10.1007/978-3-030-85665-6_31)
[PSR+15]	Pang, S., Stones, J. R., Ren, M.-M. et al. (2015) GPU MrBayes V3.1: MrBayes on Graphics Processing Units for Protein Sequence Data. Mol. Biol. Evol. 32(9), 2496-2497 (http://doi.org/10.1093/molbev/msv129)
[RAB+21]	Reinders, J., Ashbaugh, B, Brodman, J., Kinsner, M., Pennycook, J., Tian, X. (2021). Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL. Apress Open, Springer, (http://doi.org/10.1007/978-1-4842-5574-2)
[SVS22]	Santander-Jiménez, S., Vega-Rodríguez, M. A., Sousa L. (2022) Exploiting Multi-level Parallel Metaheuristics and Heterogeneous Computing to Boost Phylogenetics. Future Generation Computer Systems 127, 208-224 (http://doi.org/10.1016/j.future.2021.09.011)
[SDS+20]	Schwartz, R., Dodge, J., Smith, N. A., Etzioni, O. (2020) Green AI. Communications of the ACM 63(12), 54-63 (http://doi.org/10.1145/3381831)
[War17]	Warnow, T. (2017) Computational Phylogenetics: An Introduction to Designing Methods for Phylogeny Estimation. Cambridge University Press (http://doi.org/10.1017/9781316882313)
[YTH+22]	Ye, C., Thornlow, B., Hinrichs, A., et al. (2022) matOptimize: a parallel tree optimization method enables online phylogenetics for SARS-CoV-2. Bioinformatics 38(15), 3734-3740 (http://doi.org/10.1093/bioinformatics/btac401)
[YG20]	Young, A. D., Gillung, J. P. (2020) Phylogenomics – principles, opportunities and pitfalls of big-data phylogenetics. Systematic Entomology 45(2), 225-247 (http://doi.org/10.1111/syen.12406)

Phylogenetic Placement Problem

[BJR+22]	Balaban, M., Jiang, Y., Roush, D., Zhu, Q., Mirarab, S. (2022) Fast and accurate distance-based phylogenetic placement using divide and conquer. Mol. Ecol. Resour. 22(3), 1213-1227 (http://doi.org/10.1111/1755-0998.13527)
[BKC+19]	Barbera, P., Kozlov, A. M., Czech, L., Morel, B., Darriba, D., Flouri, T., Stamatakis, A. (2019) EPA-ng: Massively Parallel Evolutionary Placement of Genetic Sequences. Systematic Biology 68(2), 365-369 (http://doi.org/10.1093/sysbio/syy054)
[BS21]	Barbera, P., Stamatakis, A. (2021) Efficient Memory Management in Likelihood-based Phylogenetic Placement. In: 2021 IEEE IPDPS Workshops (IPDPSW), 218-227 (http://doi.org/10.1109/IPDPSW52791.2021.00041)
[BM21]	Blanke, M., Morgenstern, B. (2021) App-SpaM: phylogenetic placement of short reads without sequence alignment. Bioinformatics Advances 1(1), Art. vbab027, 1-9 (http://doi.org/10.1093/bioadv/vbab027)
[CSD+22]	Czech, L., Stamatakis, A., Dunthorn, M., Barbera, P. (2022) Metagenomic Analysis Using Phylogenetic Placement - A Review of the First Decade. Frontiers in Bioinform. 2, 871393, 1-25 (http://doi.org/10.3389/fbinf.2022.871393)
[JBZ+22]	Jiang, Y., Balaban, M., Zhu, Q., Mirarab, S. (2022) DEPP: Deep Learning Enables Extending Species Trees using Single Genes. Systematic Biology, Art. syac031, 1-63 (http://doi.org/10.1093/sysbio/syac031)
[LSP19]	Linard, B., Swenson, K., Pardi, F. (2019) Rapid alignment-free phylogenetic identification of metagenomic sequences. Bioinformatics 35(18), 3303-3312 (http://doi.org/10.1093/bioinformatics/btz068)
[TTH+21]	Turakhia. Y, Thornlow, B., Hinrichs, A. S. et al. (2021) Ultrafast Sample placement on Existing tRees (UShER) enables real-time phylogenetics for the SARS-CoV-2 pandemic. Nature Genetics 53(6), 809–816 (http://doi.org/10.1038/s41588-021-00862-7)
[WCW22]	Wedell, E., Cai, Y., Warnow, T. (2022) SCAMPP: Scaling Alignment-based Phylogenetic Placement to Large Trees. IEEE/ACM Trans. Comput. Biol. Bioinform. (Early Access), 1-14 (http://doi.org/10.1109/TCBB.2022.3170386)

Missing Data Imputation Problem

[BB20]	Bhattacharjee, A., Bayzid, M. S. (2020) Machine learning based imputation techniques for estimating phylogenetic trees from incomplete distance matrices. BMC Genomics 21, Art. 497, 1–14 (http://doi.org/10.1186/s12864-020-06892-5)
[GBA17]	Goolsby, E. W., Bruggeman, J., Ané, C. (2017) Rphylopars: fast multivariate phylogenetic comparative methods for missing data and within-species variation. Methods Ecol. Evol. 8(1), 22-27 (http://doi.org/10.1111/2041-210X.12612)
[KDR+15]	Kettleborough, G., Dicks, J., Roberts, I. N., Huber, K. T. (2015) Reconstructing (Super)Trees from Data Sets with Missing Distances: Not All Is Lost. Mol. Biol. Evol. 32(6), 1628–1642 (http://doi.org/10.1093/molbev/msv027)
[KSM+17]	Kocot, K. M., Struck, T. H., Merkel, J. et al. (2017) Phylogenomics of Lophotrochozoa with Consideration of Systematic Error. Systematic Biology 66(2), 256-282 (http://doi.org/10.1093/sysbio/syw079)
[PSI22]	Pinheiro, D., Santander-Jiménez, S., Ilic, A. (2022) PhyloMissForest: a random forest framework to construct phylogenetic trees with missing data. BMC Genomics 23, Art. 377, 1-21 (http://doi.org/10.1186/s12864-022-08540-6)
[RBP13]	Roure, B., Baurain, D., Philippe, H. (2013) Impact of Missing Data on Phylogenies Inferred from Empirical Phylogenomic Data Sets. Mol. Biol. Evol. 30(1): 197–214 (http://doi.org/10.1093/molbev/mss208)
[SKS+15]	Schrodt, F., Kattge, J., Shan, H. et al. (2015) BHPMF – a hierarchical Bayesian approach to gap-filling and trait prediction for macroecology and functional biogeography. Glob. Ecol. Bio. 24, 1510-1521 (http://doi.org/10.1111/geb.12335)
[Xia18]	Xia, X. (2018) Imputing missing distances in molecular phylogenetics. PeerJ 6, Art. e5321, 1–17 (http://doi.org/10.7717/peerj.5321)
[YVY+20]	Yasui, N., Vogiatzis, C., Yoshida, R., Fukumizu, K. (2020) imPhy: Imputing Phylogenetic Trees with Missing Information Using Mathematical Programming. IEEE/ACM Trans. Comput. Biol. Bioinform. 17(4), 1222-1230 (http://doi.org/10.1109/TCBB.2018.2884459)

Scientific Production

[RVS25]	"A Multi-Objective Artificial Bee Colony Approach for Identifying Cancer Driver Pathways". Fernando M. Rodríguez-Bejarano, Miguel A. Vega-Rodríguez, Sergio Santander-Jiménez. Expert Systems With Applications, Volume 275, 127071, Pergamon-Elsevier Science, Oxford, England, UK, 2025, pp. 1-16, ISSN: 0957-4174. DOI: 10.1016/j.eswa.2025.127071. (JCR impact factor = 7.5 in 2023, Quartile = Q1)
[GVP+25]	"A Keyword Extraction Model Study in the Movie Domain with Synopsis and Reviews". Carlos González-Santos, Miguel A. Vega-Rodríguez, Carlos J. Pérez, Iñaki Martínez-Sarriegui, Joaquín M. López-Muñoz. Knowledge and Information Systems, Springer, London, England, 2025, pp. 1-23, ISSN: 0219-1377. DOI: 10.1007/s10115-025-02350-4. (JCR impact factor = 2.5 in 2023, Quartile = Q2)
[RVS24]	"GADPO: Genetic Algorithm based on Dominance for Primer Optimization". Fernando M. Rodríguez-Bejarano, Miguel A. Vega-Rodríguez, Sergio Santander-Jiménez. Expert Systems With Applications, Volume 238, Part D, 122206, Pergamon-Elsevier Science, Oxford, England, UK, 2024, pp. 1-15, ISSN: 0957-4174. DOI: 10.1016/j.eswa.2023.122206. (JCR impact factor = 7.5 in 2023, Quartile = Q1)
[LRG24]	"A Simple yet Effective Greedy Evolutionary Strategy for RNA Design". Nuria Lozano-García, Álvaro Rubio-Largo, José Maria Granado-Criado. IEEE Transactions on Evolutionary Computation (Early Access), IEEE, Piscataway, NJ, USA, 2024, pp. 1-13, ISSN: 1089-778X. DOI: 10.1109/TEVC.2024.3461509. (JCR impact factor = 11.7 in 2023, Quartile = Q1)
[REL+24]	"Evolutionary Strategy to Enhance an RNA Design Tool Performance". Álvaro Rubio-Largo, Laura Escobar-Encinas, Nuria Lozano-García, José M. Granado-Criado. IEEE Access, Volume 12, IEEE, Piscataway, NJ, USA, 2024, pp. 15582-15593, ISSN: 2169-3536. DOI: 10.1109/ACCESS.2024.3358426. (JCR impact factor = 3.4 in 2023, Quartile = Q2)
[TGC24]	"A new approach to minimize the economic cost of the deck in concrete slab bridges by means of metaheuristics". Jesús A. Torrecilla-Pinero, Juan A. Gómez-Pulido, Enrique Cortés-Toro. Computers and Concrete, Volume 34, No. 6, Techno-Press, Daejeon, Korea, 2024, pp. 737-750, ISSN: 1598-8198. DOI: 10.12989/cac.2024.34.6.737. (JCR impact factor = 2.9 in 2023, Quartile = Q2)
[GRG+24]	"Industrial Internet of Things Embedded Devices Fault Detection and Classification. A Case Study". Alberto Garcés-Jiménez, André Rodrigues, José M. Gómez-Pulido, Duarte Raposo, Juan A. Gómez-Pulido, Jorge Sá Silva, Fernando Boavida. Internet of Things, Volume 25, 101042, Elsevier, Amsterdam, The Netherlands, 2024, pp. 1-19, ISSN: 2542-6605. DOI: 10.1016/j.iot.2023.101042 (JCR impact factor = 6.0 in 2023, Quartile = Q1)
[SGR24]	"Machine Learning Applied to Tourism: A Systematic Review". José Carlos Sancho Núñez, Juan A. Gómez-Pulido, Rafael Robina Ramírez. Wiley Interdisciplinary Reviews - Data Mining and Knowledge Discovery, Volume 14, Issue 5, e1549, Wiley, San Francisco, CA, USA, 2024, pp. 1-35, ISSN: 1942-4787. DOI: 10.1002/widm.1549 (JCR impact factor = 6.4 in 2023, Quartile = Q1)
[NIS+24]	"IPU-EpiDet: Identifying Gene Interactions on Massively Parallel Graph-Based AI Accelerators". Ricardo Nobre, Aleksandar Ilic, Sergio Santander-Jiménez, Leonel Sousa. 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, San Francisco, CA, USA, 2024, pp. 631-643. ISBN: 979-8-3503-8711-7. DOI: 10.1109/IPDPS57955.2024.00062.
[GG24]	"KNXsim: Simulator Tool for KNX Home Automation Training by Means of Group Addresses". Juan A. Gómez-Pulido, Alberto Garcés-Jiménez. Simulation Tools and Techniques. SIMUtools 2023. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 519. Springer, Switzerland, 2024, pp. 34 -43. ISBN: 978-3-031-57522-8. DOI: 10.1007/978-3-031-57523-5_3.
[GVS24]	"Aproximación Multiobjetivo basada en Búsqueda de Entorno Variable para la Codificación de Proteínas". Belen Gonzalez-Sanchez, Miguel A. Vega-Rodríguez, Sergio Santander-Jiménez. Actas del XV Congreso Español de Metaheurísticas, Algoritmos Evolutivos y Bioinspirados (MAEB 2024), parte de CAEPIA 2024, Amparo Alonso et al. (editores), Asociación Española para la Inteligencia Artificial (AEPIA), Coruña, Spain, 2024, pp. 403-408. ISBN: 978-84-09-62724-0.
[GVS24_2]	"Inteligencia de Enjambre Multiobjetivo para Codificar una Proteína con Múltiples Genes". Belen Gonzalez-Sanchez, Miguel A. Vega-Rodríguez, Sergio Santander-Jiménez. Actas de la XX Conferencia de la Asociación Española para la Inteligencia Artificial (CAEPIA 2024), Amparo Alonso et al. (editores), Asociación Española para la Inteligencia Artificial (AEPIA), Coruña, Spain, 2024, pp. 53-58. ISBN: 978-84-09-62724-0.
[MV24]	"Ensamble basado en Boosting para Mejorar el Alineamiento de Redes de Proteínas". Manuel Menor-Flores, Miguel A. Vega-Rodríguez. Actas de la XX Conferencia de la Asociación Española para la Inteligencia Artificial (CAEPIA 2024), Amparo Alonso et al. (editores), Asociación Española para la Inteligencia Artificial (AEPIA), Coruña, Spain, 2024, pp. 23-28. ISBN: 978-84-09-62724-0.
[MV24_2]	"Un Algoritmo Multiobjetivo para el Alineamiento de Redes de Proteínas". Manuel Menor-Flores, Miguel A. Vega-Rodríguez. Actas del XV Congreso Español de Metaheurísticas, Algoritmos Evolutivos y Bioinspirados (MAEB 2024), parte de CAEPIA 2024, Amparo Alonso et al. (editores), Asociación Española para la Inteligencia Artificial (AEPIA), Coruña, Spain, 2024, pp. 397-402. ISBN: 978-84-09-62724-0.
[SV24]	"Esquemas de Orquestación Heterogénea para Inferencia Filogenética". Sergio Santander-Jiménez, Miguel A. Vega-Rodríguez. Avances en Arquitectura y Tecnología de Computadores. Actas de las Jornadas SARTECO 2024, Margarita Amor et al. (editores), Sociedad Española de Arquitectura y Tecnología de Computadores (SARTECO), Coruña, Spain, 2024, pp. 145-152. ISBN: 978-84-09-61749-4.
[SVG+24]	"Explorando Metodologías de Evaluación por Pares Offline y Online en Ingeniería Informática". Sergio Santander-Jiménez, Miguel A. Vega-Rodríguez, José M. Granado-Criado, Álvaro Rubio-Largo, Juan A. Gómez-Pulido, César Gómez-Martín, Arturo Durán-Domínguez. Actas de las Jornadas sobre Enseñanza Universitaria de la Informática, José Antonio Cruz Lemus et al. (editores), Asociación de Enseñantes Universitarios de la Informática (AENUI), Coruña, Spain, 2024, pp. 181-188.

PID2022-137275NA-I00

Cross-Architecture and Energy-Efficient Parallel Metaheuristics for Bioinformatics (X-BIO)

Goals

Problems

Descripción del problema

State of the Art

References

Who We Are

Scientific Production