DSBA
MCTI/CNPq/CT-Biotec nº 30/2022
Data Science for Biotechnology Applications
Solving large-scale challenges using explainable machine learning, metaheuristics, and high-performance computing
This project aims to develop new bioinformatics tools based on Machine Learning methods (supervised and unsupervised), heuristic search methods, and high-performance computing to explore high-dimensional data in problems of scientific and economic interest in the area of human and animal health. We will develop: (i) algorithms based on adaptive and multiobjective metaheuristics; (ii) multimodal metaheuristics; (iii) time series-based metaheuristics; (iv) combinatorial optimization; (v) interpretable machine learning methods; (vi) algorithms for feature extraction and selection; and (vii) combination of interpretability methods aiming at building general-purpose strategies that contribute to the analysis of large data with complex structure...
Animal Health
The use of bioinformatics tools in identifying molecular profiles of bacteria enables a precise and efficient approach to disease diagnosis. Furthermore, it fosters a deeper understanding of bacterial genetic diversity and facilitates well-informed clinical decision-making. In the field of animal health, researchers focus on studying bacteria of the genus Brucella, which cause a disease known as brucellosis. This disease, also called Malta fever or undulant fever, affects a wide range of mammals, exhibiting zoonotic and cosmopolitan characteristics and posing a significant risk to public health with substantial economic losses. Brucellosis can cause various symptoms, ranging from cold-like signs to complications in the nervous system, musculoskeletal system, and heart. In canines (affected by B. canis), nonspecific signs are observed, like those in humans, but reproductive failures and joint issues related to this bacterium are commonly diagnosed. Due to the diversity of clinical signs, diagnosing brucellosis in humans and animals presents a significant challenge, with underdiagnosis contributing to the spread of infection. Despite this, few genomic studies with different strains of B. canis have been developed so far. In this regard, there is a demand for more information, such as virulence factors, antimicrobial resistance genes, and the evolutionary profile of the pathogen, which can greatly contribute to decision-making in government responses to public health, as well as in storing and comparing data about this agent.
In the experimental front of this project, team members recently sequenced 20 B. canis genomes using two sequencing technologies (for obtaining short reads and long reads), which will contribute to the data used in solving this biological problem, along with 60 public genomes of B. canis and 160 public genomes of B. suis. This data will be analyzed by the computational tools developed in this proposal to identify species-specific genetic variations to serve as diagnostic markers for brucellosis. Interpretable machine learning algorithms will be employed to create a genotypic profile of virulent strains and differentiate them between species based on their phenotypic differences and antimicrobial susceptibility profiles.
Researchers
Graduate Students/Collaborators
- Cauê Scotti Luciano Rocha ITI - EC/UFRGS
- Lorenzo C. C. Novo ITI - BTC/UFRGS
Scholarship Students
- Cauê Scotti Luciano Rocha ITI - EC/UFRGS (2023-2024)
- Lorenzo C. C. Novo ITI - BTC/UFRGS (2023-2025)
Datasets
Tools
Publications
- Assessment of Kaistella jeonii esterase conformational dynamics in response to poly(ethylene terephthalate) binding PINTO, E. S. M.; MANGINI, A. T.; NOVO, L. C. C.; CAVATAO, F. G.; KRAUSE, M. J.; DORN, M. Current Research in Structural Biology, v. 7, p. 100130, 2024.
- Analysis and comparison of feature selection methods towards performance and stability BARBIERI, M. C.; GRISCI, B. I.; DORN, M. Expert Systems with Applications, v. 241, p. 1-30, 2024.
- Meta-analyses of host metagenomes from colorectal cancer patients reveal strong relationship between colorectal cancer-associated species ESCALONA, M. A. R.; POLONI, J. F.; KRAUSE, M. J.; DORN, M. Molecular Omics, v. 19, p. 429-444, 2023.
- The nucleotide excision repair proteins through the lens of molecular dynamics simulations PINTO, E. S. M.; KRAUSE, M. J.; DORN, M.; FELTES, B. C. DNA Repair, v. 127, p. 103510, 2023.
- Enhancing classification with hybrid feature selection: A multi-objective genetic algorithm for high-dimensional data BOHRER, J. S.; DORN, M. Expert Systems with Applications, v. 255, p. 124518, 2024.
- NIAS-Server 2.0: A versatile complementary tool for structural biology studies FELTES, B. C.; PINTO, E. S. M.; MANGINI, A. T.; DORN, M. Journal of computational Chemistry, v. 44, p. 1610-1623, 2023.
- Exploring bacterial diversity and antimicrobial resistance gene on a southern Brazilian swine farm TORRES, M. C.; BREYER, G. M.; ESCALONA, M. A. R.; MAYER, F. Q.; VARELA, A. P. M.; AZEVEDO, V. A. C.; DA COSTA, M. M.; ABURJAILE, F. G.; DORN, M.; BRENING, B.; CARDOSO, M. R. I.; SIQUEIRA, F. M. Environmental Pollution, v. 352, p. 124146, 2024.
- Functional response of microbial communities in lab-controlled oil-contaminated marine sediment JUNIOR, R. A.; POLONI, J. F.; ESCALONA, M. A. R.; DORN, M. Molecular Omics, v. 19, p. 756-768, 2023.
- Molecular Basis of MC1R Activation: Mutation-Induced Alterations in Structural Dynamics CAVATAO, F. G.; PINTO, E. S. M.; KRAUSE, M. J.; ALHO, C. S.; DORN, M. PROTEINS: Structure, Function, and Bioinformatics, v. 92, p. 1-15, 2024.
- Interdisciplinary Overview of Lipopeptide and Protein-Containing Biosurfactants JÚNIOR, R. A.; POLONI, J. F.; PINTO, E. S. M.; DORN, M. Genes, v. 14, p. 76, 2023.
- Ancestry resolution of South Brazilians by forensic 165 ancestry-informative SNPs panel FELKL, A. B.; AVILA, E.; GASTALDO, A. Z.; LINDHOLZ, C. G.; DORN, M.; ALHO, C. S. Forensic Science International Genetics, v. 64, p. 102838, 2023.
- Transcriptomic Analysis of Long Non-Coding RNA during Candida albicans Infection GONÇALVES, G. F.; POLONI, J. F.; DORN, M. Genes, v. 14, p. 251, 2023.
- Assessing feature scorer results on high-dimensional datasets with t-SNE GRISCI, B.; INOSTROZA-PONTA, M.; DORN, M. Neurocomputing, v. 652, p. 130561, 2025.
- CRYPTOMICSDB: Revealing the Molecular Landscape of 3 Cryptococcosis CALEGARI-ALVES, Y. P.; INNOCENTE-ALVES, C.; COSTA, R. P.; FAUSTINO, A. M.; FARIAS, K. S. S.; BOIANI, M.; GONCALVES, B. S. A.; DORN, M.; BEYS-DA-SILVA, W. O.; SANTI, L. Journal of Fungi, v. 11, p. 425, 2025.
- Synbiotic supplementation enhances memory processes in adult and aged male rats GONCALVES, D. A.; LUFT, J. G.; ESCALONA, M. A. R.; MANN, M. B.; FRAZZON, J.; DORN, M.; RAMPELOTTO, P. H.; ALVARES, L. O. Biogerontology, v. 26, p. 1, 2025.
- Mitochondrial haplogroup A2 is associated with increased COVID-19 mortality in an admixed Brazilian population TAVARES, G. M.; MISSAGGIA, B. O.; CADORE, N. A.; SBRUZZI, R. C.; FEIRA, M. F.; GIUDICELLI, G. C.; DE OLIVEIRA FAM, B. S.; RODRIGUES, M.; DORN, M.; HÜNEMEIER, T.; VIANNA, F. S. L.; BORTOLINI, M. C. Scientific Reports, v. 15, p. 22391, 2025.
- Lattice Boltzmann Simulation of Lauric Acid Melting in rectangular cavity with different fin configurations with OpenLB TACQUES FILHO, A. Q.; BINGERT, T. N.; KUMMERLANDER, A.; CZELUSNIAK, L. E.; KRAUSE, M. J. Energy Storage, v. 7, p. e70237, 2025.
- BR-FDP-EYE: Brazilian Forensic DNA eye phenotyping BEHRENS, L. M. P.; GONCALVES, C. E. I.; FERNANDES, G. S.; BOIANI, M.; SILVA, E. F. A.; BICALHO, M. G.; ALHO, C. S.; DORN, M. Forensic Science International, v. 377, p. 112593, 2025.
- Just-in-Time Fluid Flow Simulation on Mobile Devices Using OpenVisFlow and OpenLB TEUTSCHER, D.; KUMMERLANDER, A.; BUKREEV, F.; DORN, M.; KRAUSE, M. J. Applied Sciences, v. 14, p. 1784, 2024.
- Sodium propionate oral supplementation ameliorates depressive-like behavior through gut microbiome and histone 3 epigenetic regulation BEHRENS, L. M. P.; GASPAROTTO, J.; RAMPELOTTO, P. H.; ESCALONA, M. A. R.; SILVA, L. S.; CARAZZA-KESSLER, F. G.; BARBOSA, C. P.; CAMPOS, M. S.; DORN, M.; GELAIN, D. P.; MOREIRA, J. C. F. The Journal of Nutritional Biochemistry, v. 130, p. 109660, 2024.
- Optimization of single node load balancing for lattice Boltzmann method on heterogeneous high performance computers KUMMERLANDER, A.; BUKREEV, F.; TEUTSCHER, D.; DORN, M.; KRAUSE, M. J. Journal of Parallel and Distributed Computing, v. 206, p. 105169, 2025.
- Deep learning methods and applications in single-cell multimodal data integration NUNES, F. V. M.; BEHRENS, L. M. P.; WEIMER, R. D.; GONÇALVES, G. F.; DA SILVA FERNANDES, G.; DORN, M. Molecular Omics, v. 21, p. 1-15, 2025.
- Exploring students? understanding of evolutionary biology through large-scale national university entrance examinations TAVARES, G. M.; DORN, M.; BORTOLINI, M. C. International Journal of Science Education, v. 48, p. 1-14, 2025.
- Overview of the microbiome and resistome of swine manure in commercial piglet farms and its application in grazing soils DIAS, M.; BREYER, G. M.; TORRES, M. C.; WUADEN, C.; REBELATTO, R.; KICH, J. D.; DORN, M.; SIQUEIRA, F. M. Environmental Technology, v. 46, p. 1-11, 2025.
- Expanding acute stroke care coverage in resource-limited settings: A multi-objective approach based on facility location problems and NSGA-II DORNELES, L. D.; BOIANI, M.; CARBONERA, L. A.; DORN, M. Operations Research, Data Analytics and Logistics, v. 45, p. 200488, 2026.
- The MAPSTROKE analysis of the access to stroke reperfusion treatment and stroke units in Italy NICOLINI, E.; CIACCIARELLI, A.; FRANCHINI, E.; FRAINER, A. S.; FRAINER, A. S.; DORNELES, L. L.; BOIANI, M.; DORN, M.; SANTALUCIA, P.; CASO, V.; TONI, D.; CARBONERA, L. A. European Stroke Journal, v. 11, p. 1-20, 2026.
- Identification of biogeographically informative microssatelite markers for Brazilian Cannabis sativa samples: a Machine Learning approach for forensic origin prediction BETTIM, C. A. R.; RIBEIRO, L. O. P.; ALEGRIA, O. V. C.; SILVA, E. F. A.; SIQUEIRA, F. M.; CAMARGO, F. A. O.; ALHO, C. S.; DORN, M. International Journal of Legal Medicine, v. 140, p. 1-120, 2026.