Identification of biological properties in organisms using machine learning techniques on whole genome sequences

Authors

  • Nicolás Ferella Universidad Nacional de La Plata, Argentina y Jefatura de Gabinete de Ministros de la República Argentina, Argentina
  • Pablo Pizio Universidad Nacional de La Plata, Argentina
  • Claudia Pons Universidad Nacional de La Plata, Argentina

Keywords:

artificial intelligence, genetics, big data, DNA

Abstract

The advance in technology and genome sequencing processes in the recent decades have made large volumes of biological data available to researchers from all over the world, which, due to the large scales, are difficult to analyze in their entirety. Therefore, it is intuitive to think of Artificial Intelligence to work with such information. In order to reduce the existing gap between the researchers and the Artificial Intelligence tools, a software was developed that allows the creation of a workspace for biological organisms, the processing of its corresponding genomes, and the creation and training of models of Machine Learning, everything using a simple (yet powerful) graphical interface.
The trained models are then analyzed to find which patterns determine the result of the property that is being investigated on the biological organism, finding in the process the genes with the greatest impact on the model’s predictions, allowing the researcher to subsequently analyze the desired genes in the laboratory, saving time and resources in the process.

Downloads

Published

2023-07-07

Issue

Section

EST - Concurso de Trabajos Estudiantiles

How to Cite

Ferella, N., Pizio, P., & Pons, C. (2023). Identification of biological properties in organisms using machine learning techniques on whole genome sequences. JAIIO, Jornadas Argentinas De Informática, 9(6), 218-234. https://revistas.unlp.edu.ar/JAIIO/article/view/18207