Supervised Gene Function Prediction Using Spectral Clustering on Gene Co-expression Networks

Miguel Romero, Óscar Ramírez, Jorge Finke, Camilo Rocha

Producción: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

2 Citas (Scopus)

Resumen

Gene annotation addresses the problem of predicting unknown functions that are associated to the genes of a specific organism (e.g., biological processes). Despite recent advances, the cost and time demanded by annotation procedures that rely largely on in vivo biological experiments remain prohibitively high. This paper presents an in silico approach to the annotation of genes that follows a network-based representation, and combines techniques from multivariate statistics (spectral clustering) and machine learning (gradient boosting). Spectral clustering is used to enrich the gene co-expression network (GCN) with currently known gene annotations. Gradient boosting is trained on features of the GCN to build an estimator of the probability that a gene is involved in a given biological process. The proposed approach is applied to a case study on Zea mays, one of the world’s most dominant and productive crop. Broadly speaking, the main results illustrate how computational experimentation narrows down the time and costs in efforts to annotate the functions of genes. More specifically, the results highlight the importance of network science, multivariate statistics, and machine learning techniques in reducing types I and II prediction errors.

Idioma originalInglés
Título de la publicación alojadaComplex Networks and Their Applications X - Volume 2, Proceedings of the 10th International Conference on Complex Networks and Their Applications COMPLEX NETWORKS 2021
EditoresRosa Maria Benito, Chantal Cherifi, Hocine Cherifi, Esteban Moro, Luis M. Rocha, Marta Sales-Pardo
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas652-663
Número de páginas12
ISBN (versión impresa)9783030934125
DOI
EstadoPublicada - 2022
Evento10th International Conference on Complex Networks and Their Applications, COMPLEX NETWORKS 2021 - Madrid, Espana
Duración: 30 nov. 202102 dic. 2021

Serie de la publicación

NombreStudies in Computational Intelligence
Volumen1016
ISSN (versión impresa)1860-949X
ISSN (versión digital)1860-9503

Conferencia

Conferencia10th International Conference on Complex Networks and Their Applications, COMPLEX NETWORKS 2021
País/TerritorioEspana
CiudadMadrid
Período30/11/2102/12/21

Huella

Profundice en los temas de investigación de 'Supervised Gene Function Prediction Using Spectral Clustering on Gene Co-expression Networks'. En conjunto forman una huella única.

Citar esto