TY - JOUR
T1 - GOCompare: An R package to compare functional enrichment analysis between two species
T2 - An R package to compare functional enrichment analysis between two species
AU - Sosa, Chrystian C.
AU - Clavijo-Buriticá, Diana Carolina
AU - García-Merchán, Victor Hugo
AU - López-Rozo, Nicolas
AU - Riccio-Rengifo, Camila
AU - Diaz, Maria Victoria
AU - Londoño, David Arango
AU - Quimbaya, Mauricio Alberto
N1 - Publisher Copyright: © 2022
PY - 2023/1
Y1 - 2023/1
N2 - Functional enrichment analysis is a cornerstone of bioinformatics because it makes it possible to identify functional information using a gene list as the source. Several tools are available to compare gene ontology (GO) terms, based on a directed acyclic graph structure or on content-based algorithms, which are time-consuming and require a priori information about GO terms. Nevertheless, quantitative procedures to compare GO terms among gene lists and species are not available. Here we present a computational procedure, implemented in R, to infer functional information derived from comparative strategies. GOCompare provides a framework for functional comparative genomics starting from comparable lists of GO terms. The program uses functional enrichment analysis (FEA) results and implements graph theory to identify statistically relevant GO terms for both GO categories and the analyzed species. Thus, GOCompare makes it possible to find new functional information, complementing current FEA approaches and extending their use to a comparative perspective. To test our approach, GO terms were obtained for a list of aluminum tolerance-associated genes in Oryza sativa subsp. japonica and their orthologues in Arabidopsis thaliana. GOCompare was able to detect functional similarities for reactive oxygen species and ion binding capabilities, which are common in plants as molecular mechanisms for tolerating aluminum toxicity. Consequently, the R package performed well when applied to complex datasets, making it possible to establish hypotheses that might explain a biological process from a functional perspective and to narrow down the possible landscapes for designing wet lab experiments.
KW - Aluminum tolerance
KW - Comparative genomics
KW - Geneset enrichment analysis
KW - Undirected graphs
UR - http://www.scopus.com/inward/record.url?scp=85143343770&partnerID=8YFLogxK
U2 - 10.1016/j.ygeno.2022.110528
DO - 10.1016/j.ygeno.2022.110528
M3 - Article
C2 - 36462728
AN - SCOPUS:85143343770
SN - 0888-7543
VL - 115
JO - Genomics
JF - Genomics
IS - 1
M1 - 110528
ER -