A top-down supervised learning approach to hierarchical multi-label classification in networks

Miguel Romero, Jorge Finke, Camilo Rocha

Research output: Contribution to journal › Article › peer-review

12 Scopus citations

Abstract

Node classification is the task of inferring or predicting missing node attributes from information available for other nodes in a network. This paper presents a general prediction model for hierarchical multi-label classification, where the attributes to be inferred can be specified as a strict partially ordered set (poset). The model follows a top-down classification approach that addresses hierarchical multi-label classification with supervised learning by building a local classifier per class. It is showcased with a case study on the prediction of gene functions for Oryza sativa Japonica, a variety of rice, and is compared to the Hierarchical Binomial-Neighborhood, a probabilistic model, in terms of prediction performance and computational cost. The results support the working hypothesis that the proposed model can achieve good levels of prediction efficiency while scaling up in relation to the state of the art.
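
As a rough illustration of the top-down scheme described in the abstract, the Python sketch below trains one binary classifier per class and predicts a class only when all of its parent classes have already been predicted. Everything named here is an assumption made for illustration: the nearest-centroid learner is a dependency-free stand-in (the keywords suggest the paper itself uses XGBoost), the function names and toy hierarchy are hypothetical, and training each local classifier only on examples annotated with all parent classes is one common policy, not necessarily the authors' choice.

```python
from graphlib import TopologicalSorter  # Python 3.9+


class CentroidClassifier:
    """Stand-in binary learner (nearest class centroid); not the paper's model."""

    def fit(self, X, y):
        self.centroids = {}
        for label in (0, 1):
            rows = [x for x, t in zip(X, y) if t == label]
            if rows:
                self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def predict_one(self, x):
        def sq_dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))
        return min(self.centroids, key=lambda label: sq_dist(self.centroids[label]))


def train_top_down(X, Y, parents):
    """Train one local classifier per class, visiting parents before children.

    X: list of feature vectors; Y: list of label sets;
    parents: dict mapping each class to the list of its parent classes.
    """
    models = {}
    for c in TopologicalSorter(parents).static_order():
        # Assumed policy: use only examples annotated with every parent of c.
        idx = [i for i, labels in enumerate(Y)
               if all(p in labels for p in parents.get(c, []))]
        if not idx:
            continue  # no training data; c is never predicted
        y = [1 if c in Y[i] else 0 for i in idx]
        models[c] = CentroidClassifier().fit([X[i] for i in idx], y)
    return models


def predict_top_down(x, models, parents):
    """Predict a label set that is consistent with the class hierarchy."""
    predicted = set()
    for c in TopologicalSorter(parents).static_order():
        if c not in models:
            continue
        # A class is considered only if all of its parents were predicted.
        if all(p in predicted for p in parents.get(c, [])):
            if models[c].predict_one(x) == 1:
                predicted.add(c)
    return predicted


# Hypothetical toy hierarchy: A is the root, D has two parents (B and C).
parents = {"A": [], "B": ["A"], "C": ["A"], "D": ["B", "C"]}
X = [[0.0, 0.0], [0.1, 0.2], [1.0, 0.0], [1.1, 0.1],
     [0.0, 1.0], [0.1, 1.1], [1.0, 1.0], [1.1, 0.9]]
Y = [set(), set(), {"A", "B"}, {"A", "B"},
     {"A", "C"}, {"A", "C"}, {"A", "B", "C", "D"}, {"A", "B", "C", "D"}]

models = train_top_down(X, Y, parents)
print(predict_top_down([1.05, 0.0], models, parents))
```

Because a class is evaluated only after all of its parents have been predicted, every predicted label set is closed under the hierarchy, which is the main appeal of a top-down design over independent per-class predictions.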

Original language: English
Article number: 8
Journal: Applied Network Science
Volume: 7
Issue number: 1
State: Published - Dec 2022

Keywords

  • Gene function prediction
  • Hierarchical classification
  • Oryza sativa
  • Supervised learning
  • Top-down approach
  • XGBoost
