Predicting the Pathway Involvement of Compounds Annotated in the Reactome Knowledgebase

Erik D. Huckvale, Hunter N.B. Moseley

Research output: Contribution to journalArticlepeer-review

Abstract

Background/Objectives: Pathway annotations of non-macromolecular (relatively small) biomolecules facilitate biological and biomedical interpretation of metabolomics datasets. However, low pathway annotation levels of detected biomolecules hinder this type of interpretation. Thus, predicting the pathway involvement of detected but unannotated biomolecules has a high potential to improve metabolomics data analysis and omics integration. Past publications have only made use of the Kyoto Encyclopedia of Genes and Genomes-derived datasets to develop machine learning models to predict pathway involvement. However, to our knowledge, the Reactome knowledgebase has not been utilized to develop these types of predictive models. Methods: We created a dataset ready for machine learning using chemical representations of all pathway-annotated compounds available from the Reactome knowledgebase. Next, we trained and evaluated a multilayer perceptron binary classifier using combined metabolite-pathway paired feature vectors engineered from this new dataset. Results: While models trained on a prior corresponding KEGG dataset with 502 pathways scored a mean Matthew’s correlation coefficient (MCC) of 0.847 and a 0.0098 standard deviation, the models trained on the Reactome dataset with 3985 pathways demonstrated improved performance with a mean MCC of 0.916, but with a higher standard deviation of 0.0149. Conclusions: These results indicate that the pathways in Reactome can also be effectively predicted, greatly increasing the number of human-defined pathways available for prediction.

Original languageEnglish
Article number161
JournalMetabolites
Volume15
Issue number3
DOIs
StatePublished - Mar 2025

Bibliographical note

Publisher Copyright:
© 2025 by the authors.

Keywords

  • binary classification
  • biochemistry
  • machine learning
  • metabolite
  • multilayer perceptron
  • pathway
  • reactome
  • supervised learning

ASJC Scopus subject areas

  • Endocrinology, Diabetes and Metabolism
  • Biochemistry
  • Molecular Biology

Fingerprint

Dive into the research topics of 'Predicting the Pathway Involvement of Compounds Annotated in the Reactome Knowledgebase'. Together they form a unique fingerprint.

Cite this