Incorporating Pathway Information into Feature Selection towards Better Performed Gene Signatures

Suyan Tian, Chi Wang, Bing Wang

Research output: Contribution to journalReview articlepeer-review

12 Scopus citations


To analyze gene expression data with sophisticated grouping structures and to extract hidden patterns from such data, feature selection is of critical importance. It is well known that genes do not function in isolation but rather work together within various metabolic, regulatory, and signaling pathways. If the biological knowledge contained within these pathways is taken into account, the resulting method is a pathway-based algorithm. Studies have demonstrated that a pathway-based method usually outperforms its gene-based counterpart in which no biological knowledge is considered. In this article, a pathway-based feature selection is firstly divided into three major categories, namely, pathway-level selection, bilevel selection, and pathway-guided gene selection. With bilevel selection methods being regarded as a special case of pathway-guided gene selection process, we discuss pathway-guided gene selection methods in detail and the importance of penalization in such methods. Last, we point out the potential utilizations of pathway-guided gene selection in one active research avenue, namely, to analyze longitudinal gene expression data. We believe this article provides valuable insights for computational biologists and biostatisticians so that they can make biology more computable.

Original languageEnglish
Article number2497509
JournalBioMed Research International
StatePublished - 2019

Bibliographical note

Publisher Copyright:
© 2019 Suyan Tian et al.

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology


Dive into the research topics of 'Incorporating Pathway Information into Feature Selection towards Better Performed Gene Signatures'. Together they form a unique fingerprint.

Cite this