Abstract
Background: Feature selection and gene set analysis are of increasing interest in the field of bioinformatics. While these two approaches have been developed for different purposes, we describe how some gene set analysis methods can be utilized to conduct feature selection. Methods: We adopted a gene set analysis method, the significance analysis of microarray gene set reduction (SAMGSR) algorithm, to carry out feature selection for longitudinal gene expression data. Results: Using a real-world application and simulated data, it is demonstrated that the proposed SAMGSR extension outperforms other relevant methods. In this study, we illustrate that a gene's expression profiles over time can be regarded as a gene set and then a suitable gene set analysis method can be utilized directly to select relevant genes associated with the phenotype of interest over time. Conclusions: We believe this work will motivate more research to bridge feature selection and gene set analysis, with the development of novel algorithms capable of carrying out feature selection for longitudinal gene expression data.
Original language | English |
---|---|
Article number | 115 |
Journal | BMC Medical Informatics and Decision Making |
Volume | 18 |
DOIs | |
State | Published - Dec 7 2018 |
Bibliographical note
Publisher Copyright:© 2018 The Author(s).
Funding
Publication of this article was sponsored by a grant (No. 31401123) from the National Natural Science Foundation of China.
Funders | Funder number |
---|---|
National Natural Science Foundation of China (NSFC) |
Keywords
- Core subset
- Feature selection
- Gene set analysis
- Longitudinal microarray data
- Significance analysis of microarray (SAM)
ASJC Scopus subject areas
- Health Policy
- Health Informatics
- Computer Science Applications