Abstract
Constructing features from high-dimensional gene expression data is a critically important task for monitoring and predicting patients' diseases, or for knowledge discovery in computational molecular biology. The features need to capture the essential characteristics of the data to be maximally distinguishable. Moreover, the essential features usually lie in small or extremely low-dimensional subspaces, and it is crucial to find them for knowledge discovery and pattern classification. We present a computational method for extracting small or even extremely low-dimensional subspaces for multivariate feature screening and gene expression analysis using sparse optimization techniques. After we transform the feature screening problem into a convex optimization problem, we develop an efficient primal-dual interior-point method expressively for solving large-scale problems. The effectiveness of our method is confirmed by our experimental results. The computer programs will be publicly available.
Original language | English |
---|---|
Pages (from-to) | 1241-1252 |
Number of pages | 12 |
Journal | Journal of Computational Biology |
Volume | 16 |
Issue number | 9 |
DOIs | |
State | Published - Sep 1 2009 |
Keywords
- Feature screening
- Gene expression
- High-dimensional classification
- Large-scale optimization
- Low-dimensional subspaces
- Sparsity optimization
ASJC Scopus subject areas
- Modeling and Simulation
- Molecular Biology
- Genetics
- Computational Mathematics
- Computational Theory and Mathematics