Minimum reference set based feature selection for small sample classifications

Xue Wen Chen, Jong Cheol Jeong

Producción científica: Paperrevisión exhaustiva

23 Citas (Scopus)

Resumen

We address feature selection problems for classification of small samples and high dimensionality. A practical example is microarray-based cancer classification problems, where sample size is typically less than 100 and number of features is several thousands or higher. One of the commonly used methods in addressing this problem is recursive feature elimination (RFE) method, which utilizes the generalization capability embedded in support vector machines and is thus suitable for small samples problems. We propose a novel method using minimum reference set (MRS) generated by the nearest neighbor rule. MRS is the set of minimum number of samples that correctly classify all the training samples. It is related to structural risk minimization principle and thus leads to good generalization. The proposed MRS based method is compared to RFE method with several real datasets, and experimental results show that the MRS method produces better classification performance.

Idioma originalEnglish
Páginas153-160
Número de páginas8
DOI
EstadoPublished - 2007
Evento24th International Conference on Machine Learning, ICML 2007 - Corvalis, OR, United States
Duración: jun 20 2007jun 24 2007

Conference

Conference24th International Conference on Machine Learning, ICML 2007
País/TerritorioUnited States
CiudadCorvalis, OR
Período6/20/076/24/07

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Huella

Profundice en los temas de investigación de 'Minimum reference set based feature selection for small sample classifications'. En conjunto forman una huella única.

Citar esto