Variable selection in linear regression

Charles Lindsey, Simon Sheather

Research output: Contribution to journalArticlepeer-review

79 Scopus citations

Abstract

We present a new Stata program, vselect, that helps users perform variable selection after performing a linear regression. Options for stepwise methods such as forward selection and backward elimination are provided. The user may specify Mallows's Cp, Akaike's information criterion, Akaike's corrected information criterion, Bayesian information criterion, or R 2 adjusted as the information criterion for the selection. When the user specifies the best subset option, the leaps-and-bounds algorithm (Furnival and Wilson, Technometrics 16:499-511) is used to determine the best subsets of each predictor size. All the previously mentioned information criteria are reported for each of these subsets. We also provide options for doing variable selection only on certain predictors (as in [R] nestreg) and support for weighted linear regression. All options are demonstrated on real datasets with varying numbers of predictors.

Original languageEnglish
Pages (from-to)650-669
Number of pages20
JournalStata Journal
Volume10
Issue number4
DOIs
StatePublished - 2010

Keywords

  • Nestreg
  • Regress
  • Variable selection
  • Vselect
  • st0213

ASJC Scopus subject areas

  • Mathematics (miscellaneous)

Fingerprint

Dive into the research topics of 'Variable selection in linear regression'. Together they form a unique fingerprint.

Cite this