rFSA: An R package for finding best subsets and interactions

Joshua Lambert, Liyu Gong, Corrine F. Elliott, Katherine Thompson, Arnold Stromberg

Research output: Contribution to journal › Article › peer-review

17 Scopus citations

Abstract

Herein we present the R package rFSA, which implements an algorithm for improved variable selection. The algorithm searches a data space for models of a user-specified form that are statistically optimal under a measure of model quality. Many iterations afford a set of feasible solutions (or candidate models) that the researcher can evaluate for relevance to his or her questions of interest. The algorithm can be used to formulate new or to improve upon existing models in bioinformatics, health care, and myriad other fields in which the volume of available data has outstripped researchers' practical and computational ability to explore larger subsets or higher-order interaction terms. The package accommodates linear and generalized linear models, as well as a variety of criterion functions such as Allen's PRESS and AIC. New modeling strategies and criterion functions can be adapted easily to work with rFSA.
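To make the search described in the abstract concrete, here is a minimal Python sketch of an exchange-style search over fixed-size variable subsets, repeated from several random starts to collect candidate models. It is an illustration under stated assumptions and not the rFSA package's code or API: the function names (`aic_linear`, `exchange_search`, `feasible_solutions`), the swap rule, and the Gaussian AIC formula (computed up to an additive constant) are all choices made for this example. It covers main-effect subsets of a linear model only; interaction terms and generalized linear models are left out.

```python
import numpy as np


def aic_linear(X, y, cols):
    """AIC, up to an additive constant, of an OLS fit with intercept on `cols`."""
    n = len(y)
    design = np.column_stack([np.ones(n), X[:, list(cols)]])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    rss = float(np.sum((y - design @ beta) ** 2))
    k = design.shape[1] + 1  # regression coefficients plus the error variance
    return n * np.log(rss / n) + 2 * k


def exchange_search(X, y, m, criterion, rng):
    """From a random size-m subset, apply the best single-variable swap until none helps."""
    p = X.shape[1]
    current = tuple(sorted(rng.choice(p, size=m, replace=False).tolist()))
    best = criterion(X, y, current)
    improved = True
    while improved:
        improved = False
        best_swap = None
        for i in range(m):              # position to swap out
            for j in range(p):          # variable to swap in
                if j in current:
                    continue
                cand = tuple(sorted(current[:i] + (j,) + current[i + 1:]))
                score = criterion(X, y, cand)
                if score < best - 1e-12:  # lower criterion value is better
                    best, best_swap = score, cand
                    improved = True
        if improved:
            current = best_swap
    return current, best


def feasible_solutions(X, y, m, criterion=aic_linear, n_starts=20, seed=0):
    """Run the search from several random starts; return distinct end points, best first."""
    rng = np.random.default_rng(seed)
    found = {}
    for _ in range(n_starts):
        subset, score = exchange_search(X, y, m, criterion, rng)
        found[subset] = score
    return sorted(found.items(), key=lambda kv: kv[1])


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(200, 8))
    # Synthetic response that depends only on columns 2 and 5 (illustrative data).
    y = 3.0 * X[:, 2] + 2.0 * X[:, 5] + rng.normal(size=200)
    for subset, score in feasible_solutions(X, y, m=2, n_starts=5):
        print(subset, round(score, 2))
```

Because `criterion` is an ordinary function argument, another measure of model quality, such as a PRESS-style leave-one-out statistic, could be swapped in without changing the search loop. Each random start ends at a subset that no single swap improves, so different starts may end at different candidate models for the researcher to compare.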

Original language: English
Pages (from-to): 295-308
Number of pages: 14
Journal: R Journal
Volume: 10
Issue number: 2
State: Published - 2019

Bibliographical note

Funding Information:
This research and package creation were supported in part by the Kentucky Biomedical Research Infrastructure Network and INBRE Grant (P20 RR16481) and a National Multiple Sclerosis Society Pilot Grant (PP-1609-25975).

Publisher Copyright:
© 2018 The R Journal.

ASJC Scopus subject areas

  • Statistics and Probability
  • Numerical Analysis
  • Statistics, Probability and Uncertainty
