Abstract
Principal components (PCs) are widely used in statistics and refer to a relatively small number of uncorrelated variables derived from an initial pool of variables, while explaining as much of the total variance as possible. Also in statistical genetics, principal component analysis (PCA) is a popular technique. To achieve optimal results, a thorough understanding about the different implementations of PCA is required and their impact on study results, compared to alternative approaches. In this review, we focus on the possibilities, limitations and role of PCs in ancestry prediction, genome-wide association studies, rare variants analyses, imputation strategies, meta-analysis and epistasis detection. We also describe several variations of classic PCA that deserve increased attention in statistical genetics applications.
Original language | English |
---|---|
Pages (from-to) | 2200-2216 |
Number of pages | 17 |
Journal | Briefings in Bioinformatics |
Volume | 20 |
Issue number | 6 |
DOIs | |
State | Published - Nov 1 2019 |
Bibliographical note
Publisher Copyright:© 2018 The Author(s) 2018. Published by Oxford University Press. All rights reserved.
Keywords
- exploration and prediction
- population stratification
- principal component analysis
- statistical genetics
ASJC Scopus subject areas
- Information Systems
- Molecular Biology