TY - JOUR
T1 - Novel strategy for disease risk prediction incorporating predicted gene expression and DNA methylation data
T2 - a multi-phased study of prostate cancer
AU - Wu, Chong
AU - Zhu, Jingjing
AU - King, Austin
AU - Tong, Xiaoran
AU - Lu, Qing
AU - Park, Jong Y.
AU - Wang, Liang
AU - Gao, Guimin
AU - Deng, Hong Wen
AU - Yang, Yaohua
AU - Knudsen, Karen E.
AU - Rebbeck, Timothy R.
AU - Long, Jirong
AU - Zheng, Wei
AU - Pan, Wei
AU - Conti, David V.
AU - Haiman, Christopher A.
AU - Wu, Lang
N1 - Publisher Copyright:
© 2021 The Authors. Cancer Communications published by John Wiley & Sons Australia, Ltd. on behalf of Sun Yat-sen University Cancer Center
PY - 2021/12
Y1 - 2021/12
N2 - Background: DNA methylation and gene expression are known to play important roles in the etiology of human diseases such as prostate cancer (PCa). However, it has not yet been possible to incorporate information of DNA methylation and gene expression into polygenic risk scores (PRSs). Here, we aimed to develop and validate an improved PRS for PCa risk by incorporating genetically predicted gene expression and DNA methylation, and other genomic information using an integrative method. Methods: Using data from the PRACTICAL consortium, we derived multiple sets of genetic scores, including those based on available single-nucleotide polymorphisms through widely used methods of pruning and thresholding, LDpred, LDpred-funt, AnnoPred, and EBPRS, as well as PRS constructed using the genetically predicted gene expression and DNA methylation through a revised pruning and thresholding strategy. In the tuning step, using the UK Biobank data (1458 prevalent cases and 1467 controls), we selected PRSs with the best performance. Using an independent set of data from the UK Biobank, we developed an integrative PRS combining information from individual scores. Furthermore, in the testing step, we tested the performance of the integrative PRS in another independent set of UK Biobank data of incident cases and controls. Results: Our constructed PRS had improved performance (C statistics: 76.1%) over PRSs constructed by individual benchmark methods (from 69.6% to 74.7%). Furthermore, our new PRS had much higher risk assessment power than family history. The overall net reclassification improvement was 69.0% by adding PRS to the baseline model compared with 12.5% by adding family history. Conclusions: We developed and validated a new PRS which may improve the utility in predicting the risk of developing PCa. Our innovative method can also be applied to other human diseases to improve risk prediction across multiple outcomes.
AB - Background: DNA methylation and gene expression are known to play important roles in the etiology of human diseases such as prostate cancer (PCa). However, it has not yet been possible to incorporate information of DNA methylation and gene expression into polygenic risk scores (PRSs). Here, we aimed to develop and validate an improved PRS for PCa risk by incorporating genetically predicted gene expression and DNA methylation, and other genomic information using an integrative method. Methods: Using data from the PRACTICAL consortium, we derived multiple sets of genetic scores, including those based on available single-nucleotide polymorphisms through widely used methods of pruning and thresholding, LDpred, LDpred-funt, AnnoPred, and EBPRS, as well as PRS constructed using the genetically predicted gene expression and DNA methylation through a revised pruning and thresholding strategy. In the tuning step, using the UK Biobank data (1458 prevalent cases and 1467 controls), we selected PRSs with the best performance. Using an independent set of data from the UK Biobank, we developed an integrative PRS combining information from individual scores. Furthermore, in the testing step, we tested the performance of the integrative PRS in another independent set of UK Biobank data of incident cases and controls. Results: Our constructed PRS had improved performance (C statistics: 76.1%) over PRSs constructed by individual benchmark methods (from 69.6% to 74.7%). Furthermore, our new PRS had much higher risk assessment power than family history. The overall net reclassification improvement was 69.0% by adding PRS to the baseline model compared with 12.5% by adding family history. Conclusions: We developed and validated a new PRS which may improve the utility in predicting the risk of developing PCa. Our innovative method can also be applied to other human diseases to improve risk prediction across multiple outcomes.
KW - integrative models
KW - polygenic risk scores
KW - predicted DNA methylation
KW - predicted gene expression
KW - prostate cancer
KW - risk prediction
UR - http://www.scopus.com/inward/record.url?scp=85114793780&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85114793780&partnerID=8YFLogxK
U2 - 10.1002/cac2.12205
DO - 10.1002/cac2.12205
M3 - Article
C2 - 34520132
AN - SCOPUS:85114793780
SN - 1000-467X
VL - 41
SP - 1387
EP - 1397
JO - Cancer Communications
JF - Cancer Communications
IS - 12
ER -