TY - JOUR
T1 - Improving disease prediction by incorporating family disease history in risk prediction models with large-scale genetic data
AU - Gim, Jungsoo
AU - Kim, Wonji
AU - Kwak, Soo Heon
AU - Choi, Hosik
AU - Park, Changyi
AU - Park, Kyong Soo
AU - Kwon, Sunghoon
AU - Park, Taesung
AU - Won, Sungho
N1 - Publisher Copyright:
© 2017 by the Genetics Society of America.
PY - 2017/11
Y1 - 2017/11
N2 - Despite the many successes of genome-wide association studies (GWAS), the known susceptibility variants identified by GWAS have modest effect sizes, leading to notable skepticism about the effectiveness of building a risk prediction model from large-scale genetic data. However, in contrast to genetic variants, the family history of diseases has been largely accepted as an important risk factor in clinical diagnosis and risk prediction. Nevertheless, the complicated structures of the family history of diseases have limited their application in clinical practice. Here, we developed a new method that enables incorporation of the general family history of diseases with a liability threshold model, and propose a new analysis strategy for risk prediction with penalized regression analysis that incorporates both large numbers of genetic variants and clinical risk factors. Application of our model to type 2 diabetes in the Korean population (1846 cases and 1846 controls) demonstrated that single-nucleotide polymorphisms accounted for 32.5% of the variation explained by the predicted risk scores in the test data set, and incorporation of family history led to an additional 6.3% improvement in prediction. Our results illustrate that family medical history provides valuable information on the variation of complex diseases and improves prediction performance.
AB - Despite the many successes of genome-wide association studies (GWAS), the known susceptibility variants identified by GWAS have modest effect sizes, leading to notable skepticism about the effectiveness of building a risk prediction model from large-scale genetic data. However, in contrast to genetic variants, the family history of diseases has been largely accepted as an important risk factor in clinical diagnosis and risk prediction. Nevertheless, the complicated structures of the family history of diseases have limited their application in clinical practice. Here, we developed a new method that enables incorporation of the general family history of diseases with a liability threshold model, and propose a new analysis strategy for risk prediction with penalized regression analysis that incorporates both large numbers of genetic variants and clinical risk factors. Application of our model to type 2 diabetes in the Korean population (1846 cases and 1846 controls) demonstrated that single-nucleotide polymorphisms accounted for 32.5% of the variation explained by the predicted risk scores in the test data set, and incorporation of family history led to an additional 6.3% improvement in prediction. Our results illustrate that family medical history provides valuable information on the variation of complex diseases and improves prediction performance.
KW - Family history
KW - Genetic variability in complex binary traits
KW - Liability threshold model
KW - Penalized prediction model
KW - Risk prediction in complex disease
UR - http://www.scopus.com/inward/record.url?scp=85032970069&partnerID=8YFLogxK
U2 - 10.1534/genetics.117.300283
DO - 10.1534/genetics.117.300283
M3 - Article
C2 - 28899997
AN - SCOPUS:85032970069
SN - 0016-6731
VL - 207
SP - 1147
EP - 1155
JO - Genetics
JF - Genetics
IS - 3
ER -